Home

Grex: Automatic Grammar Extraction #

Grex is a tool for automatic grammar rule extraction from treebanks using machine learning.

It was conceived by Santiago Herrera, Caio Corro, Bruno Guillaume and Sylvain Kahane. A full description of the method can be found in the paper (the code is deprecated): https://aclanthology.org/2024.lrec-main.1314/

The maintained code is here.

If you use this software, please cite the following work:

@inproceedings{herrera-etal-2024-sparse,
    title = "Sparse Logistic Regression with High-order Features for Automatic Grammar Rule Extraction from Treebanks",
    author = "Herrera, Santiago and Corro, Caio and Kahane, Sylvain",
    editor = "Calzolari, Nicoletta and Kan, Min-Yen and Hoste, Veronique and Lenci, Alessandro and Sakti, Sakriani  and Xue, Nianwen",
    booktitle = "Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)",
    month = may,
    year = "2024",
    address = "Torino, Italia",
    publisher = "ELRA and ICCL",
    url = "https://aclanthology.org/2024.lrec-main.1314/",
    pages = "15114--15125"
}

If you use Grex for contrastive studies, please cite the most pertinent of the following papers:

@inproceedings{herrera_et_al2024contrastive,
  author = {Santiago Herrera and Ioana-Madalina Silai and Caio Corro and Bruno Guillaume and Sylvain Kahane},
  title = {Building quantitative contrastive grammars from syntactic treebanks},
  booktitle = {Langues \& Langages ร  la croisรฉe des Disciplines (LLcD)},
  year = {2024},
  address = {Paris}
}
@inproceedings{herrera-etal-2025-extraction,
    title = "Extraction of Contrastive Rules from Syntactic Treebanks: A Case Study in {R}omance Languages",
    author = "Herrera, Santiago  and Silai, Ioana-Madalina and Corro, Caio and Guillaume, Bruno  and Kahane, Sylvain",
    editor = "Chen, Xinying  and Wang, Yaqin",
    booktitle = "Proceedings of the Third Workshop on Quantitative Syntax (QUASY, SyntaxFest 2025)",
    month = aug,
    year = "2025",
    address = "Ljubljana, Slovenia",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2025.quasy-1.5/",
    pages = "26--38",
    ISBN = "979-8-89176-293-0"
}