Legislative Text Processing

Legislative text processing, data visualization

Legislative texts (laws, decrees, verdicts, etc.) are more formalised than those of ordinary language. From a linguistic aspect they may be characterised by the rigorous terminology, the rigid syntax of the cross references to other laws. Because of these, it is quite uncomfortable to read those texts even for the expert (not to mention the layman). The semi-automatic processing we have developed is based on those features of legislative texts. After processing the law texts we can automatically

find fragmented parts (numbered or bulleted lists, incomplete sentences) and make them complete;
recognize the internal hierarchical structure (books, chapters, sections, etc.);
find and address the internal references (i.e., links to other parts of the very same document).

The results of those analyses are also displayed, where it is possible, by data-visualization tools.

Legislative corpus

In order to be able to perform automatic or semi-automatic analyses, we have created a corpus of legal texts. The corpus currently contains nearly fifty laws. The raw texts of various types of legislation were databased so that a structural unit (section) constitutes a db-record. The database form allows to perform various analysis. To display the results of them we use data-visualization tools.

legislative corpus (Hungarian) ()

Researchers


	Gábor Hamp	Réka Markovich	István Szakadát

Articles

Hamp Gábor, Syi, Markovich Réka (2016). Jogszabályok hivatkozásainak automatikus felismerése és a belső hivatkozások struktúrája. In: Tanács Attila, Varga Viktor, Vincze Veronika (szerk.) XII. Magyar Számítógépes Nyelvészeti Konferencia. pp. 220-229.
Markovich, Réka, Syi, Hamp, Gábor (2015). Elliptical Lists in Legislative Texts. Proceedings of Fifteenth International Conference on Artificial Intelligence and Law (ICAIL 2015) ACM, pp. 192-195.
Markovich Réka, Hamp Gábor, Syi (2015). Jogszabályszövegek gépi elemzésének tanulságai JOGELMÉLETI SZEMLE (2) pp. 64-73.
Syi, Hamp Gábor, Markovich Réka (2015). Goody-listák jogszabályszövegekben. Három tételben. JEL-KÉP: KOMMUNIKÁCIÓ, KÖZVÉLEMÉNY, MÉDIA (3) pp. 13-24.
Hamp Gábor, Syi, Markovich Réka (2015). Elliptikus listák jogszabályszövegekben. In: Tanács Attila, Varga Viktor, Vincze Veronika (szerk.) XI. Magyar Számítógépes Nyelvészeti Konferencia. pp. 273-280.
Markovich Réka, Hamp Gábor, Syi (2014). A kondicionálisok problémája jogszabályszövegekben. In: Tanács Attila, Varga Viktor, Vincze Veronika (szerk.) X. Magyar Számítógépes Nyelvészeti Konferencia. pp. 295-302.