Corpus of Legal Texts
The legal-1.0 corpus of legal regulations of the Slovak Republic was released in 2011 and contains 146 million tokens. It was prepared in collaboration with the Ministry of Justice of the Slovak Republic.
The deduplicated version, legal-1.1, has duplicate content removed and contains 48 977 876 tokens.