The corpus isn’t just readable 👁️ — it’s also fully downloadable!
Now hosted on @hf.co :
🧾 JSONL dataset → huggingface.co/datasets/com...
📂 More formats (ALTO, TEI, etc.) coming soon — we’re uploading the GBs as we speak.
The corpus isn’t just readable 👁️ — it’s also fully downloadable!
Now hosted on @hf.co :
🧾 JSONL dataset → huggingface.co/datasets/com...
📂 More formats (ALTO, TEI, etc.) coming soon — we’re uploading the GBs as we speak.