commoncrawl.org/blog/common-...
commoncrawl.org/blog/common-...
Joint work by DFKI SLT incl. Fabio Barth, Raia Abu Ahmad, @malteos.bsky.social @pjox.bsky.social
Joint work by DFKI SLT incl. Fabio Barth, Raia Abu Ahmad, @malteos.bsky.social @pjox.bsky.social
Registering is easy! All the details are on the shared task webpage: wmdqs.org/shared-task/
Deadline: July 23, 2025 (AoE) ⏰
Registering is easy! All the details are on the shared task webpage: wmdqs.org/shared-task/
Deadline: July 23, 2025 (AoE) ⏰
commoncrawl.org/blog/the-fir...
commoncrawl.org/blog/the-fir...
commoncrawl.org/blog/common-...
commoncrawl.org/blog/common-...
If you are in NYC, it would be great to see you there!
lu.ma/p0a1scde
If you are in NYC, it would be great to see you there!
lu.ma/p0a1scde
We are organising the 1st Workshop on Multilingual Data Quality Signals with @mlcommons.org and @eleutherai.bsky.social, held in tandem with @colmweb.org. Submit your research on multilingual data quality!
Submission deadline is 23 June, more info: wmdqs.org
We are organising the 1st Workshop on Multilingual Data Quality Signals with @mlcommons.org and @eleutherai.bsky.social, held in tandem with @colmweb.org. Submit your research on multilingual data quality!
Submission deadline is 23 June, more info: wmdqs.org
Please donate if you can! Every donation no matter how small, helps immensely.
marathon-paris.dossards-solidaires.org/fundraisers/...
Please donate if you can! Every donation no matter how small, helps immensely.
marathon-paris.dossards-solidaires.org/fundraisers/...
@nettarkivet.bsky.social | #iipcGA25 | #webarchiving
@nettarkivet.bsky.social | #iipcGA25 | #webarchiving
Thank you to the @commoncrawl.bsky.social Foundation for all their hard work. Onwards! @pjox.bsky.social - So great to meet in person.
Thank you to the @commoncrawl.bsky.social Foundation for all their hard work. Onwards! @pjox.bsky.social - So great to meet in person.
cc-downloader is still under active development, so if you find any issues or would like to submit a feature request, please visit its GitHub repository at github.com/commoncrawl/....
cc-downloader is still under active development, so if you find any issues or would like to submit a feature request, please visit its GitHub repository at github.com/commoncrawl/....
marathon-paris.dossards-solidaires.org/fundraisers/...
marathon-paris.dossards-solidaires.org/fundraisers/...