The Webis Group contributes to information retrieval, natural language processing, machine learning, and symbolic AI.
As for German-specific models trained on this data... stay tuned 👀
As for German-specific models trained on this data... stay tuned 👀
🌐 Web: Wikipedia, GitHub, social media
💬 Political: Parliamentary proceedings, speeches
⚖️ Legal: Court decisions, federal & EU law
📰 News: Newspaper archives
🏦 Economics: public tenders
📚 Cultural: Digital heritage collections
🔬 Scientific: Papers, books, journals
🌐 Web: Wikipedia, GitHub, social media
💬 Political: Parliamentary proceedings, speeches
⚖️ Legal: Court decisions, federal & EU law
📰 News: Newspaper archives
🏦 Economics: public tenders
📚 Cultural: Digital heritage collections
🔬 Scientific: Papers, books, journals
✅ Every document has verifiable usage rights (min. CC-BY-SA 4.0 and allows commercial use)
✅ Full institutional provenance for reduced compliance risks
✅ Systematic PII removal + quality filtering, ready for training
✅ Rich metadata for downstream customization
✅ Every document has verifiable usage rights (min. CC-BY-SA 4.0 and allows commercial use)
✅ Full institutional provenance for reduced compliance risks
✅ Systematic PII removal + quality filtering, ready for training
✅ Rich metadata for downstream customization
Our new axioms are integrated with ir_axioms: github.com/webis-de/ir_...
Nice to see axiomatic IR gaining momentum.
Our new axioms are integrated with ir_axioms: github.com/webis-de/ir_...
Nice to see axiomatic IR gaining momentum.
📄 Preprint: arxiv.org/abs/2407.21515
💻 Code: github.com/webis-de/ada...
📄 Preprint: arxiv.org/abs/2407.21515
💻 Code: github.com/webis-de/ada...
We conduct a series of exploratory analyses to show how LLM texts differ from… 2/3
We conduct a series of exploratory analyses to show how LLM texts differ from… 2/3
Submissions are open until May 10th and we look forward to your contributions.
Submissions are open until May 10th and we look forward to your contributions.
dl.acm.org/doi/10.1145/...
dl.acm.org/doi/10.1145/...
Hence, we have analyzed how well LLMs can blend product placements with "organic" responses and whether users are able to identify the ads.
dl.acm.org/doi/10.1145/...
Hence, we have analyzed how well LLMs can blend product placements with "organic" responses and whether users are able to identify the ads.
dl.acm.org/doi/10.1145/...