Lightnews — Scholar-powered news

Reposted by Ondrej Dusek

Institute of Formal and Applied Linguistics

@ufal.mff.cuni.cz

🔤 Pretraining Language Models with LoRA and Artificial Languages
Nalin Kumar, Mateusz Lango, @tuetschek.bsky.social t
aclanthology.org/2025.babylm-...
Constructed artificial languages with LoRA affects language model development.

Pretraining Language Models with LoRA and Artificial Languages

Nalin Kumar, Mateusz Lango, Ondrej Dusek. Proceedings of the First BabyLM Workshop. 2025.

aclanthology.org

November 11, 2025 at 2:37 PM

Reposted by Ondrej Dusek

Institute of Formal and Applied Linguistics

@ufal.mff.cuni.cz

🎓 You are an LLM teaching a smaller model everything you know: Multi-task pretraining of language models with LLM-designed study plans
Wiktor Kamzela, Mateusz Lango, @tuetschek.bsky.social
aclanthology.org/2025.babylm-...

You are an LLM teaching a smaller model everything you know: Multi-task pretraining of language models with LLM-designed study plans

Wiktor Kamzela, Mateusz Lango, Ondrej Dusek. Proceedings of the First BabyLM Workshop. 2025.

aclanthology.org

November 11, 2025 at 2:37 PM

Reposted by Ondrej Dusek

Institute of Formal and Applied Linguistics

@ufal.mff.cuni.cz

📚 SRS-Stories: Vocabulary-constrained multilingual story generation for language learning
Wiktor Kamzela, Mateusz Lango & @toonietuesday.bsky.social
aclanthology.org/2025.emnlp-i...
LLM stories teach vocab while reviewing learned words via Spaced Repetition-more grammatical than standard generation

November 7, 2025 at 8:54 PM

Reposted by Ondrej Dusek

Institute of Formal and Applied Linguistics

@ufal.mff.cuni.cz

🤖 LLM Agents Implement an NLG System from Scratch
Mateusz Lango, Ondrej Dusek
aclanthology.org/2025.emnlp-i...
LLM agents can autonomously build interpretable, rule-based RDF-to-text generators from scratch, combining the LLMs with the transparency and reliability of traditional rule-based systems.

LLM Agents Implement an NLG System from Scratch: Building Interpretable Rule-Based RDF-to-Text Generators

Mateusz Lango, Ondrej Dusek. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track. 2025.

aclanthology.org

November 7, 2025 at 8:54 PM

Reposted by Ondrej Dusek

Institute of Formal and Applied Linguistics

@ufal.mff.cuni.cz

👥 Can Large Language Models Personalize Dialogues to Generational Styles?
P. Balestrucci, @tuetschek.bsky.social, L. Anselma, A. Mazzei
aclanthology.org/2025.finding...
Can LLMs adapt dialogues to generational styles? We show with P-MultiWoZ that models capture patterns from Boomers to Gen Z.

Can Large Language Models Personalize Dialogues to Generational Styles?

Pier Felice Balestrucci, Ondrej Dusek, Luca Anselma, Alessandro Mazzei. Findings of the Association for Computational Linguistics: EMNLP 2025. 2025.

aclanthology.org

November 7, 2025 at 8:54 PM

Reposted by Ondrej Dusek

Institute of Formal and Applied Linguistics

@ufal.mff.cuni.cz

📊 Real-World Summarization: When Evaluation Reaches Its Limits
@patuchen.bsky.social , @tuetschek.bsky.social , @saad.me.uk
aclanthology.org/2025.finding...
For hotel highlights, metrics like word overlap surprisingly match human judgments better than complex methods. LLMs unreliable as evaluators.

Real-World Summarization: When Evaluation Reaches Its Limits

Patrícia Schmidtová, Ondrej Dusek, Saad Mahamood. Findings of the Association for Computational Linguistics: EMNLP 2025. 2025.

aclanthology.org

November 7, 2025 at 8:54 PM

Reposted by Ondrej Dusek

Alison

@alisonbee.bsky.social

October 25, 2025 at 11:32 PM

Reposted by Ondrej Dusek

SIGGEN

@siggen.bsky.social

The registration page for #INLG2025 is now live! Join us in Vietnam at the Oct 29 - Nov 2 for the best conference on #NaturalLanguageGeneration

2025.inlgmeeting.org/registration...

Curious to see what will be presented? Check out this list of accepted papers! 2025.inlgmeeting.org/accepted-pap...

Picture of the One Pillar Pagoda in Hanoi, a pagoda raised up over a green pond surrounded by greenery

September 16, 2025 at 12:15 PM

Reposted by Ondrej Dusek

Svitlana Vakulenko

@vendi12.bsky.social

Check out the slides from our SCAI'2025 #convsearch workshop collocated with @ijcai.org #IJCAI2025 on LLMs, retrieval & QA, recommendations, negotiations, evaluation and transparency

scai.info/scai-2025

@patuchen.bsky.social @maik-froebe.bsky.social @tuetschek.bsky.social @mila-quebec.bsky.social

SCAI 2025

Online Event on Search-Oriented Conversational AI.

scai.info

September 9, 2025 at 10:34 AM

Reposted by Ondrej Dusek

Ivan Kartáč

@ivankartac.bsky.social

Our paper "OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs" has been accepted to #INLG2025 conference!

You can read the preprint here: arxiv.org/abs/2503.11858

August 23, 2025 at 4:36 PM

Reposted by Ondrej Dusek

Institute of Formal and Applied Linguistics

@ufal.mff.cuni.cz

FreshTab: Sourcing Fresh Resources for Table-to-Text Generation Evaluation
by @navitas.bsky.social, ‪@oplatek.bsky.social‬, ‪@zdenekkasner.bsky.social‬, @tuetschek.bsky.social .bsky.social‬

July 31, 2025 at 1:30 PM

Reposted by Ondrej Dusek

Institute of Formal and Applied Linguistics

@ufal.mff.cuni.cz

ReproHum #0669-08: Reproducing Sentiment Transfer Evaluation
by @navitas.bsky.social, M. Lango, @patuchen.bsky.social, @tuetschek.bsky.social
Challenge to reproduce human evaluations from NLP papers, testing the reproducibility of evaluation studies

July 31, 2025 at 1:30 PM

Reposted by Ondrej Dusek

Institute of Formal and Applied Linguistics

@ufal.mff.cuni.cz

OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs
by @ivankartac.bsky.social, M. Lango, @tuetschek.bsky.social
arxiv.org/abs/2503.11858
Open-source NLG evaluation metric that explains errors and matches human judgments without proprietary models

July 31, 2025 at 1:30 PM

Reposted by Ondrej Dusek

Institute of Formal and Applied Linguistics

@ufal.mff.cuni.cz

#ACL2025NLP in Vienna 🇦🇹 starts today with 23 🤯 @ufal-cuni.bsky.social folks presenting their work both at the main conference and workshops. Check out our main conference papers today and on Wednesday 👇

July 28, 2025 at 7:27 AM

Reposted by Ondrej Dusek

Mark Riedl

@markriedl.bsky.social

ICML found hidden prompts in accepted papers. They have released a statement icml.cc/Conferences/...

Yes, it’s unacceptable. So is using an LLM to review a paper. Peer review is so broken.

July 24, 2025 at 1:44 AM

Reposted by Ondrej Dusek

TBSkyen

@tbskyen.com

#Eurovision is tonight, and here's a hilarious fun fact about it: Israel has started a massive offensive against civilian populations in Gaza with the explicit aim of conquering the entire territory and ethnically cleansing its population, and Eurovision has aggressively refused to give a shit.

May 17, 2025 at 10:48 PM

Reposted by Ondrej Dusek

Tim Onion

@bencollins.bsky.social

It is a little weird to me countries aren’t more aggressively, formally trying to take advantage of the U.S. science brain drain. Once in a lifetime opportunity to buy low on Non-Dumbass Americans with PhDs who just wanna look into microscopes and quietly cure ass cancer as our country eats shit.

May 4, 2025 at 8:54 PM

Reposted by Ondrej Dusek

Institute of Formal and Applied Linguistics

@ufal.mff.cuni.cz

The 👉Machine Learning Prague 2025👈 is happening right now! Today, @patuchen.bsky.social and @navitas.bsky.social presented their posters on text generation with LLMs. Also, don't miss @tuetschek.bsky.social's invited talk tomorrow at 11 a.m.

April 29, 2025 at 2:08 PM

Reposted by Ondrej Dusek

Tokenization Workshop (TokShop) @ICML2025

@tokshop.bsky.social

🚨 NEW WORKSHOP ALERT 🚨

We're thrilled to announce the first-ever Tokenization Workshop (TokShop) at #ICML2025 @icmlconf.bsky.social! 🎉

Submissions are open for work on tokenization across all areas of machine learning.

📅 Submission deadline: May 30, 2025
🔗 tokenization-workshop.github.io

Tokenization Workshop @ ICML 2025

tokenization-workshop.github.io

April 15, 2025 at 5:23 PM

Reposted by Ondrej Dusek

leon

@leyawn.bsky.social

“is my calculator horny?“ our tech columnist asks. “i entered 5318008 into it and turned it upside down. what i saw surprised me”

The New York Times @nytimes.com · Apr 24

“Can ChatGPT experience joy or suffering? Does Gemini deserve human rights?” our tech columnist asks. “Many A.I. experts I know would say no, not yet, not even close. But I was intrigued.”

Should We Start Taking the Welfare of A.I. Seriously?

As artificial intelligence systems become smarter, one A.I. company is trying to figure out what to do if they become conscious.

www.nytimes.com

April 24, 2025 at 6:24 PM

Reposted by Ondrej Dusek

Zdeněk Kasner

@zdenekkasner.bsky.social

How do LLMs compare to human crowdworkers in annotating text spans? 🧑🤖

And how can span annotation help us with evaluating texts?

Find out in our new paper: llm-span-annotators.github.io

Arxiv: arxiv.org/abs/2504.08697

Large Language Models as Span Annotators

Website for the paper Large Language Models as Span Annotators

llm-span-annotators.github.io

April 15, 2025 at 11:10 AM

Reposted by Ondrej Dusek

Institute of Formal and Applied Linguistics

@ufal.mff.cuni.cz

Participate in the 👉 CRAC 2025 Shared Task on Multilingual Coreference Resolution❗ ufal.mff.cuni.cz/corefud/crac25
If you have not already done so, register first. 👆 Then start discovering how words refer to each other in 1️⃣7️⃣ languages. This year includes a new ✨LLM✨ track 😮.

April 9, 2025 at 3:02 PM

Reposted by Ondrej Dusek

Radek Šimík

@radeksimik.bsky.social

A 3-year full-time post-doc position in Prague! I'll be grateful for reposts. Feel free to get in touch if you have questions. linguistlist.org/issues/36/10...

LINGUIST List 36.1018 Jobs: Morphology, Pragmatics, Semantics, Syntax: Post-Doc in Empirical and Theoretical Linguistics, Faculty of Arts, Charles University

The LINGUIST List, International Linguistics Community Online.

linguistlist.org

March 24, 2025 at 4:10 PM

Reposted by Ondrej Dusek

Dare Obasanjo

@carnage4life.bsky.social

It’s kind of quaint to think the big worry a few years ago was that AI chatbots would destroy humanity. We’re quite capable of doing that without their help.

March 22, 2025 at 11:09 AM

Reposted by Ondrej Dusek

Magazín UK Forum

@ukforum.cuni.cz

👨‍💻👩‍💻 Pod vedením @ufal-cuni.bsky.social #MFFUK @unikarlova.cuni.cz se začíná budovat rodina velkých jazykových modelů pro všechny evropské jazyky. V Karolinu dnes odstartoval mezinárodní projekt @openeurollm.bsky.social. 👏
www.ukforum.cz/rubriky/aktu...

Univerzita Karlova v čele evropského výzkumu AI

„Naším hlavním cílem je vyrobit jazykový model, který bude konkurencí stávajícím modelům, a navíc bude fungovat velmi dobře pro všechny evropské jazyky,“ uvedl profesor Jan Hajič z ÚFAL MFF UK, který ...

www.ukforum.cz

March 7, 2025 at 2:45 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news