Seong Joon Oh
banner
coallaoh.bsky.social
Seong Joon Oh
@coallaoh.bsky.social
Professor in Scalable Trustworthy AI @ University of Tübingen | Advisor at Parameter Lab & ResearchTrend.AI

https://seongjoonoh.com | https://scalabletrustworthyai.github.io/ | https://researchtrend.ai/
Thank Alex for his great efforts and work ethic. Thank @damienteney.bsky.social and @lucascimeca.bsky.social for their continued help with this paper. We’ll humbly address the criticisms to improve it further for future opportunities.
January 23, 2025 at 10:21 PM

We were a bit unlucky with the reviewers - one voted for acceptance, while the other two remained silent during the discussion phase. What matters, though, is knowing this is solid work and the method works. That’s how we survive the review process.
January 23, 2025 at 10:21 PM
- Outcome: Scaled ensemble diversification to ImageNet level, achieving improved OOD generalisation and detection.
January 23, 2025 at 10:21 PM

If you can't wait for the arXiv version, check out the ICLR forum: openreview.net/forum?id=ByC...

PS: This paper had 6 reviewers, unanimously voting for acceptance eventually (score 6+). Such luck is rare for me 😅 I'm glad that the hard work paid off, @auselis.bsky.social. Let's arXiv it!
LinkedIn
This link will take you to a page that’s not on LinkedIn
lnkd.in
January 23, 2025 at 9:58 PM

Thank other co-authors too: Alexander Rubinstein and Ehsan Abbasnejad.

paper: arxiv.org/abs/2403.07968
code: github.com/aktsonthalia...
Do Deep Neural Network Solutions Form a Star Domain?
It has recently been conjectured that neural network solution sets reachable via stochastic gradient descent (SGD) are convex, considering permutation invariances (Entezari et al., 2022). This means t...
arxiv.org
January 23, 2025 at 9:44 PM

I.e., at a reasonable width (no wideresnet), solutions already form a star domain.

Side note: We developed a method for finding the "star model", a special solution connected to all other solutions. I couldn't resist naming it "NeuralStarLink" but fortunately the first author Ankit held me back :)
Do Deep Neural Network Solutions Form a Star Domain?
It has recently been conjectured that neural network solution sets reachable via stochastic gradient descent (SGD) are convex, considering permutation invariances (Entezari et al., 2022). This means t...
arxiv.org
January 23, 2025 at 9:44 PM
✨ Coming soon: Email digests for the authors and organisations you follow. Stay tuned for more updates!

Let’s make 2025 a year of learning and staying connected! 🚀

3/3
December 31, 2024 at 4:53 PM
📩 How to get started:

- New users: Sign up here: researchtrend.ai/auth/signup and select “I agree to receive personalised daily email digests featuring the latest arXiv papers.”

- Existing users: Update your preferences in your profile: researchtrend.ai/profile.

2/3
Sign Up | ResearchTrend.AI
Explore the most trending research topics in AI
researchtrend.ai
December 31, 2024 at 4:53 PM
💡 Follow these fast-evolving domains and join their discussions on researchtrend.ai ! 🚀 5/5
ResearchTrend.AI
Explore the most trending research topics in AI
researchtrend.ai
December 29, 2024 at 5:44 AM
📊 Language Models for Tabular Data (LMTD)
While deep learning has conquered vision and language, tabular data remains a challenge. Let's see what happens in 2025. LMTD is gaining traction as researchers push the boundaries to unlock its full potential.
researchtrend.ai/communities/... 4/5
Language Models for Tabular Data (LMTD)
Leverage the power of large language models to process diverse tables and perform various tabular tasks based on natural language instructions.
researchtrend.ai
December 29, 2024 at 5:44 AM

🤖 Large-Language Model Agents (LLMAG)
2025 is shaping up to be a transformative year for LLM agents. The surge in interest reflects their growing role in automation, reasoning, and decision-making across domains.
researchtrend.ai/communities/... 3/5
Large-Language Model Agents (LLMAG)
LLM agents use LLMs as the main controller to execute complex tasks, integrating modules like planning, memory, and tool usage.
researchtrend.ai
December 29, 2024 at 5:44 AM