Lightnews — Scholar-powered news

Seong Joon Oh

@coallaoh.bsky.social

Paper: arxiv.org/abs/2409.16797

Code: github.com/AlexanderRub...

OpenReview: openreview.net/forum?id=BQE...

Scalable Ensemble Diversification for OOD Generalization and Detection

Training a diverse ensemble of models has several practical applications such as providing candidates for model selection with better out-of-distribution (OOD) generalization, and enabling the detecti...

arxiv.org

January 23, 2025 at 10:21 PM

Seong Joon Oh

@coallaoh.bsky.social

Thank Alex for his great efforts and work ethic. Thank @damienteney.bsky.social and @lucascimeca.bsky.social for their continued help with this paper. We’ll humbly address the criticisms to improve it further for future opportunities.

January 23, 2025 at 10:21 PM

Seong Joon Oh

@coallaoh.bsky.social

We were a bit unlucky with the reviewers - one voted for acceptance, while the other two remained silent during the discussion phase. What matters, though, is knowing this is solid work and the method works. That’s how we survive the review process.

January 23, 2025 at 10:21 PM

Seong Joon Oh

@coallaoh.bsky.social

- Outcome: Scaled ensemble diversification to ImageNet level, achieving improved OOD generalisation and detection.

January 23, 2025 at 10:21 PM

Seong Joon Oh

@coallaoh.bsky.social

If you can't wait for the arXiv version, check out the ICLR forum: openreview.net/forum?id=ByC...

PS: This paper had 6 reviewers, unanimously voting for acceptance eventually (score 6+). Such luck is rare for me 😅 I'm glad that the hard work paid off, @auselis.bsky.social. Let's arXiv it!

This link will take you to a page that’s not on LinkedIn

lnkd.in

January 23, 2025 at 9:58 PM

Seong Joon Oh

@coallaoh.bsky.social

Thank other co-authors too: Alexander Rubinstein and Ehsan Abbasnejad.

paper: arxiv.org/abs/2403.07968
code: github.com/aktsonthalia...

Do Deep Neural Network Solutions Form a Star Domain?

It has recently been conjectured that neural network solution sets reachable via stochastic gradient descent (SGD) are convex, considering permutation invariances (Entezari et al., 2022). This means t...

arxiv.org

January 23, 2025 at 9:44 PM

Seong Joon Oh

@coallaoh.bsky.social

I.e., at a reasonable width (no wideresnet), solutions already form a star domain.

Side note: We developed a method for finding the "star model", a special solution connected to all other solutions. I couldn't resist naming it "NeuralStarLink" but fortunately the first author Ankit held me back :)

Do Deep Neural Network Solutions Form a Star Domain?

It has recently been conjectured that neural network solution sets reachable via stochastic gradient descent (SGD) are convex, considering permutation invariances (Entezari et al., 2022). This means t...

arxiv.org

January 23, 2025 at 9:44 PM

Seong Joon Oh

@coallaoh.bsky.social

✨ Coming soon: Email digests for the authors and organisations you follow. Stay tuned for more updates!

Let’s make 2025 a year of learning and staying connected! 🚀

3/3

December 31, 2024 at 4:53 PM

Seong Joon Oh

@coallaoh.bsky.social

📩 How to get started:

- New users: Sign up here: researchtrend.ai/auth/signup and select “I agree to receive personalised daily email digests featuring the latest arXiv papers.”

- Existing users: Update your preferences in your profile: researchtrend.ai/profile.

2/3

Sign Up | ResearchTrend.AI

Explore the most trending research topics in AI

researchtrend.ai

December 31, 2024 at 4:53 PM

Seong Joon Oh

@coallaoh.bsky.social

💡 Follow these fast-evolving domains and join their discussions on researchtrend.ai ! 🚀 5/5

ResearchTrend.AI

Explore the most trending research topics in AI

researchtrend.ai

December 29, 2024 at 5:44 AM

Seong Joon Oh

@coallaoh.bsky.social

📊 Language Models for Tabular Data (LMTD)
While deep learning has conquered vision and language, tabular data remains a challenge. Let's see what happens in 2025. LMTD is gaining traction as researchers push the boundaries to unlock its full potential.
researchtrend.ai/communities/... 4/5

Language Models for Tabular Data (LMTD)

Leverage the power of large language models to process diverse tables and perform various tabular tasks based on natural language instructions.

researchtrend.ai

December 29, 2024 at 5:44 AM

Seong Joon Oh

@coallaoh.bsky.social

🤖 Large-Language Model Agents (LLMAG)
2025 is shaping up to be a transformative year for LLM agents. The surge in interest reflects their growing role in automation, reasoning, and decision-making across domains.
researchtrend.ai/communities/... 3/5

Large-Language Model Agents (LLMAG)

LLM agents use LLMs as the main controller to execute complex tasks, integrating modules like planning, memory, and tool usage.

researchtrend.ai

December 29, 2024 at 5:44 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news