Sewon Lee
sewonlee.bsky.social
Studying RecSys, Bioinfo @ Seoul National Univ. 🇰🇷 • former Steinegger Lab • 🧠 → 🧬 → 💻
Reposted by Sewon Lee
Our structural core gene pipeline Unicore is now published in GBE
📄 doi.org/10.1093/gbe/...

Please also check out @dongwookkim.bsky.social’s
🧵 bsky.app/profile/dong...
June 3, 2025 at 5:19 PM
Reposted by Sewon Lee
Unicore is now published on GBE 🚀
Unicore rapidly identifies structural single-copy core genes from input species proteomes for phylogenetic analysis. Powered by Foldseek and ProstT5, Unicore enables linear-scale structure-based phylogeny of any given set of taxa. 🧵1/n
📃 doi.org/10.1093/gbe/evaf109
June 3, 2025 at 6:55 AM
Reposted by Sewon Lee
Can't wait for when I can vibe code a production recommender system.

Until then, here's some system designs:

• Retrieval vs. Ranking: eugeneyan.com/writing/syst...
• Real-time retrieval: eugeneyan.com/writing/real...
• Personalization: eugeneyan.com/writing/patt...
April 8, 2025 at 5:14 AM
Reposted by Sewon Lee
What industrial recsys papers have you enjoyed or found useful in the past year or two? Sharing my list:

# 1. Integrating LLMs into recsys

1.1. LLM-augmented recommenders
• Better Generalization with Semantic IDs: A Case Study in Ranking for Recommendations
- arxiv.org/abs/2306.08121
January 31, 2025 at 3:23 AM
Reposted by Sewon Lee
SNU Profs Woon Ju Song & Martin Steinegger (Biology) developed the AI-based SeekRank algorithm to discover enzymes for cancer immunotherapy. doi.org/10.1093/nar/...
Discovery of highly active kynureninases for cancer immunotherapy through protein language model
Abstract. Tailor-made enzymes empower a wide range of versatile applications, although searching for the desirable enzymes often requires high throughput s
January 8, 2025 at 5:58 AM
Reposted by Sewon Lee
Unicore identifies single-copy protein structures across genomes using Foldseek, bypassing slow structure predictions by utilizing 3Di predictions from ProstT5, enabling rapid phylogenetic inference at the tree-of-life scale. 1/n
📄 www.biorxiv.org/content/10.1...
💾 github.com/steineggerla...
December 23, 2024 at 4:39 PM
Reposted by Sewon Lee
Unicore enables scalable and accurate phylogenetic reconstruction with structural core genes https://www.biorxiv.org/content/10.1101/2024.12.22.629535v1
December 23, 2024 at 3:51 AM
Reposted by Sewon Lee
And if you're looking for more learning over the long Thanksgiving weekend, this could be a good place to start: eugeneyan.com/start-here/
November 27, 2024 at 5:29 PM
Reposted by Sewon Lee
Our Big Fantastic Virus Database (BFVD) is now published in NAR! It contains protein structure predictions of major viral clades, enhanced by petabase-scale homology search, and it's explorable on the web.
🌐 bfvd.foldseek.com
💾 bfvd.steineggerlab.workers.dev
📄 academic.oup.com/nar/advance-...
November 23, 2024 at 9:12 PM