Sooyoung Cha
banner
sooyoung-cha.bsky.social
Sooyoung Cha
@sooyoung-cha.bsky.social
🎓 ECE @Seoul National University
🇰🇷 Grad student @SteineggerLab
Reposted by Sooyoung Cha
MMseqs2-GPU sets new standards in single query search speed, allows near instant search of big databases, scales to multiple GPUs and is fast beyond VRAM. It enables ColabFold MSA generation in seconds and sub-second Foldseek search against AFDB50. 1/n
📄 www.nature.com/articles/s41...
💿 mmseqs.com
GPU-accelerated homology search with MMseqs2 - Nature Methods
Graphics processing unit-accelerated MMseqs2 offers tremendous speedups for homology retrieval from metagenomic databases, query-centered multiple sequence alignment generation for structure predictio...
www.nature.com
September 21, 2025 at 8:06 AM
Reposted by Sooyoung Cha
Today at 5pm, @eunbelivable.bsky.social will present her work on the Big Fantastic Viral Database (BFVD) at #ISMB2025 in BOSC. She also has a poster B-123 (tomorrow, 22nd), so please drop by to have ta chat and grab some stickers!
📄 academic.oup.com/nar/article/...
July 21, 2025 at 9:29 AM
Reposted by Sooyoung Cha
AFESM: a metagenomic guide through the protein structure universe! We clustered 821M structures (AFDB&ESMatlas) into 5.12M groups; revealing biome-specific groups, only 1 new fold even after AlphaFold2 re-prediction & many novel domain combos. 🧵
🌐 afesm.foldseek.com
📄 www.biorxiv.org/content/10.1...
April 27, 2025 at 12:13 AM
Reposted by Sooyoung Cha
The Foldseek webserver for fast protein structure searches now features a Sankey tree taxonomy visualization and filter, allowing to subset hits by clades. Developed by my talented student @sunjaelee.bsky.social. Try it out!
🌐 search.foldseek.com
January 17, 2025 at 12:33 PM
Reposted by Sooyoung Cha
Foldseek 10 with 4-27x (1-8 GPUs) faster search through MMseqs2-GPU. Faster ProstT5 protein search w/o structure prediction through multi-GPU/Apple Metal, new BFVD/BFMD databases and multimer clustering (preview).
💾 github.com/steineggerla...
📄 www.biorxiv.org/content/10.1...
🐍 available in bioconda
January 20, 2025 at 2:56 PM
Reposted by Sooyoung Cha
Unicore identifies single-copy protein structures across genomes using Foldseek, bypassing slow structure predictions by utilizing 3Di predictions from ProstT5, enabling rapid phylogenetic inference at the tree-of-life scale. 1/n
📄 www.biorxiv.org/content/10.1...
💾 github.com/steineggerla...
December 23, 2024 at 4:39 PM
Reposted by Sooyoung Cha
MMseqs2 Release 16 Highlights: GPU-accelerated search📄, ORF or new 6-frame translated search modes, contig taxonomy always keeps the longest ORF, bug fixes (reduced memory and higher sensitivity) and relicensed as MIT
📄 biorxiv.org/content/10.1...
💾 mmseqs.com and 🐍Bioconda 🖥️🧬🧶
November 27, 2024 at 9:08 AM
Reposted by Sooyoung Cha
Our Big Fantastic Virus Database (BFVD) is now published NAR! It contains protein structure predictions of major viral clades, enhanced by petabase-scale homology search and it's explorable on the web.
🌐 bfvd.foldseek.com
💾 bfvd.steineggerlab.workers.dev
📄 academic.oup.com/nar/advance-...
November 23, 2024 at 9:12 PM