https://www.threads.net/@delip.rao
We trained 3 models - 1.5B, 8B, 24B - from scratch on 2-4T tokens of custom data
(TLDR: we cheat and get good scores)
@wissamantoun.bsky.social @rachelbawden.bsky.social @bensagot.bsky.social @zehavoc.bsky.social
We trained 3 models - 1.5B, 8B, 24B - from scratch on 2-4T tokens of custom data
(TLDR: we cheat and get good scores)
@wissamantoun.bsky.social @rachelbawden.bsky.social @bensagot.bsky.social @zehavoc.bsky.social
Tbh the only visual allegory possible is this...
Tbh the only visual allegory possible is this...
- Dataset of 14K+ withdrawn arXiv papers
- associated retraction comments
- entire history through 09/24
- taxonomy of retraction reasons, from critical errors to policy violations
- WithdrarXiv-SciFy, enriched version w/ scripts for parsed full-text PDFs
arxiv.org/abs/2412.03775
- Dataset of 14K+ withdrawn arXiv papers
- associated retraction comments
- entire history through 09/24
- taxonomy of retraction reasons, from critical errors to policy violations
- WithdrarXiv-SciFy, enriched version w/ scripts for parsed full-text PDFs
arxiv.org/abs/2412.03775
@deliprao.bsky.social today that I really appreciated as someone trying to break into the field. Simple categorizations can seem trite at times, but they can be deceptively profound in breaking down complex problems.
substack.com/home/post/p-...
@deliprao.bsky.social today that I really appreciated as someone trying to break into the field. Simple categorizations can seem trite at times, but they can be deceptively profound in breaking down complex problems.
substack.com/home/post/p-...
This dataset has been collected using Bluesky's API, and I hope it will be useful for all the researchers out there!
This dataset has been collected using Bluesky's API, and I hope it will be useful for all the researchers out there!
https://www.threads.net/@delip.rao
https://www.threads.net/@delip.rao
we’re re-territorializing the hilbert space
we’re re-territorializing the hilbert space