Web: https://shashankgupta.info/
Training code, data, and everything you need to reproduce our models. Oh, and we have updated our tech report too!
Links in thread 👇
We invented new methods for fine-tuning language models with RL and built upon best practices to scale synthetic instruction and preference data.
Demo, GitHub, paper, and models 👇