by samsena | Jan 22, 2026 | Architecture Guides, Artificial Intelligence, Development, General, Research, The Business of Technology
Paper’s name: Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity (arXiv:2101.03961v3) 6/2022 Where are the lead Authors now? William (Liam) Fedus, Co-Founder of Periodic Labs Barret Zoph, rejoined OpenAI from Thinking...
by samsena | Jan 14, 2026 | Architecture Guides, Artificial Intelligence, Development, General, Research, The Business of Technology
Paper’s name: Training language models to follow instructions with human feedback (arXiv:2203.02155, Mar 2022) Where are the lead Authors now? Long Ouyang, Research Scientist at OpenAI Jeff Wu, AI researcher, Anthropic Xu Jiang, Founder, Light Robotics Diogo...
by samsena | Jan 8, 2026 | Artificial Intelligence, General, Research, The Business of Technology
Paper’s name: LORA: Low-Rank Adaptation of Large Language Models (arXiv:2106.09685, v2 Oct 2021) Where are the Authors now? Edward Hu Founding Partner, CTO (stealth AI startup) Yelong Shen, Principal Researcher, Microsoft Azure AI Phillip Wallis, Staff Research...