by samsena | Jan 22, 2026 | Architecture Guides, Artificial Intelligence, Development, General, Research, The Business of Technology
Paper’s name: Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity (arXiv:2101.03961v3) 6/2022 Where are the lead Authors now? William (Liam) Fedus, Co-Founder of Periodic Labs Barret Zoph, rejoined OpenAI from Thinking...
by samsena | Jan 14, 2026 | Architecture Guides, Artificial Intelligence, Development, General, Research, The Business of Technology
Paper’s name: Training language models to follow instructions with human feedback (arXiv:2203.02155, Mar 2022) Where are the lead Authors now? Long Ouyang, Research Scientist at OpenAI Jeff Wu, AI researcher, Anthropic Xu Jiang, Founder, Light Robotics Diogo...
by samsena | Jan 8, 2026 | Artificial Intelligence, General, Research, The Business of Technology
Paper’s name: LORA: Low-Rank Adaptation of Large Language Models (arXiv:2106.09685, v2 Oct 2021) Where are the Authors now? Edward Hu Founding Partner, CTO (stealth AI startup) Yelong Shen, Principal Researcher, Microsoft Azure AI Phillip Wallis, Staff Research...
by samsena | Dec 30, 2025 | Artificial Intelligence, General, Research, The Business of Technology
For another installment of my list of most influential AI papers, here’s where the rubber hit the road, folks. Prior to distillation, running custom AI models was a costly and resource intensive process limited to those with deep pockets and access to large GPU...
by samsena | Dec 23, 2025 | Architecture Samples, Artificial Intelligence, Development, General, Research, The Business of Technology
Yet another installment of influential research papers that set the stage for the AI revolution. This week, it’s all about a way to augment and enhance your LLM with up to date, context relevant data without needing to go through pre-training all over again....