by samsena | Feb 18, 2026 | Artificial Intelligence, Development, General, Research, The Business of Technology
It’s time for our weekly dose of influential AI research papers. This time, a performance no-brainer… A subtle shift in prompting that can reap significant improvements at no additional costs! Paper’s name: Chain-of-Thought Prompting Elicits...
by samsena | Feb 9, 2026 | Artificial Intelligence, Development, General, Research, The Business of Technology
Here is another foray into the world of large language model optimization, it’s all about quantization, folks! Paper’s name: LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale (NeurIPS 2022; arXiv:2208.07339 11/22) Where are the Authors now?...
by samsena | Jan 22, 2026 | Architecture Guides, Artificial Intelligence, Development, General, Research, The Business of Technology
Paper’s name: Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity (arXiv:2101.03961v3) 6/2022 Where are the lead Authors now? William (Liam) Fedus, Co-Founder of Periodic Labs Barret Zoph, rejoined OpenAI from Thinking...
by samsena | Jan 14, 2026 | Architecture Guides, Artificial Intelligence, Development, General, Research, The Business of Technology
Paper’s name: Training language models to follow instructions with human feedback (arXiv:2203.02155, Mar 2022) Where are the lead Authors now? Long Ouyang, Research Scientist at OpenAI Jeff Wu, AI researcher, Anthropic Xu Jiang, Founder, Light Robotics Diogo...
by samsena | Jan 8, 2026 | Artificial Intelligence, General, Research, The Business of Technology
Paper’s name: LORA: Low-Rank Adaptation of Large Language Models (arXiv:2106.09685, v2 Oct 2021) Where are the Authors now? Edward Hu Founding Partner, CTO (stealth AI startup) Yelong Shen, Principal Researcher, Microsoft Azure AI Phillip Wallis, Staff Research...