Link to Original: 97% Cheaper, Faster, Better, Correct AI — with Varun Mohan of Codeium
Summary
Guest: Varun Mohan from Codeium / Exafunction.
Background:
Varun studied CS at MIT and later became the tech lead manager for autonomy at Nuro, focusing on self-driving cars and AI.
Subsequently, he co-founded Exafunction with colleagues from Nuro.
Varun's team built Codeium, its own clone of GitHub Copilot, in a short period.
Personal Insights:
Varun is passionate about endurance sports, such as triathlons and long-distance cycling.
These sports provide a mental break for him, allowing his mind to focus solely on the immediate physical challenge.
Exafunction:
Born from Varun's experience at Nuro, Exafunction aims to simplify the complexities of deep learning infrastructure for businesses.
The company offers solutions to optimize GPU utilization, ensuring that deep learning operations are cost-effective and efficient.
They introduced techniques such as dynamic multiplexing to address the GPU under-utilization common at many companies.
Varun believes that many companies should use off-the-shelf architectures and fine-tune existing models such as BERT and ResNet.
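To make the GPU under-utilization point concrete, here is a minimal sketch of why sharing GPUs across workloads can cut cost. It is not Exafunction's method: the summary does not describe how their dynamic multiplexing works, so this uses a simple first-fit-decreasing packing as a stand-in. The workload names and utilization figures are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Gpu:
    """One physical GPU; a load of 1.0 means fully utilized."""
    load: float = 0.0
    workloads: List[str] = field(default_factory=list)


def pack_workloads(demands: Dict[str, float]) -> List[Gpu]:
    """Place each workload on the first GPU with room (first-fit decreasing).

    `demands` maps a workload name to the fraction of one GPU it needs.
    This is an illustrative stand-in, not a real scheduler.
    """
    gpus: List[Gpu] = []
    # Largest demands first, so big workloads are placed before small ones.
    for name, demand in sorted(demands.items(), key=lambda kv: kv[1], reverse=True):
        if not 0.0 < demand <= 1.0:
            raise ValueError(f"demand for {name!r} must be in (0, 1]")
        for gpu in gpus:
            # Small tolerance absorbs floating-point rounding in the sum.
            if gpu.load + demand <= 1.0 + 1e-9:
                gpu.load += demand
                gpu.workloads.append(name)
                break
        else:
            # No existing GPU has room, so allocate a new one.
            gpus.append(Gpu(load=demand, workloads=[name]))
    return gpus


# Hypothetical models, each using only a fraction of a GPU.
demands = {"ranker": 0.30, "detector": 0.25, "embedder": 0.20, "classifier": 0.15}

dedicated_gpus = len(demands)          # one GPU per model, mostly idle
shared = pack_workloads(demands)       # models share GPUs
shared_gpus = len(shared)

print(f"dedicated: {dedicated_gpus} GPUs, shared: {shared_gpus} GPUs")
```

With these made-up numbers, the four models need four GPUs when each gets its own device but fit on one when they share. A production system would also have to handle memory limits, latency targets, and load that changes over time, none of which this sketch models.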
Code:
Exafunction's infrastructure efficiency inspired the team to consider consumer-facing products.
They see value in applications like GitHub Copilot, which provide real-time support for developers.
Varun and his team experienced the benefits of Copilot firsthand, especially when writing complex code, and see its potential as more than a tool for basic completions.
The podcast highlighted the advancements in deep learning infrastructure, the challenges in GPU optimization, and the future of real-time coding assistance tools.