🗓️ 9 February 2025
⏱️ 42 minutes
Try a walking desk while studying ML or working on your projects! https://ocdevel.com/walk
Show notes: https://ocdevel.com/mlg/33
3Blue1Brown videos: https://3blue1brown.com/
Background & Motivation:
Core Architecture:
Self-Attention Mechanism:
Masking:
Feed-Forward Networks (MLPs):
Residual Connections & Normalization:
Scalability & Efficiency Considerations:
Training Paradigms & Emergent Properties:
Interpretability & Knowledge Distribution:
0:00.0 | Welcome back to Machine Learning Guide. I'm your host, Tyler Renelle. MLG teaches the fundamentals of machine learning and artificial intelligence. |
0:09.0 | It covers intuition, models, math, languages, frameworks, and more. |
0:13.0 | Where your other machine learning resources provide the trees, I provide the forest. |
0:18.0 | Visual is the best primary learning modality, but audio is a great supplement during exercise, commute, and chores. |
0:25.7 | Consider MLG your syllabus, with highly curated resources for each episode's details at ocdevel.com/mlg. |
0:35.6 | Speaking of curation, I'm a curator of life hacks, my favorite hack being treadmill desks. |
0:40.9 | While you study machine learning or work on your machine learning projects, walk. |
0:44.8 | This helps improve focus by increasing blood flow and endorphins. |
0:48.0 | This maintains consistency and energy, alertness, focus, and mood. |
0:52.6 | Get your CDC-recommended 10,000 steps while studying or working. |
0:56.6 | I get about 20,000 steps per day, walking just two miles per hour, which is sustainable without |
1:01.1 | instability at the mouse or keyboard. Save time and money on your fitness goals. See a link to my |
1:06.3 | favorite walking desk setup in the show notes. Today we're going to talk about Transformers, |
1:10.8 | the revolutionary |
1:11.9 | technology behind large language models, the technology put out by the "Attention Is All You Need" |
1:18.0 | white paper. And transformers are not as hairy of a concept as you might think they are. If you |
1:23.7 | tried reading the "Attention Is All You Need" white paper and it was just an earful, then stay tuned to this episode. |
1:30.2 | I'll try to break it down. |
1:31.1 | There's also a video by 3Blue1Brown that I'll reference at the end, when we talk about the resources. |
1:36.5 | It's really not as complex as I thought it was. |
1:38.7 | In fact, it's sort of a step back in technology, in that things were getting more and more compounded |
1:45.5 | and complex in neural network architectures. |
... |