Thonk From First Principles

Thonk From First Principles

Home
Archive
About
Why PyTorch is an amazing place to work... and Why I'm Joining Thinking Machines
In which I convince to you to join either PyTorch or Thinking Machines!
Mar 4 • 
Horace He
70
7

August 2024

FlexAttention: The Flexibility of PyTorch with the Performance of FlashAttention [external]
Freeing users from the software lottery tyranny of fused attention implementations.
Aug 7, 2024 • 
Horace He
18
2

April 2024

Strangely, Matrix Multiplications on GPUs Run Faster When Given "Predictable" Data! [short]
Great minds discuss flops per watt.
Apr 29, 2024 • 
Horace He
125
22
Solutions: What Shapes Do Matrix Multiplications Like?
Companion to https://www.thonking.ai/p/what-shapes-do-matrix-multiplications
Apr 8, 2024 • 
Horace He
10
What Shapes Do Matrix Multiplications Like? [medium]
Divining order from the chaos
Apr 1, 2024 • 
Horace He
78
2

February 2024

Supporting Mixtral in gpt-fast through torch.compile [short]
Long-form version of this tweet thread: https://twitter.com/cHHillee/status/1762269069351461196
Feb 26, 2024 • 
Horace He
 and 
Yanbo Liang
10
4
© 2025 Horace He
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture