Thonk From First Principles

Why PyTorch is an amazing place to work... and Why I'm Joining Thinking Machines
In which I convince you to join either PyTorch or Thinking Machines!
Mar 4 • 
Horace He

August 2024

FlexAttention: The Flexibility of PyTorch with the Performance of FlashAttention [external]
Freeing users from the software lottery tyranny of fused attention implementations.
Aug 7, 2024 • 
Horace He

April 2024

Strangely, Matrix Multiplications on GPUs Run Faster When Given "Predictable" Data! [short]
Great minds discuss flops per watt.
Apr 29, 2024 • 
Horace He
Solutions: What Shapes Do Matrix Multiplications Like?
Companion to https://www.thonking.ai/p/what-shapes-do-matrix-multiplications
Apr 8, 2024 • 
Horace He
What Shapes Do Matrix Multiplications Like? [medium]
Divining order from the chaos
Apr 1, 2024 • 
Horace He

February 2024

Supporting Mixtral in gpt-fast through torch.compile [short]
Long-form version of this tweet thread: https://twitter.com/cHHillee/status/1762269069351461196
Feb 26, 2024 • 
Horace He and Yanbo Liang
© 2025 Horace He