Lewis Won

A software engineer with interest in LLMs

Singapore Joined on Nov 30, 2024

Cover image for How does low-rank adaptation for large language models work

Lewis Won

Sep 13 '25

How does low-rank adaptation for large language models work

#llm #machinelearning #ai #algorithms

21 min read

Want to connect with Lewis Won?

Create an account to connect with Lewis Won. You can also sign in below to proceed if you already have an account.

Create Account

Already have an account? Sign in

Lewis Won

Sep 6 '25

FlashAttention by hand

#llm #algorithms #machinelearning #ai

23 min read

Lewis Won

Aug 28 '25

Online softmax by hand

#llm #algorithms #machinelearning #ai

20 min read

Cover image for Tensor parallelism by hand

Lewis Won

Aug 23 '25

Tensor parallelism by hand

#machinelearning #pytorch #distributedsystems #llm

28 min read

Cover image for Routing and balancing losses with Mixture of Experts

Lewis Won

Aug 16 '25

Routing and balancing losses with Mixture of Experts

17 min read

Lewis Won

Aug 9 '25

Plain Guide to Einops

#machinelearning #tensorflow #programming #python

16 min read

Cover image for ZeRO by hand with a 4-parameter model

Lewis Won

Aug 1 '25

ZeRO by hand with a 4-parameter model

#distributedsystems #llm #machinelearning #ai

23 min read

Cover image for From Scatter to All-Reduce: A Plain-English Guide to Collective Operations

Lewis Won

Jul 25 '25

From Scatter to All-Reduce: A Plain-English Guide to Collective Operations

#programming #distributedsystems

21 min read

Cover image for Demystifying GPUs: From Core Architecture to Scalable Systems

Lewis Won

Jul 20 '25

Demystifying GPUs: From Core Architecture to Scalable Systems

#nvidia #gpu #architecture #cuda

12 min read

Cover image for Key Insights Gained from Building and Training LLM

Lewis Won

Jul 12 '25

Key Insights Gained from Building and Training LLM

#llm #machinelearning #pytorch #ai

27 min read

Cover image for Contextual chunking for Retrieval Augmented Generation

Lewis Won

Jul 5 '25

Contextual chunking for Retrieval Augmented Generation

#rag #productivity #tutorial #vectordatabase

59 min read

Cover image for Implementing DeepSeek-R1 GRPO in Apple MLX framework

Lewis Won

Jun 21 '25

Implementing DeepSeek-R1 GRPO in Apple MLX framework

#python #machinelearning #deeplearning #deepseek

29 min read

Cover image for Efficient self-attention mechanism

Lewis Won

Jun 15 '25

Efficient self-attention mechanism

#llm #nlp #machinelearning #datascience

31 min read

Cover image for Creating the self-attention mechanism from scratch

Lewis Won

Jun 8 '25

Creating the self-attention mechanism from scratch

#llm #tutorial #pytorch #chatgpt

22 min read

Cover image for Navigating the world of Harry Potter with Knowledge Graphs

Lewis Won

Dec 5 '24

Navigating the world of Harry Potter with Knowledge Graphs

#langchain #knowledgegraph #python #rag

6 min read

Forem

Lewis Won

Badges

One Year Club

Writing Debut

How does low-rank adaptation for large language models work

Want to connect with Lewis Won?

FlashAttention by hand

Online softmax by hand

Tensor parallelism by hand

Routing and balancing losses with Mixture of Experts

Plain Guide to Einops

ZeRO by hand with a 4-parameter model

From Scatter to All-Reduce: A Plain-English Guide to Collective Operations

Demystifying GPUs: From Core Architecture to Scalable Systems

Key Insights Gained from Building and Training LLM

Contextual chunking for Retrieval Augmented Generation

Implementing DeepSeek-R1 GRPO in Apple MLX framework

Efficient self-attention mechanism

Creating the self-attention mechanism from scratch

Navigating the world of Harry Potter with Knowledge Graphs