Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
Forem
Close
#
inference
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
BeeLlama v0.2.0: 164 tok/s on a 27B model, one RTX 3090
Thousand Miles AI
Thousand Miles AI
Thousand Miles AI
Follow
May 23
BeeLlama v0.2.0: 164 tok/s on a 27B model, one RTX 3090
#
ai
#
llm
#
inference
#
opensource
Comments
Add Comment
3 min read
RAM Coffers: NUMA-Aware LLM Inference — Why Hardware Topology Still Matters
BossChaos
BossChaos
BossChaos
Follow
May 22
RAM Coffers: NUMA-Aware LLM Inference — Why Hardware Topology Still Matters
#
ai
#
hardware
#
inference
Comments
Add Comment
2 min read
Your AI speed benchmark is measuring the one workload you don't run
Thousand Miles AI
Thousand Miles AI
Thousand Miles AI
Follow
May 19
Your AI speed benchmark is measuring the one workload you don't run
#
discuss
#
ai
#
llm
#
inference
Comments
Add Comment
3 min read
Async Batching Is the Real Latency Win Nobody's Talking About
Aamer Mihaysi
Aamer Mihaysi
Aamer Mihaysi
Follow
May 15
Async Batching Is the Real Latency Win Nobody's Talking About
#
llm
#
inference
#
async
1
 reaction
Comments
Add Comment
3 min read
ReFlect: Training-Free Error Recovery for Long-Horizon LLM Reasoning
Jangwook Kim
Jangwook Kim
Jangwook Kim
Follow
May 11
ReFlect: Training-Free Error Recovery for Long-Horizon LLM Reasoning
#
llmreasoning
#
agents
#
inference
#
arxiv2026
Comments
Add Comment
4 min read
Why Most Browser AI Demos Fail on Real Hardware
Bruno Juca
Bruno Juca
Bruno Juca
Follow
May 10
Why Most Browser AI Demos Fail on Real Hardware
#
ai
#
inference
#
hardware
#
benchmark
Comments
Add Comment
4 min read
The Inference Inversion
David Aronchick
David Aronchick
David Aronchick
Follow
May 5
The Inference Inversion
#
distributedcomputing
#
edgecomputing
#
nvidia
#
inference
Comments
Add Comment
7 min read
First Confirmed Directional Move on the AI Inference Frontier Index in 2026
Steriani Karamanlis
Steriani Karamanlis
Steriani Karamanlis
Follow
May 12
First Confirmed Directional Move on the AI Inference Frontier Index in 2026
#
ai
#
llm
#
inference
#
pricing
Comments
Add Comment
4 min read
Tutorial: This AI Now Tells You if a Meeting Could Be an Email
Andrew Dugan
Andrew Dugan
Andrew Dugan
Follow
for
DigitalOcean
May 21
Tutorial: This AI Now Tells You if a Meeting Could Be an Email
#
ai
#
tutorial
#
agentskills
#
inference
3
 reactions
Comments
Add Comment
8 min read
Tutorial: Build a Cost-Aware AI Support Triage API
James Skelton
James Skelton
James Skelton
Follow
for
DigitalOcean
May 19
Tutorial: Build a Cost-Aware AI Support Triage API
#
ai
#
tutorial
#
api
#
inference
3
 reactions
Comments
1
 comment
13 min read
Muse Spark beats Llama 4 with 10x less compute. Here's how.
Gabriel Anhaia
Gabriel Anhaia
Gabriel Anhaia
Follow
Apr 26
Muse Spark beats Llama 4 with 10x less compute. Here's how.
#
ai
#
llm
#
architecture
#
inference
Comments
Add Comment
7 min read
First Words: LLM Inference on RISC-V
Bruno Verachten
Bruno Verachten
Bruno Verachten
Follow
Apr 22
First Words: LLM Inference on RISC-V
#
bananapi
#
benchmark
#
inference
#
llamacpp
Comments
Add Comment
9 min read
Gaussian Process Regression: The Bayesian Approach to Curve Fitting
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Apr 13
Gaussian Process Regression: The Bayesian Approach to Curve Fitting
#
bayesian
#
supervisedlearning
#
probabilistic
#
inference
Comments
Add Comment
13 min read
Google Dropped TurboQuant Two Weeks Ago. The Community Already Made It Usable.
Alan West
Alan West
Alan West
Follow
Apr 7
Google Dropped TurboQuant Two Weeks Ago. The Community Already Made It Usable.
#
turboquant
#
locallm
#
inference
#
opensource
1
 reaction
Comments
Add Comment
6 min read
Hierarchical Bayesian Regression with PyMC: When Groups Share Strength
Berkan Sesen
Berkan Sesen
Berkan Sesen
Follow
Apr 26
Hierarchical Bayesian Regression with PyMC: When Groups Share Strength
#
bayesian
#
probabilistic
#
inference
#
pymc
1
 reaction
Comments
Add Comment
13 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a blogging-forward open source social network where we learn from one another
Log in
Create account