Forem

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Fuzzy-match millions of rows in Databricks (2026)
Cover image for Fuzzy-match millions of rows in Databricks (2026)

Fuzzy-match millions of rows in Databricks (2026)

9
Comments
5 min read
Logistic Regression: The Bouncer Who Gives Probability of Entry Instead of Just Yes/No
Cover image for Logistic Regression: The Bouncer Who Gives Probability of Entry Instead of Just Yes/No

Logistic Regression: The Bouncer Who Gives Probability of Entry Instead of Just Yes/No

Comments
13 min read
Lasso Regression: The Brutal Manager Who Said 'Some of You Are Getting Fired' — And Actually Did It
Cover image for Lasso Regression: The Brutal Manager Who Said 'Some of You Are Getting Fired' — And Actually Did It

Lasso Regression: The Brutal Manager Who Said 'Some of You Are Getting Fired' — And Actually Did It

Comments
12 min read
Elastic Net: The Mediator Who Said 'Let's Take the Best of Both Approaches'
Cover image for Elastic Net: The Mediator Who Said 'Let's Take the Best of Both Approaches'

Elastic Net: The Mediator Who Said 'Let's Take the Best of Both Approaches'

Comments
11 min read
Ridge Regression: The Manager Who Said 'Everyone Gets a Small Piece' Instead of 'Winner Takes All'
Cover image for Ridge Regression: The Manager Who Said 'Everyone Gets a Small Piece' Instead of 'Winner Takes All'

Ridge Regression: The Manager Who Said 'Everyone Gets a Small Piece' Instead of 'Winner Takes All'

Comments
12 min read
When Linear Regression Assumptions Are Violated: The Bridge Engineer Who Ignored the Cracks and Declared It Safe
Cover image for When Linear Regression Assumptions Are Violated: The Bridge Engineer Who Ignored the Cracks and Declared It Safe

When Linear Regression Assumptions Are Violated: The Bridge Engineer Who Ignored the Cracks and Declared It Safe

Comments
12 min read
Day 5 : Is Your Model Actually Good? - Evaluation Metrics
Cover image for Day 5 : Is Your Model Actually Good? - Evaluation Metrics

Day 5 : Is Your Model Actually Good? - Evaluation Metrics

5
Comments
2 min read
Decision Trees: The Detective Who Solves Cases by Asking Yes/No Questions
Cover image for Decision Trees: The Detective Who Solves Cases by Asking Yes/No Questions

Decision Trees: The Detective Who Solves Cases by Asking Yes/No Questions

Comments
16 min read
Information Gain & Entropy: The Game Show Host Who Learned to Ask Perfect Questions
Cover image for Information Gain & Entropy: The Game Show Host Who Learned to Ask Perfect Questions

Information Gain & Entropy: The Game Show Host Who Learned to Ask Perfect Questions

Comments
16 min read
Top 5 GIS Tools for Spatial Data Processing and Digital Twins

Top 5 GIS Tools for Spatial Data Processing and Digital Twins

Comments
4 min read
3D Skeleton Detection from Baseball Motion Capture Data with Driveline C3D

3D Skeleton Detection from Baseball Motion Capture Data with Driveline C3D

2
Comments
3 min read
Why Is It Called 'Logistic Regression' If It's Used for Classification? The Naming Mystery Explained
Cover image for Why Is It Called 'Logistic Regression' If It's Used for Classification? The Naming Mystery Explained

Why Is It Called 'Logistic Regression' If It's Used for Classification? The Naming Mystery Explained

Comments
8 min read
Why Marcel Beat LightGBM: Building an NPB Player Performance Prediction System

Why Marcel Beat LightGBM: Building an NPB Player Performance Prediction System

1
Comments
9 min read
Nonlinear Least Squares and Nonlinear Regression in R: Concepts, Origins, Applications, and Case Studies

Nonlinear Least Squares and Nonlinear Regression in R: Concepts, Origins, Applications, and Case Studies

1
Comments
5 min read
How I Self-Host Metabase & Why Your Team Should Too
Cover image for How I Self-Host Metabase & Why Your Team Should Too

How I Self-Host Metabase & Why Your Team Should Too

7
Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.