<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Anshika</title>
    <description>The latest articles on Forem by Anshika (@anshikalohan).</description>
    <link>https://forem.com/anshikalohan</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3300212%2Fa53f83f3-d65b-4d61-a382-66bb68ce8424.jpg</url>
      <title>Forem: Anshika</title>
      <link>https://forem.com/anshikalohan</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/anshikalohan"/>
    <language>en</language>
    <item>
      <title>My Deep Learning Journey with Andrew Ng</title>
      <dc:creator>Anshika</dc:creator>
      <pubDate>Fri, 25 Jul 2025 12:20:23 +0000</pubDate>
      <link>https://forem.com/anshikalohan/my-deep-learning-journey-with-andrew-ng-4b77</link>
      <guid>https://forem.com/anshikalohan/my-deep-learning-journey-with-andrew-ng-4b77</guid>
      <description>&lt;h2&gt;
  
  
  Just Completed: Neural Networks and Deep Learning on Coursera!
&lt;/h2&gt;

&lt;p&gt;I'm excited to share that I've just finished the &lt;strong&gt;Neural Networks and Deep Learning&lt;/strong&gt; course on Coursera as part of the Deep Learning Specialization. This foundational course has been an incredible journey into the world of AI and machine learning!&lt;/p&gt;

&lt;h2&gt;
  
  
  What I Learned
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Core Concepts Mastered:
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Neural Network Fundamentals&lt;/strong&gt;: Understanding perceptrons, multi-layer networks, and the mathematical foundations behind them&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Forward and Backward Propagation&lt;/strong&gt;: Implementing the core algorithms that make neural networks learn&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Activation Functions&lt;/strong&gt;: Exploring sigmoid, tanh, ReLU, and their impact on network performance&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Gradient Descent Optimization&lt;/strong&gt;: Understanding how networks minimize cost functions&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Deep Neural Networks&lt;/strong&gt;: Building and training networks with multiple hidden layers&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Hands-On Experience:
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Implemented neural networks from scratch using Python and NumPy&lt;/li&gt;
&lt;li&gt;Built binary and multi-class classification models&lt;/li&gt;
&lt;li&gt;Worked with real datasets to solve practical problems&lt;/li&gt;
&lt;li&gt;Optimized network architectures and hyperparameters&lt;/li&gt;
&lt;li&gt;Developed intuition for debugging neural network performance&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Key Takeaways
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Mathematical Foundation Matters&lt;/strong&gt;: The course emphasized understanding the underlying math rather than just using black-box libraries. This deep dive into linear algebra, calculus, and probability has given me a solid foundation for more advanced topics.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Implementation from Scratch&lt;/strong&gt;: Writing forward and backward propagation algorithms manually was challenging but incredibly valuable. It demystified how popular frameworks like TensorFlow and PyTorch work under the hood.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Hyperparameter Tuning is an Art&lt;/strong&gt;: Learning when to adjust learning rates, choose different activation functions, or modify network architecture based on performance metrics was eye-opening.&lt;/p&gt;

&lt;h2&gt;
  
  
  Practical Projects
&lt;/h2&gt;

&lt;p&gt;Some highlights from the programming assignments:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Logistic Regression as a Neural Network&lt;/strong&gt;: Understanding how simple logistic regression connects to neural network concepts&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Planar Data Classification&lt;/strong&gt;: Building a network to classify non-linearly separable data&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Deep Neural Network Application&lt;/strong&gt;: Creating a multi-layer network for image recognition tasks&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  What's Next?
&lt;/h2&gt;

&lt;p&gt;This course is just the beginning! I'm planning to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Continue with the rest of the Deep Learning Specialization&lt;/li&gt;
&lt;li&gt;Apply these concepts to personal projects&lt;/li&gt;
&lt;li&gt;Explore computer vision and NLP applications&lt;/li&gt;
&lt;li&gt;Contribute to open-source ML projects&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  For Fellow Learners
&lt;/h2&gt;

&lt;p&gt;If you're considering this course, here's my advice:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Don't skip the math&lt;/strong&gt;: Even if it seems daunting, understanding the mathematical foundations pays off&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Code along actively&lt;/strong&gt;: Don't just watch the videos - implement everything yourself&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Experiment beyond assignments&lt;/strong&gt;: Try different parameters and see how they affect results&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Join study groups&lt;/strong&gt;: The discussion forums are incredibly helpful&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Resources That Helped Me
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Andrew Ng's clear explanations and intuitive examples&lt;/li&gt;
&lt;li&gt;The programming assignments with detailed starter code&lt;/li&gt;
&lt;li&gt;Supplementary reading on linear algebra and calculus&lt;/li&gt;
&lt;li&gt;Community discussions and peer interactions&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;The field of deep learning is evolving rapidly, and this course has given me the foundational knowledge to keep learning and growing. Excited to see where this journey takes me next!&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Course&lt;/strong&gt;: &lt;a href="https://www.coursera.org/learn/neural-networks-deep-learning" rel="noopener noreferrer"&gt;Neural Networks and Deep Learning&lt;/a&gt;&lt;br&gt;&lt;br&gt;
&lt;strong&gt;Instructor&lt;/strong&gt;: Andrew Ng&lt;br&gt;&lt;br&gt;
&lt;strong&gt;Platform&lt;/strong&gt;: Coursera&lt;/p&gt;

</description>
      <category>deeplearning</category>
      <category>machinelearning</category>
      <category>ai</category>
      <category>learning</category>
    </item>
    <item>
      <title>Building a Breast Cancer Prediction App with Machine Learning and Streamlit</title>
      <dc:creator>Anshika</dc:creator>
      <pubDate>Mon, 07 Jul 2025 12:02:15 +0000</pubDate>
      <link>https://forem.com/anshikalohan/building-a-breast-cancer-prediction-app-with-machine-learning-and-streamlit-57an</link>
      <guid>https://forem.com/anshikalohan/building-a-breast-cancer-prediction-app-with-machine-learning-and-streamlit-57an</guid>
      <description>&lt;p&gt;Medical AI is revolutionizing healthcare, and machine learning models are becoming powerful tools for early disease detection. In this comprehensive tutorial, I'll walk you through building a complete breast cancer prediction system using the Wisconsin Breast Cancer dataset.&lt;/p&gt;

&lt;h2&gt;
  
  
  What We'll Build
&lt;/h2&gt;

&lt;p&gt;By the end of this tutorial, you'll have:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;A fully trained logistic regression model for cancer prediction&lt;/li&gt;
&lt;li&gt;An interactive Streamlit web application&lt;/li&gt;
&lt;li&gt;Comprehensive exploratory data analysis&lt;/li&gt;
&lt;li&gt;A complete GitHub repository ready for deployment&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Live Demo&lt;/strong&gt;: &lt;a href="https://breast-cancer-prediction-bjvjgeuuedrb8xossqqpvn.streamlit.app/" rel="noopener noreferrer"&gt;Streamlit app&lt;/a&gt;&lt;br&gt;&lt;br&gt;
&lt;strong&gt;GitHub Repository&lt;/strong&gt;: &lt;a href="https://github.com/anshikalohan/Breast-cancer-prediction" rel="noopener noreferrer"&gt;House Price Prediction&lt;/a&gt;&lt;/p&gt;
&lt;h2&gt;
  
  
  Understanding the Dataset
&lt;/h2&gt;

&lt;p&gt;The Wisconsin Breast Cancer dataset contains 569 samples with 30 features each, computed from digitized images of breast mass fine needle aspirates. Each sample is classified as either:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Benign (B)&lt;/strong&gt;: Non-cancerous tumor&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Malignant (M)&lt;/strong&gt;: Cancerous tumor&lt;/li&gt;
&lt;/ul&gt;
&lt;h2&gt;
  
  
  🔧 Setting Up the Environment
&lt;/h2&gt;

&lt;p&gt;First, let's set up our development environment:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Create virtual environment&lt;/span&gt;
python &lt;span class="nt"&gt;-m&lt;/span&gt; venv breast_cancer_env
&lt;span class="nb"&gt;source &lt;/span&gt;breast_cancer_env/bin/activate  &lt;span class="c"&gt;# On Windows: breast_cancer_env\Scripts\activate&lt;/span&gt;

&lt;span class="c"&gt;# Install required packages&lt;/span&gt;
pip &lt;span class="nb"&gt;install &lt;/span&gt;pandas numpy scikit-learn matplotlib seaborn streamlit plotly joblib
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  Exploratory Data Analysis
&lt;/h2&gt;

&lt;p&gt;The first step in any machine learning project is understanding your data. Here's what we discovered:&lt;/p&gt;

&lt;h3&gt;
  
  
  Key Insights:
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Dataset Balance&lt;/strong&gt;: ~63% benign, ~37% malignant cases&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Feature Correlations&lt;/strong&gt;: Strong correlations between mean, SE, and worst values of the same measurements&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Distinguishing Features&lt;/strong&gt;: &lt;code&gt;concave_points_worst&lt;/code&gt;, &lt;code&gt;perimeter_worst&lt;/code&gt;, and &lt;code&gt;concave_points_mean&lt;/code&gt; show the highest correlation with malignancy&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Visualization Highlights:
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="c1"&gt;# Target variable distribution
&lt;/span&gt;&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;diagnosis&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;].&lt;/span&gt;&lt;span class="nf"&gt;value_counts&lt;/span&gt;&lt;span class="p"&gt;().&lt;/span&gt;&lt;span class="nf"&gt;plot&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;kind&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;bar&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;plt&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;title&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;Distribution of Diagnosis&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;plt&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;show&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;

&lt;span class="c1"&gt;# Correlation matrix
&lt;/span&gt;&lt;span class="n"&gt;sns&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;heatmap&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;corr&lt;/span&gt;&lt;span class="p"&gt;(),&lt;/span&gt; &lt;span class="n"&gt;annot&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="bp"&gt;False&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;cmap&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;coolwarm&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;plt&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;title&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;Feature Correlation Matrix&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;plt&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;show&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  Building the Machine Learning Model
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Data Preprocessing
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="c1"&gt;# Convert diagnosis to binary
&lt;/span&gt;&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;diagnosis&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;diagnosis&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;].&lt;/span&gt;&lt;span class="nf"&gt;map&lt;/span&gt;&lt;span class="p"&gt;({&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;B&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="mi"&gt;0&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;M&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="mi"&gt;1&lt;/span&gt;&lt;span class="p"&gt;})&lt;/span&gt;

&lt;span class="c1"&gt;# Separate features and target
&lt;/span&gt;&lt;span class="n"&gt;X&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;drop&lt;/span&gt;&lt;span class="p"&gt;([&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;diagnosis&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;id&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;],&lt;/span&gt; &lt;span class="n"&gt;axis&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="mi"&gt;1&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;y&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;diagnosis&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;

&lt;span class="c1"&gt;# Feature scaling
&lt;/span&gt;&lt;span class="n"&gt;scaler&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nc"&gt;StandardScaler&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
&lt;span class="n"&gt;X_scaled&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;scaler&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;fit_transform&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;X&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Model Training
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="c1"&gt;# Split the data
&lt;/span&gt;&lt;span class="n"&gt;X_train&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;X_test&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;y_train&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;y_test&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nf"&gt;train_test_split&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;
    &lt;span class="n"&gt;X_scaled&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;y&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;test_size&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="mf"&gt;0.3&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;random_state&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="mi"&gt;42&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;

&lt;span class="c1"&gt;# Train logistic regression
&lt;/span&gt;&lt;span class="n"&gt;lr&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nc"&gt;LogisticRegression&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
&lt;span class="n"&gt;lr&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;fit&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;X_train&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;y_train&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;

&lt;span class="c1"&gt;# Evaluate
&lt;/span&gt;&lt;span class="n"&gt;y_pred&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;model&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;predict&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;X_test&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;accuracy&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nf"&gt;accuracy_score&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;y_test&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;y_pred&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="nf"&gt;print&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="sa"&gt;f&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;Accuracy: &lt;/span&gt;&lt;span class="si"&gt;{&lt;/span&gt;&lt;span class="n"&gt;accuracy&lt;/span&gt;&lt;span class="si"&gt;:&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="n"&gt;f&lt;/span&gt;&lt;span class="si"&gt;}&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Model Performance
&lt;/h3&gt;

&lt;p&gt;Our logistic regression model achieved impressive results:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Accuracy&lt;/strong&gt;: 98%&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Precision&lt;/strong&gt;: High precision for both classes&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Recall&lt;/strong&gt;: Excellent recall for malignant cases&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Medical Disclaimer &amp;amp; Ethics
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Important&lt;/strong&gt;: This application is for educational purposes only. Key considerations:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Always consult qualified healthcare professionals&lt;/li&gt;
&lt;li&gt;AI should augment, not replace, medical expertise&lt;/li&gt;
&lt;li&gt;Consider bias in training data&lt;/li&gt;
&lt;li&gt;Ensure patient data privacy and security&lt;/li&gt;
&lt;li&gt;Regular model retraining and validation&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Deployment Options
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Local Development
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;streamlit run app.py
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Streamlit Cloud
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;Push code to GitHub&lt;/li&gt;
&lt;li&gt;Connect repository to Streamlit Cloud&lt;/li&gt;
&lt;li&gt;Deploy with one click&lt;/li&gt;
&lt;/ol&gt;

&lt;h3&gt;
  
  
  Docker Deployment
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight docker"&gt;&lt;code&gt;&lt;span class="k"&gt;FROM&lt;/span&gt;&lt;span class="s"&gt; python:3.9-slim&lt;/span&gt;
&lt;span class="k"&gt;COPY&lt;/span&gt;&lt;span class="s"&gt; requirements.txt .&lt;/span&gt;
&lt;span class="k"&gt;RUN &lt;/span&gt;pip &lt;span class="nb"&gt;install&lt;/span&gt; &lt;span class="nt"&gt;-r&lt;/span&gt; requirements.txt
&lt;span class="k"&gt;COPY&lt;/span&gt;&lt;span class="s"&gt; . .&lt;/span&gt;
&lt;span class="k"&gt;EXPOSE&lt;/span&gt;&lt;span class="s"&gt; 8501&lt;/span&gt;
&lt;span class="k"&gt;CMD&lt;/span&gt;&lt;span class="s"&gt; ["streamlit", "run", "app.py"]&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  Future Enhancements
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Model Improvements
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Ensemble Methods&lt;/strong&gt;: Random Forest, XGBoost&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Deep Learning&lt;/strong&gt;: Neural networks for complex patterns&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Feature Engineering&lt;/strong&gt;: Automated feature selection&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Application Features
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Multi-language Support&lt;/strong&gt;: Reach global healthcare providers&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;API Integration&lt;/strong&gt;: Connect with hospital systems&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Mobile App&lt;/strong&gt;: Native iOS/Android applications&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Real-time Monitoring&lt;/strong&gt;: Track model performance&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Advanced Analytics
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Explainable AI&lt;/strong&gt;: SHAP values for feature importance&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Uncertainty Quantification&lt;/strong&gt;: Confidence intervals&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Bias Detection&lt;/strong&gt;: Fairness across demographic groups&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Key Takeaways
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Data Quality Matters&lt;/strong&gt;: Clean, well-preprocessed data is crucial&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Model Simplicity&lt;/strong&gt;: Logistic regression can be highly effective&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;User Experience&lt;/strong&gt;: Medical applications need intuitive interfaces&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Validation is Critical&lt;/strong&gt;: Rigorous testing ensures reliability&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Ethical Considerations&lt;/strong&gt;: Always prioritize patient safety&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  Technical Stack Summary
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Data Science&lt;/strong&gt;: pandas, numpy, scikit-learn&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Visualization&lt;/strong&gt;: matplotlib, seaborn, plotly&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Web Framework&lt;/strong&gt;: Streamlit&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Deployment&lt;/strong&gt;: Streamlit Cloud, Docker&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Version Control&lt;/strong&gt;: Git, GitHub&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Resources &amp;amp; References
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.kaggle.com/datasets/uciml/breast-cancer-wisconsin-data" rel="noopener noreferrer"&gt;Wisconsin Breast Cancer Dataset&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://docs.streamlit.io/" rel="noopener noreferrer"&gt;Streamlit Documentation&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://scikit-learn.org/stable/user_guide.html" rel="noopener noreferrer"&gt;Scikit-learn User Guide&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://plotly.com/python/" rel="noopener noreferrer"&gt;Plotly Python Documentation&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;Building this breast cancer prediction system taught me the importance of combining technical excellence with ethical responsibility. Machine learning in healthcare requires not just accurate models, but also thoughtful user experience design and careful consideration of real-world implications.&lt;/p&gt;

&lt;p&gt;The project demonstrates how modern tools like Streamlit can democratize AI deployment, making sophisticated machine learning models accessible to healthcare professionals without extensive technical backgrounds.&lt;/p&gt;

&lt;p&gt;Remember: the goal isn't to replace medical professionals, but to provide them with powerful tools that can help save lives through early detection and improved diagnosis accuracy.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Have you built similar healthcare ML applications? What challenges did you face? Share your experiences in the comments below!&lt;/em&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;If you found this helpful, please give it a ❤️ and consider following for more AI and machine learning content!&lt;/em&gt;&lt;/p&gt;

</description>
    </item>
    <item>
      <title>Building My First End-to-End Machine Learning Project</title>
      <dc:creator>Anshika</dc:creator>
      <pubDate>Sun, 06 Jul 2025 19:42:39 +0000</pubDate>
      <link>https://forem.com/anshikalohan/building-my-first-end-to-end-machine-learning-project-23a3</link>
      <guid>https://forem.com/anshikalohan/building-my-first-end-to-end-machine-learning-project-23a3</guid>
      <description>&lt;p&gt;&lt;em&gt;A complete journey from data to deployment with Python, Scikit-learn, and Streamlit&lt;/em&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Introduction
&lt;/h2&gt;

&lt;p&gt;As a budding data scientist, I wanted to create a comprehensive machine learning project that showcases the entire ML pipeline - from data preprocessing to model deployment. Today, I'm excited to share my &lt;strong&gt;House Price Prediction&lt;/strong&gt; project that predicts real estate prices using machine learning!&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Live Demo&lt;/strong&gt;: &lt;a href="https://house-price-prediction-o9e737nhcbg5syapkubnck.streamlit.app/" rel="noopener noreferrer"&gt;Streamlit app&lt;/a&gt;&lt;br&gt;&lt;br&gt;
&lt;strong&gt;GitHub Repository&lt;/strong&gt;: &lt;a href="https://github.com/anshikalohan/house-price-prediction/tree/main" rel="noopener noreferrer"&gt;House Price Prediction&lt;/a&gt;&lt;/p&gt;
&lt;h2&gt;
  
  
  Project Overview
&lt;/h2&gt;

&lt;p&gt;This project predicts house prices based on various features like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Median income in the area&lt;/li&gt;
&lt;li&gt;House age and size characteristics&lt;/li&gt;
&lt;li&gt;Population and demographic data&lt;/li&gt;
&lt;li&gt;Geographic location&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The goal was to build a &lt;strong&gt;real-world applicable model&lt;/strong&gt; with a &lt;strong&gt;user-friendly interface&lt;/strong&gt; that anyone can use to get instant price predictions.&lt;/p&gt;
&lt;h2&gt;
  
  
  Tech Stack
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Python&lt;/strong&gt;: Core programming language&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Scikit-learn&lt;/strong&gt;: Machine learning algorithms&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Streamlit&lt;/strong&gt;: Web application framework&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Pandas &amp;amp; NumPy&lt;/strong&gt;: Data manipulation&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Matplotlib &amp;amp; Seaborn&lt;/strong&gt;: Data visualization&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Plotly&lt;/strong&gt;: Interactive charts&lt;/li&gt;
&lt;/ul&gt;
&lt;h2&gt;
  
  
  The Dataset
&lt;/h2&gt;

&lt;p&gt;I used the &lt;strong&gt;California Housing Dataset&lt;/strong&gt; containing 20,640 samples with features like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Median income&lt;/li&gt;
&lt;li&gt;House age&lt;/li&gt;
&lt;li&gt;Average rooms/bedrooms&lt;/li&gt;
&lt;li&gt;Population density&lt;/li&gt;
&lt;li&gt;Geographic coordinates&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This dataset is perfect for learning because it's:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Real-world data&lt;/li&gt;
&lt;li&gt;Clean and well-structured&lt;/li&gt;
&lt;li&gt;Sufficient size for training&lt;/li&gt;
&lt;li&gt;Interpretable features&lt;/li&gt;
&lt;/ul&gt;
&lt;h2&gt;
  
  
  Key Steps in My ML Pipeline
&lt;/h2&gt;
&lt;h3&gt;
  
  
  1. Exploratory Data Analysis (EDA)
&lt;/h3&gt;

&lt;p&gt;First, I dove deep into understanding the data:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="c1"&gt;# Check data distribution
&lt;/span&gt;&lt;span class="n"&gt;plt&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;figure&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;figsize&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="mi"&gt;15&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;10&lt;/span&gt;&lt;span class="p"&gt;))&lt;/span&gt;
&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;hist&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;bins&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="mi"&gt;30&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;alpha&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="mf"&gt;0.7&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;plt&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;suptitle&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;Feature Distributions&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;plt&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;show&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;

&lt;span class="c1"&gt;# Correlation analysis
&lt;/span&gt;&lt;span class="n"&gt;correlation_matrix&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;corr&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
&lt;span class="n"&gt;sns&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;heatmap&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;correlation_matrix&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;annot&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="bp"&gt;True&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;cmap&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;coolwarm&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Key Insights:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Median income has the strongest correlation with price (0.69)&lt;/li&gt;
&lt;li&gt;Location (latitude/longitude) significantly impacts pricing&lt;/li&gt;
&lt;li&gt;House age has a moderate negative correlation&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  2. Feature Engineering
&lt;/h3&gt;

&lt;p&gt;I created three new features to improve model performance:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="c1"&gt;# Engineer new features
&lt;/span&gt;&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;rooms_per_household&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;AveRooms&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt; &lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;AveOccup&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;
&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;population_per_household&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;Population&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt; &lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;HouseHolds&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;
&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;bedrooms_per_room&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;AveBedrms&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt; &lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;AveRooms&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;These engineered features provided better insights into housing quality and density.&lt;/p&gt;

&lt;h3&gt;
  
  
  3. Data Preprocessing
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="c1"&gt;# Handle outliers using IQR method
&lt;/span&gt;&lt;span class="n"&gt;Q1&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;price&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;].&lt;/span&gt;&lt;span class="nf"&gt;quantile&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="mf"&gt;0.25&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;Q3&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;price&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;].&lt;/span&gt;&lt;span class="nf"&gt;quantile&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="mf"&gt;0.75&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;IQR&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;Q3&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt; &lt;span class="n"&gt;Q1&lt;/span&gt;
&lt;span class="n"&gt;df_clean&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[(&lt;/span&gt;&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;price&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt; &lt;span class="o"&gt;&amp;gt;=&lt;/span&gt; &lt;span class="n"&gt;Q1&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt; &lt;span class="mf"&gt;1.5&lt;/span&gt;&lt;span class="o"&gt;*&lt;/span&gt;&lt;span class="n"&gt;IQR&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="o"&gt;&amp;amp;&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;price&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt; &lt;span class="o"&gt;&amp;lt;=&lt;/span&gt; &lt;span class="n"&gt;Q3&lt;/span&gt; &lt;span class="o"&gt;+&lt;/span&gt; &lt;span class="mf"&gt;1.5&lt;/span&gt;&lt;span class="o"&gt;*&lt;/span&gt;&lt;span class="n"&gt;IQR&lt;/span&gt;&lt;span class="p"&gt;)]&lt;/span&gt;

&lt;span class="c1"&gt;# Scale features
&lt;/span&gt;&lt;span class="n"&gt;scaler&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nc"&gt;StandardScaler&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
&lt;span class="n"&gt;X_scaled&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;scaler&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;fit_transform&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;X&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  4. Model Training &amp;amp; Evaluation
&lt;/h3&gt;

&lt;p&gt;I chose Linear Regression for interpretability:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;model&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nc"&gt;LinearRegression&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
&lt;span class="n"&gt;model&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;fit&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;X_train_scaled&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;y_train&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;

&lt;span class="c1"&gt;# Evaluate performance
&lt;/span&gt;&lt;span class="n"&gt;test_r2&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nf"&gt;r2_score&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;y_test&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;y_pred&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;test_rmse&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;np&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;sqrt&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nf"&gt;mean_squared_error&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;y_test&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;y_pred&lt;/span&gt;&lt;span class="p"&gt;))&lt;/span&gt;

&lt;span class="nf"&gt;print&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="sa"&gt;f&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;R² Score: &lt;/span&gt;&lt;span class="si"&gt;{&lt;/span&gt;&lt;span class="n"&gt;test_r2&lt;/span&gt;&lt;span class="si"&gt;:&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="mi"&gt;4&lt;/span&gt;&lt;span class="n"&gt;f&lt;/span&gt;&lt;span class="si"&gt;}&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="nf"&gt;print&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="sa"&gt;f&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;RMSE: $&lt;/span&gt;&lt;span class="si"&gt;{&lt;/span&gt;&lt;span class="n"&gt;test_rmse&lt;/span&gt;&lt;span class="o"&gt;*&lt;/span&gt;&lt;span class="mi"&gt;100&lt;/span&gt;&lt;span class="si"&gt;:&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="mi"&gt;0&lt;/span&gt;&lt;span class="n"&gt;f&lt;/span&gt;&lt;span class="si"&gt;}&lt;/span&gt;&lt;span class="s"&gt;k&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Model Performance:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;R² Score&lt;/strong&gt;: 0.60 (explains 60% of price variance)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;RMSE&lt;/strong&gt;: ~$68k&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;MAE&lt;/strong&gt;: ~$50k&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Building the Web Application
&lt;/h2&gt;

&lt;p&gt;The most exciting part was creating an interactive web app using Streamlit.&lt;/p&gt;

&lt;h3&gt;
  
  
  App Features:
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Interactive sliders&lt;/strong&gt; for all input features&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Real-time predictions&lt;/strong&gt; with instant results&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Visualization&lt;/strong&gt; of results and comparisons&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Feature importance&lt;/strong&gt; explanations&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Mobile-responsive&lt;/strong&gt; design&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Results &amp;amp; Insights
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Model Performance
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Successfully predicts house prices with &lt;strong&gt;60% accuracy&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Identifies &lt;strong&gt;median income&lt;/strong&gt; as the strongest price predictor&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Location factors&lt;/strong&gt; (lat/long) significantly impact pricing&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Engineered features&lt;/strong&gt; improved model performance by 5%&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Key Learnings
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Feature engineering&lt;/strong&gt; can significantly boost model performance&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Data visualization&lt;/strong&gt; is crucial for understanding patterns&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Model interpretability&lt;/strong&gt; is as important as accuracy&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;User experience&lt;/strong&gt; matters in ML applications&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  Future Improvements
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Advanced Algorithms&lt;/strong&gt;: Implement Random Forest, XGBoost&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Hyperparameter Tuning&lt;/strong&gt;: Use GridSearchCV for optimization&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Cross-Validation&lt;/strong&gt;: Implement k-fold cross-validation&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Real-time Data&lt;/strong&gt;: Integrate with real estate APIs&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Model Monitoring&lt;/strong&gt;: Add performance tracking&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Cloud Deployment&lt;/strong&gt;: Deploy on AWS/GCP for scalability&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  Lessons Learned
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Technical Lessons
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Data quality&lt;/strong&gt; is more important than model complexity&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Feature engineering&lt;/strong&gt; often beats algorithm selection&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Model interpretability&lt;/strong&gt; is crucial for business applications&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;User interface&lt;/strong&gt; design significantly impacts adoption&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Project Management
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Documentation&lt;/strong&gt; is essential for portfolio projects&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Version control&lt;/strong&gt; (Git) saves time and prevents disasters&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Modular code&lt;/strong&gt; makes debugging and improvements easier&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Testing&lt;/strong&gt; with sample data prevents deployment issues&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Impact on My Learning Journey
&lt;/h2&gt;

&lt;p&gt;This project has significantly enhanced my skills in:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;End-to-end ML pipeline&lt;/strong&gt; development&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Data preprocessing&lt;/strong&gt; and feature engineering&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Model evaluation&lt;/strong&gt; and interpretation&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Web application&lt;/strong&gt; development&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Project documentation&lt;/strong&gt; and presentation&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Version control&lt;/strong&gt; and collaboration&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;I'd love to hear your thoughts! Please:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Star&lt;/strong&gt; the GitHub repository if you find it useful&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Comment&lt;/strong&gt; with suggestions or questions&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Share&lt;/strong&gt; if you think others might benefit&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Connect with me:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.linkedin.com/in/anshika-lohan-570484273/" rel="noopener noreferrer"&gt;LinkedIn&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://github.com/anshikalohan" rel="noopener noreferrer"&gt;GitHub&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;&lt;strong&gt;What's your first ML project story? Share in the comments below! 👇&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;This post chronicles my journey building my first complete ML project. The code, data, and live demo are all available for you to explore, learn from, and build upon. Happy coding!&lt;/em&gt;&lt;/p&gt;

</description>
      <category>programming</category>
      <category>ai</category>
      <category>deeplearning</category>
      <category>beginners</category>
    </item>
    <item>
      <title>Unsupervised Learning Finally Makes Sense – My Journey Through ML Course 3</title>
      <dc:creator>Anshika</dc:creator>
      <pubDate>Sat, 05 Jul 2025 17:29:25 +0000</pubDate>
      <link>https://forem.com/anshikalohan/unsupervised-learning-finally-makes-sense-my-journey-through-ml-course-3-apo</link>
      <guid>https://forem.com/anshikalohan/unsupervised-learning-finally-makes-sense-my-journey-through-ml-course-3-apo</guid>
      <description>&lt;p&gt;Hey everyone!&lt;br&gt;&lt;br&gt;
I'm so happy to share that I’ve officially completed the &lt;strong&gt;entire Machine Learning Specialization&lt;/strong&gt; by &lt;strong&gt;Andrew Ng&lt;/strong&gt; on &lt;strong&gt;Coursera&lt;/strong&gt; — a journey that’s helped me build a solid foundation in both &lt;strong&gt;core ML theory&lt;/strong&gt; and &lt;strong&gt;hands-on application&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;This was the &lt;strong&gt;third and final course&lt;/strong&gt; in the series, titled:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;“Unsupervised Learning, Recommenders, Reinforcement Learning”&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
by DeepLearning.AI &amp;amp; Stanford University&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  What This Final Course Covered
&lt;/h2&gt;

&lt;p&gt;This last course introduced some really exciting and practical machine learning areas that go beyond supervised learning:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Unsupervised Learning&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;K-Means Clustering&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Anomaly Detection&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Principal Component Analysis (PCA)&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;

&lt;p&gt;&lt;strong&gt;Recommender Systems&lt;/strong&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Content-based filtering
&lt;/li&gt;
&lt;li&gt;Collaborative filtering with matrix factorization&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;

&lt;p&gt;&lt;strong&gt;Introduction to Reinforcement Learning&lt;/strong&gt; &lt;em&gt;(theoretical only)&lt;/em&gt;  &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What RL is and how it differs from supervised/unsupervised learning
&lt;/li&gt;
&lt;li&gt;High-level applications like robotics and game-playing agents&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;p&gt;Although reinforcement learning wasn’t covered in depth (no coding for it), it was a great introduction to the concept and its use cases.&lt;/p&gt;




&lt;h2&gt;
  
  
  Concepts That Stuck With Me
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Unsupervised learning&lt;/strong&gt; helps uncover hidden patterns in unlabeled data.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;K-Means Clustering&lt;/strong&gt; is simple but powerful for grouping similar data points — great for tasks like customer segmentation.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Anomaly Detection&lt;/strong&gt; is critical in areas like fraud detection and system health monitoring.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;PCA&lt;/strong&gt; helps reduce the dimensionality of high-dimensional datasets while preserving variance — useful for both visualization and performance.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Recommender Systems&lt;/strong&gt; use data cleverly to personalize experiences — I now have a better understanding of what powers platforms like Netflix and Spotify!&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Tools and Frameworks I Used
&lt;/h2&gt;

&lt;p&gt;Throughout the specialization, I worked with:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Python
&lt;/li&gt;
&lt;li&gt;NumPy, pandas, matplotlib
&lt;/li&gt;
&lt;li&gt;Jupyter Notebooks &amp;amp; Google Colab
&lt;/li&gt;
&lt;li&gt;Implemented algorithms from scratch to better understand the math&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Practice Highlights
&lt;/h2&gt;

&lt;p&gt;Some of the hands-on work included:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Visualizing gene expression data with PCA
&lt;/li&gt;
&lt;li&gt;Building a basic movie recommender system
&lt;/li&gt;
&lt;li&gt;Detecting anomalies in server and sensor data&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;All exercises were designed to feel like real-world applications — not just theory!&lt;/p&gt;




&lt;h2&gt;
  
  
  My ML Journey So Far
&lt;/h2&gt;

&lt;p&gt;This post marks the completion of my &lt;strong&gt;Machine Learning Specialization&lt;/strong&gt;:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="https://dev.to/anshikalohan/supervised-machine-learning-concepts-i-finally-understand-3d1"&gt;&lt;strong&gt;Supervised Machine Learning: Concepts I Finally Understand&lt;/strong&gt;&lt;/a&gt;&lt;br&gt;&lt;br&gt;
→ Linear/Logistic Regression, Loss functions, Evaluation Metrics&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;a href="https://dev.to/anshikalohan/advanced-learning-algorithms-concepts-that-finally-clicked-30a8"&gt;&lt;strong&gt;Advanced Learning Algorithms: Concepts That Finally Clicked&lt;/strong&gt;&lt;/a&gt;&lt;br&gt;&lt;br&gt;
→ Neural networks, forward/backward propagation, and building models from scratch&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;This post&lt;/strong&gt; — Unsupervised learning, recommendation systems, and a peek into reinforcement learning!&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;




&lt;h2&gt;
  
  
  What’s Next?
&lt;/h2&gt;

&lt;p&gt;Now that I’ve wrapped up this specialization, here’s what I plan to do next:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Build end-to-end ML projects combining supervised &amp;amp; unsupervised learning
&lt;/li&gt;
&lt;li&gt;Dive into &lt;strong&gt;Generative AI&lt;/strong&gt;, &lt;strong&gt;LLMs&lt;/strong&gt;, and &lt;strong&gt;NLP&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Compete in &lt;strong&gt;Kaggle challenges&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Continue sharing my learnings right here on Dev.to!&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;Thanks so much for following along with my ML journey&lt;br&gt;&lt;br&gt;
Let me know if you’re also learning ML or building something cool — I’d love to connect!&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Happy Learning!&lt;/strong&gt;&lt;/p&gt;

</description>
      <category>programming</category>
      <category>ai</category>
      <category>python</category>
      <category>learning</category>
    </item>
    <item>
      <title>Advanced Learning Algorithms: Concepts That Finally Clicked</title>
      <dc:creator>Anshika</dc:creator>
      <pubDate>Wed, 02 Jul 2025 11:36:15 +0000</pubDate>
      <link>https://forem.com/anshikalohan/advanced-learning-algorithms-concepts-that-finally-clicked-30a8</link>
      <guid>https://forem.com/anshikalohan/advanced-learning-algorithms-concepts-that-finally-clicked-30a8</guid>
      <description>&lt;p&gt;After writing about the basics of supervised machine learning, I went one step further and completed the second course in the &lt;a href="https://www.coursera.org/specializations/machine-learning-introduction" rel="noopener noreferrer"&gt;Machine Learning Specialization by Andrew Ng&lt;/a&gt;. This one was a game-changer — it covered the &lt;em&gt;why&lt;/em&gt; behind how machines learn, especially when things start to get nonlinear and complex.&lt;/p&gt;

&lt;p&gt;Here are the key concepts that &lt;em&gt;finally clicked&lt;/em&gt; for me&lt;/p&gt;




&lt;h2&gt;
  
  
  1. Regularization — Not Just a Buzzword
&lt;/h2&gt;

&lt;p&gt;I used to hear "regularization" everywhere, but I didn’t &lt;em&gt;really&lt;/em&gt; understand what it meant.&lt;/p&gt;

&lt;p&gt;Turns out, it’s like &lt;strong&gt;teaching your model not to overthink&lt;/strong&gt;. Too many weights = too much memorizing = poor generalization. L2 regularization (adding a penalty term) helps reduce those extreme weight values and keeps your model grounded.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Takeaway:&lt;/strong&gt; Regularization isn’t just a math trick — it’s essential for better generalization.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  2. Neural Networks — Finally Got the Intuition
&lt;/h2&gt;

&lt;p&gt;Neural networks always sounded intimidating. But once I saw how a simple neural net is just a bunch of logistic regressions stacked and activated, it clicked.&lt;/p&gt;

&lt;p&gt;I now understand:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;How each layer transforms data&lt;/li&gt;
&lt;li&gt;Why activation functions like ReLU or sigmoid matter&lt;/li&gt;
&lt;li&gt;What it means to &lt;em&gt;learn weights&lt;/em&gt; through backpropagation&lt;/li&gt;
&lt;/ul&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Takeaway:&lt;/strong&gt; Neural nets are just math + layering + learning — no magic, just logic.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  3. Backpropagation — The Learning Engine
&lt;/h2&gt;

&lt;p&gt;This was the hardest part at first. Chain rule? Gradients? But visualizing how errors move backward through layers to update weights made it clear.&lt;/p&gt;

&lt;p&gt;Now I know:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The loss tells us how wrong we were&lt;/li&gt;
&lt;li&gt;Gradients tell us how to fix it&lt;/li&gt;
&lt;li&gt;Backpropagation adjusts all layers efficiently&lt;/li&gt;
&lt;/ul&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Takeaway:&lt;/strong&gt; Backpropagation is how the network learns — by tweaking each layer's weights based on the output error.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  4. Deep vs. Shallow Models
&lt;/h2&gt;

&lt;p&gt;Shallow models (like logistic regression) work fine for simple data. But deeper networks capture &lt;em&gt;complex patterns&lt;/em&gt;, like images or sequences.&lt;/p&gt;

&lt;p&gt;I learned:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Why adding more layers lets us learn hierarchical features&lt;/li&gt;
&lt;li&gt;How depth adds power — but also complexity and risk of overfitting&lt;/li&gt;
&lt;/ul&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Takeaway:&lt;/strong&gt; Depth adds capability, but only if used wisely.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  5. TensorFlow — My First Real ML Framework
&lt;/h2&gt;

&lt;p&gt;This was my first time working with &lt;strong&gt;TensorFlow&lt;/strong&gt;, and it really helped bridge the gap between theory and code.&lt;/p&gt;

&lt;p&gt;Using &lt;code&gt;tf.keras&lt;/code&gt;, I could:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Build neural networks in just a few lines&lt;/li&gt;
&lt;li&gt;Train models and track accuracy/loss in real-time&lt;/li&gt;
&lt;li&gt;Understand how each concept from the course translates into actual working code&lt;/li&gt;
&lt;/ul&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Takeaway:&lt;/strong&gt; TensorFlow makes ML implementation accessible — and fun!&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  6. Model Tuning Matters (More Than I Thought)
&lt;/h2&gt;

&lt;p&gt;Before, I underestimated how important things like learning rate, initialization, and number of units were. Now I realize:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Poor weight initialization can kill training&lt;/li&gt;
&lt;li&gt;Learning rate can make or break convergence&lt;/li&gt;
&lt;li&gt;You need trial and error (and patience)&lt;/li&gt;
&lt;/ul&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Takeaway:&lt;/strong&gt; Tuning isn’t optional — it’s part of the craft.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  What’s Next?
&lt;/h2&gt;

&lt;p&gt;Up next, I’m diving into &lt;strong&gt;Unsupervised Learning, Recommenders, and Reinforcement Learning&lt;/strong&gt; — the third course in the specialization. I’m excited to explore clustering algorithms, anomaly detection, and even get a taste of how reinforcement learning works!&lt;/p&gt;

&lt;p&gt;And yes — I plan to keep building with TensorFlow too!&lt;br&gt;&lt;br&gt;
I’ll share my takeaways from that soon. Until then — happy learning.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;If you're just getting started with ML, feel free to check out my first post:&lt;/em&gt;&lt;br&gt;&lt;br&gt;
👉 &lt;a href="https://dev.to/anshika_1f2941065e4fa7a77/supervised-machine-learning-concepts-i-finally-understand-3d1"&gt;Supervised Machine Learning: Concepts I Finally Understand&lt;/a&gt;&lt;/p&gt;

</description>
      <category>machinelearning</category>
      <category>ai</category>
      <category>learning</category>
      <category>beginners</category>
    </item>
    <item>
      <title>Supervised Machine Learning: Concepts I Finally Understand</title>
      <dc:creator>Anshika</dc:creator>
      <pubDate>Fri, 27 Jun 2025 11:50:00 +0000</pubDate>
      <link>https://forem.com/anshikalohan/supervised-machine-learning-concepts-i-finally-understand-3d1</link>
      <guid>https://forem.com/anshikalohan/supervised-machine-learning-concepts-i-finally-understand-3d1</guid>
      <description>&lt;p&gt;Hi, I'm Anshika — a B.Tech student diving into the world of AI and Machine Learning.&lt;/p&gt;

&lt;p&gt;I just completed &lt;strong&gt;Andrew Ng’s Supervised Machine Learning course&lt;/strong&gt; on Coursera (the first in the ML Specialization by DeepLearning.AI), and I wanted to document my learnings, struggles, and next steps as I begin my ML journey.&lt;/p&gt;




&lt;h2&gt;
  
  
  What is Supervised Machine Learning?
&lt;/h2&gt;

&lt;p&gt;Supervised ML is about &lt;strong&gt;teaching machines using labeled data&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;You provide inputs (features) along with the correct outputs (labels), and the model learns to predict outputs for new, unseen inputs.&lt;/p&gt;

&lt;p&gt;There are two key types:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Regression&lt;/strong&gt; → Predict continuous values (e.g., house price, traffic speed)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Classification&lt;/strong&gt; → Predict categories (e.g., spam vs. not spam)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Key Concepts I Learned
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Linear Regression (with one and multiple variables)&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Gradient Descent&lt;/strong&gt; – how the model "learns"&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Cost Function (Mean Squared Error)&lt;/strong&gt; – measuring how wrong the model is&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Logistic Regression&lt;/strong&gt; – used for binary classification problems&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Overfitting vs. Underfitting&lt;/strong&gt; – finding the balance between simplicity and accuracy&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Regularization (L2)&lt;/strong&gt; – prevents the model from overfitting the training data&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Tools I Used
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Python&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;NumPy&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;Jupyter Notebook (for practice exercises)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  What Helped Me Understand Better
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Visualizing gradient descent and cost function graphs&lt;/li&gt;
&lt;li&gt;Coding linear regression &lt;strong&gt;from scratch&lt;/strong&gt; before using libraries&lt;/li&gt;
&lt;li&gt;Reading discussion forums whenever I got stuck&lt;/li&gt;
&lt;li&gt;Taking handwritten notes to simplify complex terms&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  What’s Next?
&lt;/h2&gt;

&lt;p&gt;Now that I’ve finished the Supervised Learning course, I plan to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Continue the specialization: &lt;strong&gt;Next up → Unsupervised Learning&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Apply regression to a real-world dataset (maybe traffic or energy!)&lt;/li&gt;
&lt;li&gt;Start writing beginner-friendly tutorials alongside learning&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;If you’re on a similar journey or just starting out — feel free to reach out! Let’s learn and build together.&lt;/p&gt;

&lt;p&gt;Thanks for reading! &lt;/p&gt;




&lt;p&gt;🔗 &lt;strong&gt;Connect with me:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;GitHub: &lt;a href="https://github.com/anshikalohan" rel="noopener noreferrer"&gt;https://github.com/anshikalohan&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;LinkedIn: &lt;a href="https://www.linkedin.com/in/anshika-lohan-570484273/" rel="noopener noreferrer"&gt;https://www.linkedin.com/in/anshika-lohan-570484273/&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>machinelearning</category>
      <category>python</category>
      <category>coursera</category>
      <category>ai</category>
    </item>
  </channel>
</rss>
