<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: jayanth anbu</title>
    <description>The latest articles on Forem by jayanth anbu (@jayanth_anbu_28041adad45b).</description>
    <link>https://forem.com/jayanth_anbu_28041adad45b</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3602182%2F8b9cf8dc-a09b-487b-be2c-553c5d7a88ad.jpg</url>
      <title>Forem: jayanth anbu</title>
      <link>https://forem.com/jayanth_anbu_28041adad45b</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/jayanth_anbu_28041adad45b"/>
    <language>en</language>
    <item>
      <title>Hands-on Data Cleaning Using Pandas in Google Colab</title>
      <dc:creator>jayanth anbu</dc:creator>
      <pubDate>Sat, 08 Nov 2025 06:22:17 +0000</pubDate>
      <link>https://forem.com/jayanth_anbu_28041adad45b/hands-on-data-cleaning-using-pandas-in-google-colab-4jno</link>
      <guid>https://forem.com/jayanth_anbu_28041adad45b/hands-on-data-cleaning-using-pandas-in-google-colab-4jno</guid>
      <description>&lt;p&gt;Data cleaning is one of the most crucial steps in any data science or analytics project. In this challenge, I worked on a real-world dataset from Kaggle with over 100,000 rows, performing various Pandas operations to clean, preprocess, and prepare it for further analysis.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;📂 Dataset Details&lt;/strong&gt;&lt;br&gt;
For this challenge, I selected the E-commerce Sales Dataset from Kaggle containing around 120,000 rows and 12 columns.&lt;/p&gt;

&lt;p&gt;It includes data such as:&lt;/p&gt;

&lt;p&gt;🧾 Order ID&lt;br&gt;
👤 Customer Name&lt;br&gt;
🛒 Product &amp;amp; Quantity&lt;br&gt;
💰 Sales &amp;amp; Discount&lt;br&gt;
🌍 Region&lt;br&gt;
📅 Order Date&lt;br&gt;
Before Cleaning:&lt;/p&gt;

&lt;p&gt;Rows → 120,000&lt;br&gt;
Columns → 12&lt;br&gt;
File format → .csv&lt;/p&gt;

&lt;p&gt;⚙️ Tools &amp;amp; Environment&lt;br&gt;
Python 3&lt;br&gt;
Google Colab&lt;br&gt;
Libraries: Pandas, NumPy, Matplotlib&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxa5v4b9m3vywi4f68ta1.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxa5v4b9m3vywi4f68ta1.png" alt=" " width="800" height="226"&gt;&lt;/a&gt;&lt;/p&gt;

</description>
      <category>datascience</category>
      <category>tutorial</category>
      <category>challenge</category>
      <category>python</category>
    </item>
    <item>
      <title>Exploring NoSQL Data Analysis: A Practical Study Using a Kaggle Dataset</title>
      <dc:creator>jayanth anbu</dc:creator>
      <pubDate>Sat, 08 Nov 2025 06:04:03 +0000</pubDate>
      <link>https://forem.com/jayanth_anbu_28041adad45b/exploring-nosql-data-analysis-a-practical-study-using-a-kaggle-dataset-1fgn</link>
      <guid>https://forem.com/jayanth_anbu_28041adad45b/exploring-nosql-data-analysis-a-practical-study-using-a-kaggle-dataset-1fgn</guid>
      <description>&lt;p&gt;&lt;strong&gt;🗂️ Step 1: Setting up MongoDB Atlas&lt;/strong&gt;&lt;br&gt;
Go to MongoDB Atlas.&lt;br&gt;
Create a free cluster (use the Shared Tier option).&lt;br&gt;
Under Network Access, add your IP:&lt;br&gt;
Click Network Access → Add IP Address → Allow access from anywhere (0.0.0.0/0).&lt;br&gt;
Create a database user and remember the credentials. Example:&lt;br&gt;
&lt;code&gt;&lt;br&gt;
Username: 22cs056&lt;br&gt;
Password: JAYANTH&lt;br&gt;
&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;Once your cluster is ready, click “Connect → Connect using MongoDB Shell” and copy the connection string.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F339lzb57ccg3ysshfi0h.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F339lzb57ccg3ysshfi0h.png" alt=" " width="800" height="618"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;💻 Step 2: Connect from Mongo Shell&lt;/strong&gt;&lt;br&gt;
Open PowerShell or Command Prompt, then run:&lt;/p&gt;

&lt;p&gt;&lt;code&gt;bash&lt;br&gt;
mongosh "mongodb+srv://m0.wpjmxqh.mongodb.net/" --apiVersion 1 --username 22cs098_db_user&lt;br&gt;
&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;Then enter your password when prompted:&lt;/p&gt;

&lt;p&gt;&lt;code&gt;&lt;br&gt;
Enter password: JAYANTH&lt;br&gt;
&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;If connection succeeds, you’ll see:&lt;/p&gt;

&lt;p&gt;&lt;code&gt;&lt;br&gt;
Atlas atlas-xxxx-shard-0 [primary]&amp;gt;&lt;br&gt;
&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fz2aq0zqtdrjnszk2c3w8.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fz2aq0zqtdrjnszk2c3w8.png" alt=" " width="800" height="431"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;📥 Step 3: Create a Database and Insert Records&lt;/strong&gt;&lt;br&gt;
Switch to a database (it will auto-create):&lt;/p&gt;

&lt;p&gt;&lt;code&gt;javascript&lt;br&gt;
use businessDB&lt;br&gt;
&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;Insert 10 sample business review records:&lt;/p&gt;

&lt;p&gt;&lt;code&gt;javascript&lt;br&gt;
db.reviews.insertMany([&lt;br&gt;
{ "business_id": "B001", "name": "Cafe Aroma", "rating": 4.6, "review": "Good food and fast service!", "date": "2025-11-07" },&lt;br&gt;
{ "business_id": "B002", "name": "Pizza Palace", "rating": 4.8, "review": "Amazing crust and cheese quality!", "date": "2025-11-07" },&lt;br&gt;
{ "business_id": "B003", "name": "Tea Time", "rating": 4.2, "review": "Nice ambience and friendly staff.", "date": "2025-11-07" },&lt;br&gt;
{ "business_id": "B004", "name": "Sweet Treats", "rating": 3.9, "review": "Desserts were good but service was slow.", "date": "2025-11-07" },&lt;br&gt;
{ "business_id": "B005", "name": "Veggie Delight", "rating": 4.1, "review": "Healthy food with good taste.", "date": "2025-11-07" },&lt;br&gt;
{ "business_id": "B006", "name": "Burger Hub", "rating": 4.9, "review": "Best burgers ever!", "date": "2025-11-07" },&lt;br&gt;
{ "business_id": "B007", "name": "Ocean Dine", "rating": 4.7, "review": "Fresh seafood and great view.", "date": "2025-11-07" },&lt;br&gt;
{ "business_id": "B008", "name": "Spice Route", "rating": 3.8, "review": "Food was okay, but spicy.", "date": "2025-11-07" },&lt;br&gt;
{ "business_id": "B009", "name": "Bakers Street", "rating": 4.5, "review": "Good pastries and coffee.", "date": "2025-11-07" },&lt;br&gt;
{ "business_id": "B010", "name": "Quick Bite", "rating": 4.0, "review": "Good service and clean place.", "date": "2025-11-07" }&lt;br&gt;
])&lt;br&gt;
&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ff2z9e15nin99u5arrowz.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ff2z9e15nin99u5arrowz.png" alt=" " width="800" height="428"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;🔍 Step 4: Queries&lt;/strong&gt;&lt;br&gt;
🏆 4.1 Top 5 Businesses by Rating&lt;br&gt;
&lt;code&gt;javascript&lt;br&gt;
db.reviews.find().sort({ rating: -1 }).limit(5)&lt;br&gt;
&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;🔤 4.2 Count of Reviews Containing “good”&lt;br&gt;
&lt;code&gt;javascript&lt;br&gt;
db.reviews.countDocuments({ review: /good/i })&lt;br&gt;
&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;🏪 4.3 Get Reviews for a Specific Business ID&lt;br&gt;
&lt;code&gt;javascript&lt;br&gt;
db.reviews.find({ business_id: "B005" })&lt;br&gt;
&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fql4dp69c6of2z4zx9inp.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fql4dp69c6of2z4zx9inp.png" alt=" " width="800" height="416"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;✏️ Step 5: Update and Delete&lt;/strong&gt;&lt;br&gt;
✏️ Update a Review&lt;br&gt;
&lt;code&gt;javascript&lt;br&gt;
db.reviews.updateOne(&lt;br&gt;
{ business_id: "B005" },&lt;br&gt;
{ $set: { rating: 4.3, review: "Updated: Great taste and fresh ingredients!" } }&lt;br&gt;
)&lt;br&gt;
&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;🗑️ Delete a Record&lt;br&gt;
&lt;code&gt;javascript&lt;br&gt;
db.reviews.deleteOne({ business_id: "B010" })&lt;br&gt;
&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F51dioepei1qm2vsifz9y.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F51dioepei1qm2vsifz9y.png" alt=" " width="800" height="142"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;📤 Step 6: Export Data to JSON/CSV&lt;/strong&gt;&lt;br&gt;
Exit Mongo shell:&lt;/p&gt;

&lt;p&gt;&lt;code&gt;bash&lt;br&gt;
exit&lt;br&gt;
&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;Then run the following from PowerShell (not inside mongosh) 👇&lt;/p&gt;

&lt;p&gt;📄 Export as CSV&lt;br&gt;
&lt;code&gt;bash&lt;br&gt;
mongoexport --uri="mongodb+srv://22cs098_db_user:NAVEEN@m0.wpjmxqh.mongodb.net/businessDB" --collection=reviews --type=csv --fields=business_id,name,rating,review,date --out=reviews.csv&lt;br&gt;
&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;📦 Export as JSON&lt;br&gt;
&lt;code&gt;bash&lt;br&gt;
mongoexport --uri="mongodb+srv://22cs098_db_user:NAVEEN@m0.wpjmxqh.mongodb.net/businessDB" --collection=reviews --out=reviews.json&lt;br&gt;
&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;📊 Step 7: View the Exported Files&lt;/strong&gt;&lt;br&gt;
Open reviews.csv in Excel or VS Code.&lt;br&gt;
Open reviews.json in any text editor.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fnon83zqja31wxbq8yey2.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fnon83zqja31wxbq8yey2.png" alt=" " width="800" height="419"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;✅ Step 8: Summary&lt;/strong&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Operation&lt;/th&gt;
&lt;th&gt;Command&lt;/th&gt;
&lt;th&gt;Example&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Insert&lt;/td&gt;
&lt;td&gt;insertMany()&lt;/td&gt;
&lt;td&gt;Add 10 review records&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Query&lt;/td&gt;
&lt;td&gt;find(), countDocuments()&lt;/td&gt;
&lt;td&gt;Retrieve data&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Update&lt;/td&gt;
&lt;td&gt;updateOne()&lt;/td&gt;
&lt;td&gt;Modify fields&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Delete&lt;/td&gt;
&lt;td&gt;deleteOne()&lt;/td&gt;
&lt;td&gt;Remove a record&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Export&lt;/td&gt;
&lt;td&gt;mongoexport&lt;/td&gt;
&lt;td&gt;Generate CSV/JSON files&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;🎯 Final Thoughts&lt;/strong&gt;&lt;br&gt;
MongoDB Atlas makes it easy to:&lt;/p&gt;

&lt;p&gt;Manage cloud-hosted databases&lt;br&gt;
Perform CRUD operations&lt;br&gt;
Export results in multiple formats&lt;br&gt;
This project demonstrates all essential MongoDB operations — perfect for Data Engineering and Database Management learning tasks.&lt;/p&gt;

</description>
      <category>datascience</category>
      <category>tutorial</category>
      <category>mongodb</category>
      <category>database</category>
    </item>
  </channel>
</rss>
