<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Audrine Marion</title>
    <description>The latest articles on Forem by Audrine Marion (@audrine_m).</description>
    <link>https://forem.com/audrine_m</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3818409%2F9fb58cc0-5333-4494-84ef-e77061565cba.jpg</url>
      <title>Forem: Audrine Marion</title>
      <link>https://forem.com/audrine_m</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/audrine_m"/>
    <language>en</language>
    <item>
      <title>Python for Data Analytics: A Beginner’s Complete Guide</title>
      <dc:creator>Audrine Marion</dc:creator>
      <pubDate>Sun, 10 May 2026 11:43:32 +0000</pubDate>
      <link>https://forem.com/audrine_m/python-for-data-analytics-a-beginners-complete-guide-4547</link>
      <guid>https://forem.com/audrine_m/python-for-data-analytics-a-beginners-complete-guide-4547</guid>
      <description>&lt;p&gt;In today’s digital world, data has become one of the most valuable resources. Every time we shop online, use social media, stream music, make a mobile payment, or search the internet, data is being created. Businesses and organizations use this information to understand customers, improve services, increase profits, and make smarter decisions.&lt;/p&gt;

&lt;p&gt;However, raw data alone is not useful unless it can be analyzed properly. This is where data analytics comes in. Data analytics helps transform raw information into meaningful insights that organizations can use to solve problems and make informed decisions.&lt;/p&gt;

&lt;p&gt;Among all the tools used in data analytics, Python has become one of the most popular and powerful programming languages in the world. From beginners learning their first programming language to professional data scientists working at major companies, Python is now considered an essential skill in the data industry.&lt;/p&gt;

&lt;p&gt;When I first started learning about data analytics, Python seemed intimidating. I assumed programming was only for software engineers and highly technical professionals. However, after exploring small projects and learning the basics, I realized why so many analysts rely on Python every day. It simplifies complex tasks, automates repetitive work, and makes analyzing data far more efficient.&lt;/p&gt;

&lt;p&gt;This article explains what Python is, why it is widely used in data analytics, the major Python libraries analysts use, how Python helps clean and visualize data, and why beginners should strongly consider learning it.&lt;/p&gt;

&lt;h2&gt;
  
  
  What Is Python?
&lt;/h2&gt;

&lt;p&gt;Python is a high-level programming language created by Guido van Rossum and officially released in 1991. It was designed with simplicity and readability in mind, making it easier to understand compared to many older programming languages.&lt;/p&gt;

&lt;p&gt;Unlike some languages that use complicated syntax and lengthy code structures, Python focuses on writing clean and readable code. This makes it especially beginner-friendly.&lt;/p&gt;

&lt;p&gt;For example, a simple Python program can look like this:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="nf"&gt;print&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;Hello, World!&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;That single line prints text to the screen.&lt;/p&gt;

&lt;p&gt;One reason Python became so popular is because it allows people to focus on solving problems instead of struggling with difficult syntax rules. Whether someone wants to build websites, automate tasks, develop artificial intelligence systems, or analyze data, Python provides a flexible and accessible starting point.&lt;/p&gt;

&lt;p&gt;Python is also open-source, meaning anyone can use it for free. It works across major operating systems including Windows, Linux, and macOS.&lt;/p&gt;

&lt;p&gt;Today, Python is used in many areas such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Web development&lt;/li&gt;
&lt;li&gt;Cybersecurity&lt;/li&gt;
&lt;li&gt;Automation&lt;/li&gt;
&lt;li&gt;Artificial intelligence&lt;/li&gt;
&lt;li&gt;Machine learning&lt;/li&gt;
&lt;li&gt;Data analytics&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Among these areas, data analytics has become one of Python’s strongest and fastest-growing fields.&lt;/p&gt;

&lt;h2&gt;
  
  
  Understanding Data Analytics
&lt;/h2&gt;

&lt;p&gt;Before exploring Python’s role in analytics, it is important to understand what data analytics actually means.&lt;/p&gt;

&lt;p&gt;Data analytics refers to the process of examining data to discover patterns, trends, relationships, and useful insights. Organizations use analytics to make better decisions and improve performance.&lt;/p&gt;

&lt;p&gt;For example, a company may analyze customer purchasing data to answer questions like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Which products sell the most?&lt;/li&gt;
&lt;li&gt;Which customers are likely to stop buying?&lt;/li&gt;
&lt;li&gt;Which marketing campaigns perform best?&lt;/li&gt;
&lt;li&gt;What sales trends appear during different seasons?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Data analytics usually involves several steps:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Collecting data&lt;/li&gt;
&lt;li&gt;Cleaning and organizing data&lt;/li&gt;
&lt;li&gt;Exploring data&lt;/li&gt;
&lt;li&gt;Analyzing patterns&lt;/li&gt;
&lt;li&gt;Visualizing results&lt;/li&gt;
&lt;li&gt;Reporting insights&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Python can support all these stages efficiently, which is one of the reasons it has become so valuable in the analytics industry.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Python Is So Popular in Data Analytics
&lt;/h2&gt;

&lt;p&gt;Python’s popularity in data analytics continues to grow rapidly. Several factors contribute to this success.&lt;/p&gt;

&lt;h3&gt;
  
  
  1. Python Is Beginner-Friendly
&lt;/h3&gt;

&lt;p&gt;One of Python’s greatest strengths is its simplicity.&lt;/p&gt;

&lt;p&gt;Compared to programming languages such as Java or C++, Python uses cleaner and more readable syntax. Beginners can often understand Python code even with limited programming experience.&lt;/p&gt;

&lt;p&gt;For example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;numbers&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="mi"&gt;10&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;20&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;30&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;40&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;
&lt;span class="n"&gt;average&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nf"&gt;sum&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;numbers&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt; &lt;span class="nf"&gt;len&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;numbers&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;

&lt;span class="nf"&gt;print&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;average&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Even someone new to coding can understand that this program calculates an average.&lt;/p&gt;

&lt;p&gt;Because of its simplicity, many universities and online learning platforms now teach Python as a first programming language.&lt;/p&gt;

&lt;h3&gt;
  
  
  2. Python Has a Huge Community
&lt;/h3&gt;

&lt;p&gt;Python has millions of users worldwide. This large community makes learning easier because beginners can quickly find tutorials, videos, forums, and documentation online.&lt;/p&gt;

&lt;p&gt;If a learner encounters an error or problem, chances are high that someone else has already solved it and shared the solution online.&lt;/p&gt;

&lt;p&gt;Popular learning platforms include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.python.org?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;Python Official Website&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.kaggle.com?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;Kaggle&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://www.freecodecamp.org?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;freeCodeCamp&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://stackoverflow.com?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;Stack Overflow&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This strong community support makes Python easier to learn compared to many technical tools.&lt;/p&gt;

&lt;h3&gt;
  
  
  3. Python Has Powerful Data Libraries
&lt;/h3&gt;

&lt;p&gt;One major reason analysts love Python is its collection of specialized libraries.&lt;/p&gt;

&lt;p&gt;Libraries are pre-written collections of code that help programmers complete tasks more efficiently. Instead of writing everything from scratch, analysts can use these libraries to process and analyze data quickly.&lt;/p&gt;

&lt;p&gt;Some important Python libraries include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Pandas&lt;/li&gt;
&lt;li&gt;NumPy&lt;/li&gt;
&lt;li&gt;Matplotlib&lt;/li&gt;
&lt;li&gt;Seaborn&lt;/li&gt;
&lt;li&gt;Scikit-learn&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These libraries have transformed Python into one of the most powerful analytics tools available today.&lt;/p&gt;

&lt;h3&gt;
  
  
  4. Python Saves Time Through Automation
&lt;/h3&gt;

&lt;p&gt;Many data tasks are repetitive and time-consuming.&lt;/p&gt;

&lt;p&gt;For example, analysts may need to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Clean datasets daily&lt;/li&gt;
&lt;li&gt;Generate weekly reports&lt;/li&gt;
&lt;li&gt;Update dashboards&lt;/li&gt;
&lt;li&gt;Organize spreadsheets&lt;/li&gt;
&lt;li&gt;Combine multiple files&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Doing these tasks manually can waste hours.&lt;/p&gt;

&lt;p&gt;Python helps automate these processes using scripts, reducing human error and improving efficiency.&lt;/p&gt;

&lt;p&gt;Automation is one reason companies increasingly prefer analysts with Python skills.&lt;/p&gt;

&lt;h3&gt;
  
  
  5. Python Is Used Across Industries
&lt;/h3&gt;

&lt;p&gt;Python is not limited to one field.&lt;/p&gt;

&lt;p&gt;Industries using Python for analytics include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Finance&lt;/li&gt;
&lt;li&gt;Healthcare&lt;/li&gt;
&lt;li&gt;E-commerce&lt;/li&gt;
&lt;li&gt;Marketing&lt;/li&gt;
&lt;li&gt;Transportation&lt;/li&gt;
&lt;li&gt;Telecommunications&lt;/li&gt;
&lt;li&gt;Education&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Because Python is widely applicable, learning it opens opportunities across many career paths.&lt;/p&gt;

&lt;h2&gt;
  
  
  Important Python Libraries for Data Analytics
&lt;/h2&gt;

&lt;p&gt;Python’s true strength in analytics comes from its libraries.&lt;/p&gt;

&lt;h3&gt;
  
  
  Pandas
&lt;/h3&gt;

&lt;p&gt;Pandas is one of the most important Python libraries for data analytics.&lt;/p&gt;

&lt;p&gt;It allows analysts to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Read CSV and Excel files&lt;/li&gt;
&lt;li&gt;Clean messy data&lt;/li&gt;
&lt;li&gt;Filter rows and columns&lt;/li&gt;
&lt;li&gt;Merge datasets&lt;/li&gt;
&lt;li&gt;Calculate statistics&lt;/li&gt;
&lt;li&gt;Organize information into tables&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Pandas uses structures called DataFrames, which resemble Excel spreadsheets.&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;python&lt;/span&gt; &lt;span class="nb"&gt;id&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;p7t9q1&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="n"&gt;pandas&lt;/span&gt; &lt;span class="k"&gt;as&lt;/span&gt; &lt;span class="n"&gt;pd&lt;/span&gt;

&lt;span class="n"&gt;data&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;
    &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;Name&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;Alice&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;Brian&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;Cynthia&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;],&lt;/span&gt;
    &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;Age&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="mi"&gt;24&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;29&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;31&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;
&lt;span class="p"&gt;}&lt;/span&gt;

&lt;span class="n"&gt;df&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;pd&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nc"&gt;DataFrame&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;data&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;

&lt;span class="nf"&gt;print&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This creates a simple table of names and ages.&lt;/p&gt;

&lt;p&gt;Pandas is essential for almost every data analytics project.&lt;/p&gt;

&lt;h3&gt;
  
  
  NumPy
&lt;/h3&gt;

&lt;p&gt;NumPy focuses on numerical operations and mathematical calculations.&lt;/p&gt;

&lt;p&gt;It is especially useful when handling large amounts of numerical data efficiently.&lt;/p&gt;

&lt;p&gt;NumPy helps analysts:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Perform calculations&lt;/li&gt;
&lt;li&gt;Work with arrays&lt;/li&gt;
&lt;li&gt;Calculate statistics&lt;/li&gt;
&lt;li&gt;Handle mathematical functions&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;python&lt;/span&gt; &lt;span class="nb"&gt;id&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;f4w2r8&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="n"&gt;numpy&lt;/span&gt; &lt;span class="k"&gt;as&lt;/span&gt; &lt;span class="n"&gt;np&lt;/span&gt;

&lt;span class="n"&gt;numbers&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;np&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;array&lt;/span&gt;&lt;span class="p"&gt;([&lt;/span&gt;&lt;span class="mi"&gt;5&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;10&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;15&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;20&lt;/span&gt;&lt;span class="p"&gt;])&lt;/span&gt;

&lt;span class="nf"&gt;print&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;numbers&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;mean&lt;/span&gt;&lt;span class="p"&gt;())&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This calculates the average of the numbers.&lt;/p&gt;

&lt;h3&gt;
  
  
  Matplotlib
&lt;/h3&gt;

&lt;p&gt;Matplotlib is one of the most widely used visualization libraries in Python.&lt;/p&gt;

&lt;p&gt;Data visualization helps convert raw data into understandable charts and graphs.&lt;/p&gt;

&lt;p&gt;Matplotlib can create:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Line charts&lt;/li&gt;
&lt;li&gt;Bar charts&lt;/li&gt;
&lt;li&gt;Pie charts&lt;/li&gt;
&lt;li&gt;Histograms&lt;/li&gt;
&lt;li&gt;Scatter plots&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;python&lt;/span&gt; &lt;span class="nb"&gt;id&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;l1m9s7&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="n"&gt;matplotlib.pyplot&lt;/span&gt; &lt;span class="k"&gt;as&lt;/span&gt; &lt;span class="n"&gt;plt&lt;/span&gt;

&lt;span class="n"&gt;x&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="mi"&gt;1&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;3&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;4&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;
&lt;span class="n"&gt;y&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="mi"&gt;10&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;15&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;25&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;40&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;

&lt;span class="n"&gt;plt&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;plot&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;x&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;y&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;plt&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;show&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This generates a simple line graph.&lt;/p&gt;

&lt;p&gt;Visualizations make it easier for businesses and decision-makers to understand data trends.&lt;/p&gt;

&lt;h3&gt;
  
  
  Seaborn
&lt;/h3&gt;

&lt;p&gt;Seaborn is built on top of Matplotlib and provides more advanced and visually appealing charts.&lt;/p&gt;

&lt;p&gt;Analysts use Seaborn for:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Correlation heatmaps&lt;/li&gt;
&lt;li&gt;Statistical charts&lt;/li&gt;
&lt;li&gt;Distribution plots&lt;/li&gt;
&lt;li&gt;Comparative visualizations&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Seaborn is popular because it creates professional-looking visuals with relatively little code.&lt;/p&gt;

&lt;h3&gt;
  
  
  Scikit-learn
&lt;/h3&gt;

&lt;p&gt;Scikit-learn is widely used for machine learning and predictive analytics.&lt;/p&gt;

&lt;p&gt;Although machine learning is more advanced than basic analytics, many analysts eventually use Scikit-learn for predictive tasks such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Customer churn prediction&lt;/li&gt;
&lt;li&gt;Fraud detection&lt;/li&gt;
&lt;li&gt;Sales forecasting&lt;/li&gt;
&lt;li&gt;Recommendation systems&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Scikit-learn simplifies complex machine learning processes for developers and analysts.&lt;/p&gt;

&lt;h2&gt;
  
  
  How Python Is Used in Data Analytics
&lt;/h2&gt;

&lt;p&gt;Python supports analysts throughout the entire analytics workflow.&lt;/p&gt;

&lt;h3&gt;
  
  
  Data Collection
&lt;/h3&gt;

&lt;p&gt;The first step in analytics is gathering data.&lt;/p&gt;

&lt;p&gt;Python can collect data from:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;CSV files&lt;/li&gt;
&lt;li&gt;Excel spreadsheets&lt;/li&gt;
&lt;li&gt;Databases&lt;/li&gt;
&lt;li&gt;APIs&lt;/li&gt;
&lt;li&gt;Websites&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Analysts often use Python for web scraping, which involves extracting information from websites automatically.&lt;/p&gt;

&lt;p&gt;Libraries such as BeautifulSoup and Requests help collect online data efficiently.&lt;/p&gt;

&lt;p&gt;For example, companies may scrape product prices, customer reviews, or market information for analysis.&lt;/p&gt;

&lt;h3&gt;
  
  
  Data Cleaning
&lt;/h3&gt;

&lt;p&gt;Real-world data is rarely perfect.&lt;/p&gt;

&lt;p&gt;Datasets often contain:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Missing values&lt;/li&gt;
&lt;li&gt;Duplicate records&lt;/li&gt;
&lt;li&gt;Formatting problems&lt;/li&gt;
&lt;li&gt;Incorrect entries&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Poor-quality data can produce misleading insights.&lt;/p&gt;

&lt;p&gt;Python makes cleaning easier using Pandas.&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;python&lt;/span&gt; &lt;span class="nb"&gt;id&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;n6r3b5&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;
&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;dropna&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This removes rows with missing values.&lt;/p&gt;

&lt;p&gt;Another example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;python&lt;/span&gt; &lt;span class="nb"&gt;id&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;u8w2y4&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;
&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;drop_duplicates&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This removes duplicate records.&lt;/p&gt;

&lt;p&gt;Python helps automate cleaning tasks that would take much longer in spreadsheet software.&lt;/p&gt;

&lt;h3&gt;
  
  
  Data Exploration
&lt;/h3&gt;

&lt;p&gt;Once data is cleaned, analysts explore it to understand patterns and trends.&lt;/p&gt;

&lt;p&gt;This stage includes:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Finding averages&lt;/li&gt;
&lt;li&gt;Measuring relationships&lt;/li&gt;
&lt;li&gt;Identifying unusual values&lt;/li&gt;
&lt;li&gt;Understanding distributions&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Python helps generate quick summaries.&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;python&lt;/span&gt; &lt;span class="nb"&gt;id&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;h5t2e1&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;
&lt;span class="n"&gt;df&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;describe&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This produces statistics such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Mean&lt;/li&gt;
&lt;li&gt;Minimum value&lt;/li&gt;
&lt;li&gt;Maximum value&lt;/li&gt;
&lt;li&gt;Standard deviation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Exploring data helps analysts understand what the dataset contains before performing deeper analysis.&lt;/p&gt;

&lt;h3&gt;
  
  
  Data Visualization
&lt;/h3&gt;

&lt;p&gt;One of the most important parts of analytics is visualization.&lt;/p&gt;

&lt;p&gt;Humans understand visuals faster than raw numbers. Charts and graphs simplify complex information.&lt;/p&gt;

&lt;p&gt;Python allows analysts to create dashboards and visuals showing:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Sales performance&lt;/li&gt;
&lt;li&gt;Customer behavior&lt;/li&gt;
&lt;li&gt;Market trends&lt;/li&gt;
&lt;li&gt;Financial reports&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;For example, a company may use graphs to identify which products perform best during certain months.&lt;/p&gt;

&lt;p&gt;Good visualizations improve communication and support better business decisions.&lt;/p&gt;

&lt;h3&gt;
  
  
  Predictive Analytics and Machine Learning
&lt;/h3&gt;

&lt;p&gt;Modern analytics increasingly focuses on predicting future outcomes instead of only analyzing past data.&lt;/p&gt;

&lt;p&gt;Python supports predictive analytics using machine learning.&lt;/p&gt;

&lt;p&gt;Organizations use Python to predict:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Customer behavior&lt;/li&gt;
&lt;li&gt;Product demand&lt;/li&gt;
&lt;li&gt;Stock trends&lt;/li&gt;
&lt;li&gt;Equipment failures&lt;/li&gt;
&lt;li&gt;Fraudulent transactions&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This allows businesses to make proactive decisions instead of reacting after problems occur.&lt;/p&gt;

&lt;h2&gt;
  
  
  Real-World Examples of Python in Data Analytics
&lt;/h2&gt;

&lt;p&gt;Python is used globally across multiple industries.&lt;/p&gt;

&lt;h3&gt;
  
  
  Healthcare Analytics
&lt;/h3&gt;

&lt;p&gt;Hospitals and healthcare organizations analyze patient data using Python.&lt;/p&gt;

&lt;p&gt;Analytics helps healthcare professionals:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Predict disease risks&lt;/li&gt;
&lt;li&gt;Monitor patient trends&lt;/li&gt;
&lt;li&gt;Improve hospital efficiency&lt;/li&gt;
&lt;li&gt;Analyze treatment effectiveness&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Python played an important role in analyzing health data during global disease outbreaks.&lt;/p&gt;

&lt;h3&gt;
  
  
  Finance and Banking
&lt;/h3&gt;

&lt;p&gt;Banks generate massive amounts of financial data daily.&lt;/p&gt;

&lt;p&gt;Python is widely used for:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Fraud detection&lt;/li&gt;
&lt;li&gt;Risk assessment&lt;/li&gt;
&lt;li&gt;Credit scoring&lt;/li&gt;
&lt;li&gt;Investment analysis&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Financial analysts use Python because it processes large datasets efficiently.&lt;/p&gt;

&lt;h3&gt;
  
  
  E-Commerce
&lt;/h3&gt;

&lt;p&gt;Online shopping platforms rely heavily on analytics.&lt;/p&gt;

&lt;p&gt;Python helps businesses analyze:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Customer purchasing behavior&lt;/li&gt;
&lt;li&gt;Product recommendations&lt;/li&gt;
&lt;li&gt;Website performance&lt;/li&gt;
&lt;li&gt;Sales trends&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Recommendation systems used by streaming and shopping platforms are often powered by Python.&lt;/p&gt;

&lt;h3&gt;
  
  
  Marketing Analytics
&lt;/h3&gt;

&lt;p&gt;Marketing teams use Python to evaluate campaign performance.&lt;/p&gt;

&lt;p&gt;Analytics helps companies understand:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Audience engagement&lt;/li&gt;
&lt;li&gt;Advertisement performance&lt;/li&gt;
&lt;li&gt;Customer interests&lt;/li&gt;
&lt;li&gt;Social media trends&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This helps businesses improve their marketing strategies.&lt;/p&gt;

&lt;h3&gt;
  
  
  Transportation and Logistics
&lt;/h3&gt;

&lt;p&gt;Transportation companies use Python for:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Route optimization&lt;/li&gt;
&lt;li&gt;Demand forecasting&lt;/li&gt;
&lt;li&gt;Delivery tracking&lt;/li&gt;
&lt;li&gt;Traffic analysis&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Analytics improves efficiency and reduces operational costs.&lt;/p&gt;

&lt;h2&gt;
  
  
  Common Mistakes Beginners Make When Learning Python
&lt;/h2&gt;

&lt;p&gt;Many beginners struggle because they approach Python the wrong way.&lt;/p&gt;

&lt;p&gt;Some common mistakes include:&lt;/p&gt;

&lt;h3&gt;
  
  
  Trying to Learn Everything at Once
&lt;/h3&gt;

&lt;p&gt;Python is a large language with many areas. Beginners should focus on fundamentals first instead of rushing into advanced topics.&lt;/p&gt;

&lt;h3&gt;
  
  
  Watching Tutorials Without Practicing
&lt;/h3&gt;

&lt;p&gt;Watching videos alone is not enough.&lt;/p&gt;

&lt;p&gt;The best way to learn Python is by writing code regularly and building small projects.&lt;/p&gt;

&lt;h3&gt;
  
  
  Ignoring Real Datasets
&lt;/h3&gt;

&lt;p&gt;Real learning happens when working with messy, real-world data.&lt;/p&gt;

&lt;p&gt;Beginners should practice using datasets from:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://www.kaggle.com/datasets?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;Kaggle Datasets&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://archive.ics.uci.edu/ml/index.php?utm_source=chatgpt.com" rel="noopener noreferrer"&gt;UCI Machine Learning Repository&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Focusing Too Much on Syntax
&lt;/h3&gt;

&lt;p&gt;Many beginners try to memorize every command.&lt;/p&gt;

&lt;p&gt;Instead, they should focus on problem-solving and understanding logic.&lt;/p&gt;

&lt;p&gt;Even experienced developers frequently search online for syntax help.&lt;/p&gt;

&lt;h2&gt;
  
  
  Beginner Python Analytics Project Idea
&lt;/h2&gt;

&lt;p&gt;One of the best ways to improve is through projects.&lt;/p&gt;

&lt;p&gt;A simple beginner project could involve analyzing sales data.&lt;/p&gt;

&lt;p&gt;Steps might include:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Import a CSV file using Pandas&lt;/li&gt;
&lt;li&gt;Clean missing values&lt;/li&gt;
&lt;li&gt;Calculate total sales&lt;/li&gt;
&lt;li&gt;Identify top-selling products&lt;/li&gt;
&lt;li&gt;Create charts showing trends&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This type of project helps beginners practice real analytics workflows.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Beginners Should Learn Python
&lt;/h2&gt;

&lt;p&gt;Learning Python offers many long-term benefits.&lt;/p&gt;

&lt;h3&gt;
  
  
  Strong Career Opportunities
&lt;/h3&gt;

&lt;p&gt;Many employers actively search for candidates with Python skills.&lt;/p&gt;

&lt;p&gt;Python appears frequently in job descriptions for:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Data Analysts&lt;/li&gt;
&lt;li&gt;Data Scientists&lt;/li&gt;
&lt;li&gt;Business Analysts&lt;/li&gt;
&lt;li&gt;Machine Learning Engineers&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;As data continues growing globally, demand for Python skills is expected to remain strong.&lt;/p&gt;

&lt;h3&gt;
  
  
  Easy Entry Into Tech
&lt;/h3&gt;

&lt;p&gt;Python provides a relatively accessible entry point into the technology industry.&lt;/p&gt;

&lt;p&gt;Because it is beginner-friendly, learners can start building useful projects quickly.&lt;/p&gt;

&lt;h3&gt;
  
  
  Gateway to Advanced Technologies
&lt;/h3&gt;

&lt;p&gt;Python is heavily used in:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Artificial intelligence&lt;/li&gt;
&lt;li&gt;Machine learning&lt;/li&gt;
&lt;li&gt;Automation&lt;/li&gt;
&lt;li&gt;Deep learning&lt;/li&gt;
&lt;li&gt;Data science&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Learning Python opens doors to multiple technical career paths.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Future of Python in Data Analytics
&lt;/h2&gt;

&lt;p&gt;Python continues to grow rapidly in popularity.&lt;/p&gt;

&lt;p&gt;As organizations generate more data, the demand for analytical tools increases. Python remains central to this growth because of its flexibility, large community, and extensive library ecosystem.&lt;/p&gt;

&lt;p&gt;Emerging fields such as artificial intelligence, automation, and big data analytics continue relying heavily on Python.&lt;/p&gt;

&lt;p&gt;It is likely that Python will remain one of the dominant programming languages in analytics for many years.&lt;/p&gt;

&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;Python has become one of the most important tools in modern data analytics. Its simplicity, flexibility, and powerful libraries make it suitable for both beginners and experienced professionals.&lt;/p&gt;

&lt;p&gt;Through libraries such as Pandas, NumPy, Matplotlib, Seaborn, and Scikit-learn, Python enables analysts to collect, clean, analyze, and visualize data efficiently. Organizations across healthcare, finance, transportation, marketing, and e-commerce rely on Python to make data-driven decisions.&lt;/p&gt;

&lt;p&gt;For beginners interested in analytics, learning Python is one of the best investments they can make. It provides practical problem-solving skills, strong career opportunities, and access to advanced technologies such as machine learning and artificial intelligence.&lt;/p&gt;

&lt;p&gt;Although learning Python requires patience and practice, consistent effort leads to significant growth over time. Starting with small projects, experimenting with real datasets, and practicing regularly can help beginners build confidence and develop valuable analytical skills.&lt;/p&gt;

&lt;p&gt;In a world increasingly driven by data, Python is more than just a programming language. It is a powerful tool that helps transform raw information into meaningful insights that can shape businesses, industries, and society itself.&lt;/p&gt;

</description>
      <category>beginners</category>
      <category>python</category>
      <category>datascience</category>
      <category>dataanalytics</category>
    </item>
    <item>
      <title>SQL Subqueries vs CTEs: Types, Differences, Performance, and When to Use Each</title>
      <dc:creator>Audrine Marion</dc:creator>
      <pubDate>Tue, 21 Apr 2026 11:44:41 +0000</pubDate>
      <link>https://forem.com/audrine_m/sql-subqueries-vs-ctes-types-differences-performance-and-when-to-use-each-4if</link>
      <guid>https://forem.com/audrine_m/sql-subqueries-vs-ctes-types-differences-performance-and-when-to-use-each-4if</guid>
      <description>&lt;p&gt;When working with SQL in analytics, reporting, or data engineering workflows, two powerful tools help structure complex logic:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Subqueries&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Common Table Expressions (CTEs)&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Understanding when to use each improves &lt;strong&gt;query readability, performance, and maintainability&lt;/strong&gt;—especially when building dashboards or transforming datasets for analysis.&lt;/p&gt;

&lt;p&gt;This article explains:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;what subqueries are&lt;/li&gt;
&lt;li&gt;types of subqueries&lt;/li&gt;
&lt;li&gt;when to use subqueries&lt;/li&gt;
&lt;li&gt;what CTEs are&lt;/li&gt;
&lt;li&gt;types of CTEs&lt;/li&gt;
&lt;li&gt;performance comparison&lt;/li&gt;
&lt;li&gt;when to choose subqueries vs CTEs&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;I started paying closer attention to the difference between subqueries and CTEs while working on analytics queries that became harder to debug as they grew more complex.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Foksmw0i50mg4m12hht3e.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Foksmw0i50mg4m12hht3e.png" alt="Subqueries and CTEs"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  What Is a Subquery?
&lt;/h2&gt;

&lt;p&gt;A &lt;strong&gt;subquery&lt;/strong&gt; is a query nested inside another SQL query.&lt;/p&gt;

&lt;p&gt;It executes first, and its output becomes input for the outer query.&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="n"&gt;name&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;salary&lt;/span&gt;
&lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;employees&lt;/span&gt;
&lt;span class="k"&gt;WHERE&lt;/span&gt; &lt;span class="n"&gt;salary&lt;/span&gt; &lt;span class="o"&gt;&amp;gt;&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;
    &lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="k"&gt;AVG&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;salary&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
    &lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;employees&lt;/span&gt;
&lt;span class="p"&gt;);&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Here:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;the inner query calculates the average salary&lt;/li&gt;
&lt;li&gt;the outer query filters employees earning above average&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Subqueries are useful for embedding logic directly inside SQL statements.&lt;/p&gt;




&lt;h2&gt;
  
  
  Types of Subqueries
&lt;/h2&gt;

&lt;p&gt;Subqueries can return different result structures depending on how they are used.&lt;/p&gt;

&lt;h3&gt;
  
  
  1. Scalar Subquery
&lt;/h3&gt;

&lt;p&gt;Returns &lt;strong&gt;one value&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="n"&gt;name&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
       &lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="k"&gt;AVG&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;salary&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;employees&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="k"&gt;AS&lt;/span&gt; &lt;span class="n"&gt;avg_salary&lt;/span&gt;
&lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;employees&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Use case:&lt;/p&gt;

&lt;p&gt;Comparisons or calculated columns&lt;/p&gt;




&lt;h3&gt;
  
  
  2.  Single-Row Subquery
&lt;/h3&gt;

&lt;p&gt;Returns exactly &lt;strong&gt;one row&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt;
&lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;employees&lt;/span&gt;
&lt;span class="k"&gt;WHERE&lt;/span&gt; &lt;span class="n"&gt;salary&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;
    &lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="k"&gt;MAX&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;salary&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
    &lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;employees&lt;/span&gt;
&lt;span class="p"&gt;);&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Common operators:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=  &amp;gt;  &amp;lt;  &amp;gt;=  &amp;lt;=
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;






&lt;h3&gt;
  
  
  3. Multi-Row Subquery
&lt;/h3&gt;

&lt;p&gt;Returns &lt;strong&gt;multiple rows&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt;
&lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;employees&lt;/span&gt;
&lt;span class="k"&gt;WHERE&lt;/span&gt; &lt;span class="n"&gt;department_id&lt;/span&gt; &lt;span class="k"&gt;IN&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;
    &lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="n"&gt;department_id&lt;/span&gt;
    &lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;departments&lt;/span&gt;
    &lt;span class="k"&gt;WHERE&lt;/span&gt; &lt;span class="k"&gt;location&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="s1"&gt;'Nairobi'&lt;/span&gt;
&lt;span class="p"&gt;);&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Common operators:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="k"&gt;IN&lt;/span&gt;
&lt;span class="k"&gt;ANY&lt;/span&gt;
&lt;span class="k"&gt;ALL&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;






&lt;h2&gt;
  
  
  4. Correlated Subquery
&lt;/h2&gt;

&lt;p&gt;Runs once &lt;strong&gt;for every row&lt;/strong&gt; in the outer query&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;name&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;salary&lt;/span&gt;
&lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;employees&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;
&lt;span class="k"&gt;WHERE&lt;/span&gt; &lt;span class="n"&gt;salary&lt;/span&gt; &lt;span class="o"&gt;&amp;gt;&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;
    &lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="k"&gt;AVG&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;salary&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
    &lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;employees&lt;/span&gt;
    &lt;span class="k"&gt;WHERE&lt;/span&gt; &lt;span class="n"&gt;department_id&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;department_id&lt;/span&gt;
&lt;span class="p"&gt;);&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Use carefully with large datasets because they may affect performance.&lt;/p&gt;




&lt;h2&gt;
  
  
  When Should You Use Subqueries?
&lt;/h2&gt;

&lt;p&gt;Subqueries are best when:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;filtering using aggregated values&lt;/li&gt;
&lt;li&gt;comparing values dynamically&lt;/li&gt;
&lt;li&gt;embedding quick logic inside WHERE clauses&lt;/li&gt;
&lt;li&gt;simplifying small calculations&lt;/li&gt;
&lt;li&gt;avoiding temporary tables&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;They work especially well when logic is &lt;strong&gt;used once only&lt;/strong&gt;.&lt;/p&gt;




&lt;h2&gt;
  
  
  What Is a CTE (Common Table Expression)?
&lt;/h2&gt;

&lt;p&gt;A &lt;strong&gt;Common Table Expression (CTE)&lt;/strong&gt; is a temporary named result set created using the &lt;code&gt;WITH&lt;/code&gt; clause.&lt;/p&gt;

&lt;p&gt;It improves readability and breaks complex queries into logical steps.&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="k"&gt;WITH&lt;/span&gt; &lt;span class="n"&gt;avg_salary&lt;/span&gt; &lt;span class="k"&gt;AS&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;
    &lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="k"&gt;AVG&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;salary&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="k"&gt;AS&lt;/span&gt; &lt;span class="n"&gt;avg_sal&lt;/span&gt;
    &lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;employees&lt;/span&gt;
&lt;span class="p"&gt;)&lt;/span&gt;

&lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt;
&lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;employees&lt;/span&gt;
&lt;span class="k"&gt;WHERE&lt;/span&gt; &lt;span class="n"&gt;salary&lt;/span&gt; &lt;span class="o"&gt;&amp;gt;&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="n"&gt;avg_sal&lt;/span&gt; &lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;avg_salary&lt;/span&gt;&lt;span class="p"&gt;);&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Think of a CTE as a temporary table that exists during query execution.&lt;/p&gt;




&lt;h2&gt;
  
  
  Types of CTEs
&lt;/h2&gt;

&lt;h3&gt;
  
  
  1. Non-Recursive CTE
&lt;/h3&gt;

&lt;p&gt;Used for simplifying complex logic&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="k"&gt;WITH&lt;/span&gt; &lt;span class="n"&gt;department_totals&lt;/span&gt; &lt;span class="k"&gt;AS&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;
    &lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="n"&gt;department_id&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
           &lt;span class="k"&gt;SUM&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;salary&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="k"&gt;AS&lt;/span&gt; &lt;span class="n"&gt;total_salary&lt;/span&gt;
    &lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;employees&lt;/span&gt;
    &lt;span class="k"&gt;GROUP&lt;/span&gt; &lt;span class="k"&gt;BY&lt;/span&gt; &lt;span class="n"&gt;department_id&lt;/span&gt;
&lt;span class="p"&gt;)&lt;/span&gt;

&lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt;
&lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;department_totals&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Best for:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;transformations&lt;/li&gt;
&lt;li&gt;aggregations&lt;/li&gt;
&lt;li&gt;reusable logic&lt;/li&gt;
&lt;/ul&gt;




&lt;h3&gt;
  
  
  2. Recursive CTE
&lt;/h3&gt;

&lt;p&gt;Used for hierarchical datasets&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="k"&gt;WITH&lt;/span&gt; &lt;span class="k"&gt;RECURSIVE&lt;/span&gt; &lt;span class="n"&gt;employee_hierarchy&lt;/span&gt; &lt;span class="k"&gt;AS&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;

    &lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="n"&gt;id&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;manager_id&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;name&lt;/span&gt;
    &lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;employees&lt;/span&gt;
    &lt;span class="k"&gt;WHERE&lt;/span&gt; &lt;span class="n"&gt;manager_id&lt;/span&gt; &lt;span class="k"&gt;IS&lt;/span&gt; &lt;span class="k"&gt;NULL&lt;/span&gt;

    &lt;span class="k"&gt;UNION&lt;/span&gt; &lt;span class="k"&gt;ALL&lt;/span&gt;

    &lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;id&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;manager_id&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;name&lt;/span&gt;
    &lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;employees&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;
    &lt;span class="k"&gt;JOIN&lt;/span&gt; &lt;span class="n"&gt;employee_hierarchy&lt;/span&gt; &lt;span class="n"&gt;eh&lt;/span&gt;
    &lt;span class="k"&gt;ON&lt;/span&gt; &lt;span class="n"&gt;e&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;manager_id&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;eh&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;id&lt;/span&gt;
&lt;span class="p"&gt;)&lt;/span&gt;

&lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt;
&lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;employee_hierarchy&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Common use cases:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;organization structures&lt;/li&gt;
&lt;li&gt;category trees&lt;/li&gt;
&lt;li&gt;folder hierarchies&lt;/li&gt;
&lt;li&gt;graph traversal&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Subqueries vs CTEs: Key Differences
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Feature&lt;/th&gt;
&lt;th&gt;Subqueries&lt;/th&gt;
&lt;th&gt;CTEs&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Readability&lt;/td&gt;
&lt;td&gt;Moderate&lt;/td&gt;
&lt;td&gt;High&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Reusability&lt;/td&gt;
&lt;td&gt;Low&lt;/td&gt;
&lt;td&gt;High&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Debugging&lt;/td&gt;
&lt;td&gt;Harder&lt;/td&gt;
&lt;td&gt;Easier&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Recursion Support&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;Yes&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Best Use Case&lt;/td&gt;
&lt;td&gt;Simple filtering&lt;/td&gt;
&lt;td&gt;Multi-step logic&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;




&lt;h2&gt;
  
  
  Performance Comparison
&lt;/h2&gt;

&lt;p&gt;Performance depends on the database system.&lt;/p&gt;

&lt;p&gt;General guidance:&lt;/p&gt;

&lt;p&gt;Use &lt;strong&gt;subqueries&lt;/strong&gt; when:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;logic is simple&lt;/li&gt;
&lt;li&gt;result used once&lt;/li&gt;
&lt;li&gt;query is short&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Use &lt;strong&gt;CTEs&lt;/strong&gt; when:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;logic reused multiple times&lt;/li&gt;
&lt;li&gt;queries become complex&lt;/li&gt;
&lt;li&gt;building layered transformations&lt;/li&gt;
&lt;li&gt;working with hierarchies&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Test performance using:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="k"&gt;EXPLAIN&lt;/span&gt; &lt;span class="k"&gt;ANALYZE&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;This helps evaluate execution plans.&lt;/p&gt;




&lt;h2&gt;
  
  
  When Should You Choose Each?
&lt;/h2&gt;

&lt;p&gt;Choose a &lt;strong&gt;subquery&lt;/strong&gt; if:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;quick filtering needed&lt;/li&gt;
&lt;li&gt;used inside WHERE clause&lt;/li&gt;
&lt;li&gt;logic simple&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Choose a &lt;strong&gt;CTE&lt;/strong&gt; if:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;readability matters&lt;/li&gt;
&lt;li&gt;transformation steps needed&lt;/li&gt;
&lt;li&gt;recursion required&lt;/li&gt;
&lt;li&gt;logic reused multiple times&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Final Thoughts
&lt;/h2&gt;

&lt;p&gt;Both subqueries and CTEs are essential SQL tools for analysts and BI professionals.&lt;/p&gt;

&lt;p&gt;Subqueries help embed quick logic inside statements.&lt;/p&gt;

&lt;p&gt;CTEs improve structure, readability, and scalability—especially in analytics pipelines and dashboards.&lt;/p&gt;

&lt;p&gt;Knowing when to use each makes your SQL cleaner, faster to debug, and easier to maintain.&lt;/p&gt;

</description>
      <category>sql</category>
      <category>datascience</category>
      <category>beginners</category>
      <category>database</category>
    </item>
    <item>
      <title>How to publish a Power BI report and embed it into a website.</title>
      <dc:creator>Audrine Marion</dc:creator>
      <pubDate>Sun, 05 Apr 2026 15:58:02 +0000</pubDate>
      <link>https://forem.com/audrine_m/how-to-publish-a-power-bi-report-and-embed-it-into-a-website-4lb9</link>
      <guid>https://forem.com/audrine_m/how-to-publish-a-power-bi-report-and-embed-it-into-a-website-4lb9</guid>
      <description>&lt;p&gt;Microsoft Power BI is a powerful business intelligence tool that enables users to transform raw data into interactive dashboards and reports. One of its most useful features is the ability to publish reports to the Power BI Service and embed them into websites for sharing insights with a wider audience.&lt;/p&gt;

&lt;p&gt;This guide walks through the full process step by step: creating a workspace, publishing a report, generating embed code, and embedding the report into a website.&lt;/p&gt;




&lt;h2&gt;
  
  
  Step 1: Create a Workspace in Power BI Service
&lt;/h2&gt;

&lt;p&gt;A workspace is a collaborative environment where reports, dashboards, and datasets are stored before publishing or sharing.&lt;/p&gt;

&lt;h3&gt;
  
  
  Steps:
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;Sign in to &lt;strong&gt;Power BI Service&lt;/strong&gt; (&lt;a href="https://app.powerbi.com" rel="noopener noreferrer"&gt;https://app.powerbi.com&lt;/a&gt;).&lt;/li&gt;
&lt;li&gt;From the left navigation panel, select &lt;strong&gt;Workspaces&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Click &lt;strong&gt;New workspace&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Enter a workspace name and description.&lt;/li&gt;
&lt;li&gt;Click &lt;strong&gt;Save&lt;/strong&gt;.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fq89u4512hp3umcrchc7o.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fq89u4512hp3umcrchc7o.png" alt="My Workspaces" width="800" height="364"&gt;&lt;/a&gt; &lt;br&gt;
Then proceed to add New Workspace&lt;/p&gt;

&lt;p&gt;Why this matters:&lt;br&gt;
Workspaces help organize reports and control access permissions for collaboration and publishing.&lt;/p&gt;




&lt;h2&gt;
  
  
  Step 2: Upload and Publish Your Report
&lt;/h2&gt;

&lt;p&gt;Once your workspace is ready, the next step is publishing your Power BI Desktop report.&lt;/p&gt;

&lt;h3&gt;
  
  
  Steps:
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;Open your report in &lt;strong&gt;Power BI Desktop&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Click &lt;strong&gt;File → Publish&lt;/strong&gt; or select &lt;strong&gt;Publish&lt;/strong&gt; from the Home ribbon.&lt;/li&gt;
&lt;li&gt;Choose your created workspace.&lt;/li&gt;
&lt;li&gt;Click &lt;strong&gt;Select&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Wait for the confirmation message indicating successful publishing.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fv4tnjytk6h3c6l2d3van.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fv4tnjytk6h3c6l2d3van.png" alt="Publish Button" width="800" height="115"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Why this matters:&lt;br&gt;
Publishing moves your report from local development to the cloud where it can be shared and embedded.&lt;/p&gt;




&lt;h2&gt;
  
  
  Step 3: Generate the Embed Code
&lt;/h2&gt;

&lt;p&gt;After publishing the report, you can generate embed code from Power BI Service.&lt;/p&gt;

&lt;h3&gt;
  
  
  Steps:
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;Open &lt;strong&gt;Power BI Service&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Navigate to your workspace.&lt;/li&gt;
&lt;li&gt;Select your report.&lt;/li&gt;
&lt;li&gt;Click &lt;strong&gt;File → Embed report → Publish to web (public)&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;Confirm the security warning.&lt;/li&gt;
&lt;li&gt;Copy the generated HTML iframe embed code.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9exb2ek1n41oydlnfcr9.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F9exb2ek1n41oydlnfcr9.png" alt="Embed Report" width="800" height="498"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6rs86zn917fmgy3jq7v3.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6rs86zn917fmgy3jq7v3.png" alt="iFrame" width="778" height="121"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Why this matters:&lt;br&gt;
The iframe embed code allows your report to be displayed inside a webpage.&lt;/p&gt;




&lt;h2&gt;
  
  
  Step 4: Embed the Report on a Website
&lt;/h2&gt;

&lt;p&gt;Once the embed code is generated, you can place it inside your website's HTML.&lt;/p&gt;

&lt;h3&gt;
  
  
  Example HTML Code:
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight html"&gt;&lt;code&gt;&lt;span class="nt"&gt;&amp;lt;iframe&lt;/span&gt;
    &lt;span class="na"&gt;width=&lt;/span&gt;&lt;span class="s"&gt;"800"&lt;/span&gt;
    &lt;span class="na"&gt;height=&lt;/span&gt;&lt;span class="s"&gt;"600"&lt;/span&gt;
    &lt;span class="na"&gt;src=&lt;/span&gt;&lt;span class="s"&gt;"https://app.powerbi.com/view?r=YOUR_EMBED_LINK"&lt;/span&gt;
    &lt;span class="na"&gt;frameborder=&lt;/span&gt;&lt;span class="s"&gt;"0"&lt;/span&gt;
    &lt;span class="na"&gt;allowFullScreen=&lt;/span&gt;&lt;span class="s"&gt;"true"&lt;/span&gt;&lt;span class="nt"&gt;&amp;gt;&lt;/span&gt;
&lt;span class="nt"&gt;&amp;lt;/iframe&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Steps:
&lt;/h3&gt;

&lt;ol&gt;
&lt;li&gt;Open your website HTML file.&lt;/li&gt;
&lt;li&gt;Paste the iframe code where the report should appear.&lt;/li&gt;
&lt;li&gt;Save changes.&lt;/li&gt;
&lt;li&gt;Refresh your webpage.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fd6j983dkrzgvqljmu7ol.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fd6j983dkrzgvqljmu7ol.png" alt="Web Example" width="800" height="245"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Why this matters:&lt;br&gt;
Embedding enables stakeholders and users to interact with dashboards without logging into Power BI.&lt;/p&gt;




&lt;h2&gt;
  
  
  Key Insights and Best Practices
&lt;/h2&gt;

&lt;p&gt;Publishing and embedding Power BI reports makes insights accessible to a broader audience. However, it is important to understand visibility settings before sharing.&lt;/p&gt;

&lt;p&gt;Key takeaways:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Workspaces help manage report access and organization.&lt;/li&gt;
&lt;li&gt;Publishing moves reports from Desktop to Power BI Service.&lt;/li&gt;
&lt;li&gt;Embed codes allow reports to be integrated into websites.&lt;/li&gt;
&lt;li&gt;"Publish to web" makes reports publicly accessible, so sensitive data should not be included.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;Publishing and embedding Power BI reports is a straightforward process that significantly enhances data accessibility and collaboration. By following these steps, you can successfully move your report from Power BI Desktop to a live website and share insights interactively with your audience.&lt;/p&gt;

&lt;p&gt;Adding screenshots at each step improves clarity and helps users follow the workflow more easily.&lt;/p&gt;

</description>
      <category>beginners</category>
      <category>analytics</category>
    </item>
    <item>
      <title>Understanding Data Modeling in Power BI: Joins, Relationships, and Schemas Explained</title>
      <dc:creator>Audrine Marion</dc:creator>
      <pubDate>Sun, 29 Mar 2026 20:30:09 +0000</pubDate>
      <link>https://forem.com/audrine_m/understanding-data-modeling-in-power-bi-joins-relationships-and-schemas-explained-5h02</link>
      <guid>https://forem.com/audrine_m/understanding-data-modeling-in-power-bi-joins-relationships-and-schemas-explained-5h02</guid>
      <description>&lt;p&gt;Data modeling is the backbone of every effective Power BI report. If dashboards feel slow, filters behave incorrectly, or numbers don’t match expectations, the issue is often the data model not the visuals.&lt;/p&gt;

&lt;p&gt;This guide explains how data modeling works in Power BI step‑by‑step. You’ll learn SQL joins, relationships, schemas, fact vs dimension tables, role‑playing dimensions, and how everything is created inside Power BI itself.&lt;/p&gt;

&lt;p&gt;This article is beginner‑friendly but structured like a professional BI reference.&lt;/p&gt;




&lt;h2&gt;
  
  
  What is Data Modeling?
&lt;/h2&gt;

&lt;p&gt;Data modeling is the process of organizing tables and defining how they relate so reports are accurate, scalable, and fast.&lt;/p&gt;

&lt;p&gt;In Power BI, good data modeling helps you:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;connect multiple datasets&lt;/li&gt;
&lt;li&gt;control filter behavior&lt;/li&gt;
&lt;li&gt;improve performance&lt;/li&gt;
&lt;li&gt;enable time intelligence&lt;/li&gt;
&lt;li&gt;prevent duplicate counting&lt;/li&gt;
&lt;li&gt;support executive‑level reporting&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Power BI primarily uses &lt;strong&gt;relationships instead of joins&lt;/strong&gt; during analysis.&lt;/p&gt;




&lt;h2&gt;
  
  
  SQL Joins Explained (With Real Examples)
&lt;/h2&gt;

&lt;p&gt;SQL joins combine tables physically in &lt;strong&gt;Power Query&lt;/strong&gt; before loading data into the model.&lt;/p&gt;

&lt;p&gt;Location in Power BI:&lt;/p&gt;

&lt;p&gt;&lt;code&gt;Transform Data → Merge Queries&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Furf8lswibpdw45gkyf5b.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Furf8lswibpdw45gkyf5b.png" alt="Power BI Merge Queries window showing how SQL joins are created in Power Query" width="800" height="533"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h3&gt;
  
  
  INNER JOIN
&lt;/h3&gt;

&lt;p&gt;Returns only matching records in both tables.&lt;/p&gt;

&lt;p&gt;Example:&lt;/p&gt;

&lt;p&gt;Customers Table&lt;br&gt;&lt;br&gt;
Orders Table  &lt;/p&gt;

&lt;p&gt;Result: Only customers who placed orders appear.&lt;/p&gt;

&lt;p&gt;Real‑life analytics use case:&lt;br&gt;&lt;br&gt;
Analyzing purchasing customers only.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fmefu463mf5k8o3vt9i66.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fmefu463mf5k8o3vt9i66.png" alt="inner join" width="783" height="409"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h3&gt;
  
  
  LEFT JOIN (LEFT OUTER JOIN)
&lt;/h3&gt;

&lt;p&gt;Returns all rows from the left table and matching rows from the right.&lt;/p&gt;

&lt;p&gt;Use case:&lt;br&gt;&lt;br&gt;
Customer engagement analysis including inactive customers&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxliae9q3pcvk8l2kquno.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxliae9q3pcvk8l2kquno.png" alt="left join" width="800" height="234"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h3&gt;
  
  
  RIGHT JOIN (RIGHT OUTER JOIN)
&lt;/h3&gt;

&lt;p&gt;Returns all rows from the right table and matching rows from the left.&lt;/p&gt;

&lt;p&gt;Use case:&lt;br&gt;&lt;br&gt;
Transaction completeness audits&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fafedizmfk2zlscijtzng.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fafedizmfk2zlscijtzng.png" alt="Right Join" width="340" height="309"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h3&gt;
  
  
  FULL OUTER JOIN
&lt;/h3&gt;

&lt;p&gt;Returns all rows from both tables.&lt;/p&gt;

&lt;p&gt;Use case:&lt;br&gt;&lt;br&gt;
Data reconciliation between systems&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fmqinwdn10lfrvxx0wcqp.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fmqinwdn10lfrvxx0wcqp.png" alt="Full Outer Join" width="246" height="163"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F961xqltvxbnel78t7hux.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F961xqltvxbnel78t7hux.png" alt="Full Outer Join" width="148" height="106"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h3&gt;
  
  
  LEFT ANTI JOIN
&lt;/h3&gt;

&lt;p&gt;Returns rows from the left table with &lt;strong&gt;no matches&lt;/strong&gt; in the right table.&lt;/p&gt;

&lt;p&gt;Use case:&lt;br&gt;&lt;br&gt;
Customer churn targeting&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwz52zuxcvx2uc25lq0pj.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwz52zuxcvx2uc25lq0pj.png" alt="Left Anti Join" width="366" height="423"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h3&gt;
  
  
  RIGHT ANTI JOIN
&lt;/h3&gt;

&lt;p&gt;Returns rows from the right table with &lt;strong&gt;no matches&lt;/strong&gt; in the left table.&lt;/p&gt;

&lt;p&gt;Use case:&lt;br&gt;&lt;br&gt;
Data quality audits&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3n0s21fj52wg5j8pt77y.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3n0s21fj52wg5j8pt77y.png" alt="Right Anti Join" width="373" height="415"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Relationships in Power BI
&lt;/h2&gt;

&lt;p&gt;Relationships connect tables &lt;strong&gt;logically instead of physically merging them&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;Location:&lt;/p&gt;

&lt;p&gt;&lt;code&gt;Model View → Drag column between tables&lt;/code&gt;&lt;br&gt;&lt;br&gt;
OR&lt;br&gt;&lt;br&gt;
&lt;code&gt;Home → Manage Relationships → New&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fv2gjqf6x79f96qagzvgn.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fv2gjqf6x79f96qagzvgn.png" alt="Power BI Model View showing relationships between fact and dimension tables" width="748" height="337"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Types of Relationships
&lt;/h2&gt;

&lt;h3&gt;
  
  
  One‑to‑Many (1:M)
&lt;/h3&gt;

&lt;p&gt;Most common relationship type.&lt;br&gt;&lt;br&gt;
Example: DimCustomer → FactSales&lt;/p&gt;

&lt;p&gt;DimCustomer (1)&lt;br&gt;
┌──────────────┐&lt;br&gt;
│ Customer ID  │&lt;br&gt;
│ Customer Name│&lt;br&gt;
│ Region       │&lt;br&gt;
└──────┬───────┘&lt;br&gt;
       │&lt;br&gt;
       │&lt;br&gt;
       │&lt;br&gt;
       ▼&lt;br&gt;
FactSales (Many)&lt;br&gt;
┌──────────────┐&lt;br&gt;
│ Order ID     │&lt;br&gt;
│ Customer ID  │&lt;br&gt;
│ Sales Amount │&lt;br&gt;
│ Quantity     │&lt;br&gt;
└──────────────┘&lt;br&gt;
One-to-Many relationship: One customer can appear multiple times in the sales table.&lt;/p&gt;




&lt;h3&gt;
  
  
  Many‑to‑Many (M:M)
&lt;/h3&gt;

&lt;p&gt;Occurs when both tables contain duplicate keys.&lt;br&gt;&lt;br&gt;
Example: Students ↔ Courses&lt;/p&gt;

&lt;p&gt;Students&lt;br&gt;
┌──────────────┐&lt;br&gt;
│ Student ID   │&lt;br&gt;
│ Student Name │&lt;br&gt;
└──────┬───────┘&lt;br&gt;
       │&lt;br&gt;
       │&lt;br&gt;
       ▼&lt;br&gt;
Enrollment (Bridge Table)&lt;br&gt;
┌──────────────┐&lt;br&gt;
│ Student ID   │&lt;br&gt;
│ Course ID    │&lt;br&gt;
└──────┬───────┘&lt;br&gt;
       │&lt;br&gt;
       │&lt;br&gt;
       ▼&lt;br&gt;
Courses&lt;br&gt;
┌──────────────┐&lt;br&gt;
│ Course ID    │&lt;br&gt;
│ Course Name  │&lt;br&gt;
└──────────────┘&lt;br&gt;
Many-to-Many relationship resolved using a bridge table between Students and Courses.&lt;/p&gt;




&lt;h3&gt;
  
  
  One‑to‑One (1:1)
&lt;/h3&gt;

&lt;p&gt;Rare but useful.&lt;br&gt;&lt;br&gt;
Example: Employee table ↔ Employee security table&lt;br&gt;
Employees&lt;br&gt;
┌──────────────┐&lt;br&gt;
│ Employee ID  │&lt;br&gt;
│ Name         │&lt;br&gt;
│ Department   │&lt;br&gt;
└──────┬───────┘&lt;br&gt;
       │&lt;br&gt;
       │&lt;br&gt;
       ▼&lt;br&gt;
EmployeeSecurity&lt;br&gt;
┌──────────────┐&lt;br&gt;
│ Employee ID  │&lt;br&gt;
│ Access Level │&lt;br&gt;
│ Login Role   │&lt;br&gt;
└──────────────┘&lt;br&gt;
One-to-One relationship: Each employee record matches exactly one security profile.&lt;/p&gt;




&lt;h2&gt;
  
  
  Active vs Inactive Relationships
&lt;/h2&gt;

&lt;p&gt;Power BI allows multiple relationships between tables but only one active at a time.&lt;br&gt;
                DimDate&lt;br&gt;
            ┌──────────────┐&lt;br&gt;
            │ Date         │&lt;br&gt;
            └──────┬───────┘&lt;br&gt;
                   │&lt;br&gt;
        (Active) ──┼──────── OrderDate&lt;br&gt;
                   │&lt;br&gt;
     (Inactive) ─ ─┼──────── ShipDate&lt;br&gt;
                   │&lt;br&gt;
     (Inactive) ─ ─┼──────── DeliveryDate&lt;br&gt;
                   │&lt;br&gt;
               FactSales&lt;/p&gt;

&lt;p&gt;Active relationship (solid line) filters visuals automatically, while inactive relationships (dashed lines) require USERELATIONSHIP() in DAX.&lt;/p&gt;




&lt;h2&gt;
  
  
  Cardinality Explained
&lt;/h2&gt;

&lt;p&gt;Cardinality describes table relationship structure:&lt;br&gt;&lt;br&gt;
One‑to‑Many, Many‑to‑One, Many‑to‑Many, One‑to‑One&lt;/p&gt;

&lt;p&gt;Location:&lt;br&gt;&lt;br&gt;
&lt;code&gt;Manage Relationships → Cardinality dropdown&lt;/code&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Cross Filter Direction
&lt;/h2&gt;

&lt;p&gt;Controls how filters move between tables.  &lt;/p&gt;

&lt;h2&gt;
  
  
  Single Direction (recommended) vs Bi‑Directional
&lt;/h2&gt;

&lt;h2&gt;
  
  
  Difference Between Joins and Relationships
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Power Query Joins:&lt;/strong&gt; Combine tables physically, run before loading, increase table size
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Relationships:&lt;/strong&gt; Connect tables logically, run after loading, improve performance
&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Fact vs Dimension Tables
&lt;/h2&gt;

&lt;p&gt;Fact Tables contain numeric values (Sales Amount, Revenue, Quantity)&lt;br&gt;&lt;br&gt;
Dimension Tables contain descriptive attributes (Customer Name, Product Category, Region)&lt;/p&gt;




&lt;h2&gt;
  
  
  Star Schema (Recommended Model)
&lt;/h2&gt;

&lt;p&gt;Fact table at center, dimension tables surrounding.&lt;br&gt;&lt;br&gt;
Fast performance, simple relationships, scalable dashboards.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2eonqumsxstkxm8r80ub.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2eonqumsxstkxm8r80ub.png" alt="Star schema example with FactSales connected to dimension tables Customer, Product, and Date" width="544" height="711"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Snowflake Schema
&lt;/h2&gt;

&lt;p&gt;Dimensions split into multiple related tables.&lt;br&gt;&lt;br&gt;
Reduced redundancy but more complex filtering.&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;                DimDepartment
              ┌──────────────┐
              │ Dept ID      │
              │ Dept Name    │
              └──────┬───────┘
                     │
                     │
                DimCategory
              ┌──────────────┐
              │ Category ID  │
              │ Category Name│
              └──────┬───────┘
                     │
                     │
                DimProduct
              ┌──────────────┐
              │ Product ID   │
              │ Product Name │
              └──────┬───────┘
                     │
                     │
             ┌───────▼────────┐
             │    FactSales   │
             │ Sales Amount   │
             │ Quantity       │
             │ Revenue        │
             │ Order ID       │
             └───────┬────────┘
                     │
                     │
                DimCustomer
              ┌──────────────┐
              │ Customer ID  │
              │ Customer Name│
              │ Region       │
              └──────────────┘
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;




&lt;h2&gt;
  
  
  Flat Table (Denormalized / DLAT Model)
&lt;/h2&gt;

&lt;p&gt;All fields stored inside one table.&lt;br&gt;&lt;br&gt;
Simple setup but poor scalability.&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;             Flat Table (Sales Dataset)
        ┌───────────────────────────────┐
        │ Order ID                      │
        │ Order Date                    │
        │ Customer ID                   │
        │ Customer Name                 │
        │ Region                        │
        │ Product ID                    │
        │ Product Name                  │
        │ Category                      │
        │ Department                    │
        │ Sales Amount                  │
        │ Quantity                      │
        │ Revenue                       │
        └───────────────────────────────┘
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;Example Flat Table (DLAT Model): All descriptive and transactional fields stored inside a single table without relationships between dimensions.&lt;/p&gt;




&lt;h2&gt;
  
  
  Role‑Playing Dimensions
&lt;/h2&gt;

&lt;p&gt;One dimension reused multiple times (e.g., Date table as Order Date, Ship Date, Delivery Date).&lt;/p&gt;




&lt;h2&gt;
  
  
  Common Data Modeling Mistakes in Power BI
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Circular relationships
&lt;/li&gt;
&lt;li&gt;Duplicate keys
&lt;/li&gt;
&lt;li&gt;Many‑to‑many misuse
&lt;/li&gt;
&lt;li&gt;Too many bi‑directional filters
&lt;/li&gt;
&lt;li&gt;Over‑joining tables in Power Query
&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Step‑by‑Step Modeling Workflow in Power BI
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;Home → Get Data
&lt;/li&gt;
&lt;li&gt;Transform Data → Power Query
&lt;/li&gt;
&lt;li&gt;Merge tables if required
&lt;/li&gt;
&lt;li&gt;Create relationships
&lt;/li&gt;
&lt;li&gt;Validate cardinality
&lt;/li&gt;
&lt;li&gt;Test filter flow
&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fo3n2djhg8uksiq9jkihh.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fo3n2djhg8uksiq9jkihh.png" alt="Step‑by‑Step Modeling Workflow in Power BI" width="800" height="529"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Generally
&lt;/h2&gt;

&lt;p&gt;Strong data modeling transforms dashboards into decision systems.&lt;br&gt;&lt;br&gt;
Master joins, schemas, and relationships to move from report builder to analytics engineer.&lt;/p&gt;

</description>
      <category>powerbi</category>
      <category>dataanalytics</category>
      <category>businessintelligence</category>
      <category>beginners</category>
    </item>
    <item>
      <title>How Excel is Used in Real-World Data Analysis</title>
      <dc:creator>Audrine Marion</dc:creator>
      <pubDate>Mon, 23 Mar 2026 16:17:18 +0000</pubDate>
      <link>https://forem.com/audrine_m/how-excel-is-used-in-real-world-data-analysis-4m66</link>
      <guid>https://forem.com/audrine_m/how-excel-is-used-in-real-world-data-analysis-4m66</guid>
      <description>&lt;h2&gt;
  
  
  Introduction
&lt;/h2&gt;

&lt;p&gt;Excel is one of the most widely used tools for data analysis across industries. Although newer technologies like Python, SQL, and Power BI are becoming more common, Excel remains a foundational skill for anyone working with data. It allows analysts to organize information, clean datasets, perform calculations, explore patterns, and present insights in a structured and accessible way.&lt;/p&gt;

&lt;p&gt;Because of its flexibility and ease of use, Excel is often the first tool professionals rely on when working with raw data before moving into more advanced analytics environments.&lt;/p&gt;

&lt;h2&gt;
  
  
  What is Excel?
&lt;/h2&gt;

&lt;p&gt;Excel is a spreadsheet application developed by Microsoft that allows users to store, organize, calculate, and analyze data using rows and columns. It provides built-in formulas, visualization tools, Pivot Tables, and automation features that help transform raw numbers into meaningful insights.&lt;/p&gt;

&lt;p&gt;In real-world data analysis, Excel is commonly used for:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Cleaning messy datasets&lt;/li&gt;
&lt;li&gt;Performing calculations&lt;/li&gt;
&lt;li&gt;Summarizing large datasets&lt;/li&gt;
&lt;li&gt;Identifying trends and patterns&lt;/li&gt;
&lt;li&gt;Creating dashboards and reports&lt;/li&gt;
&lt;li&gt;Supporting business decision-making&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Real-World Applications of Excel in Data Analysis
&lt;/h2&gt;

&lt;h3&gt;
  
  
  1. Data Cleaning
&lt;/h3&gt;

&lt;p&gt;Before analysis begins, datasets often contain missing values, duplicates, or formatting issues. Excel helps analysts prepare datasets using tools such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Remove Duplicates&lt;/li&gt;
&lt;li&gt;Find and Replace&lt;/li&gt;
&lt;li&gt;Text-to-Columns&lt;/li&gt;
&lt;li&gt;Sorting and Filtering&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These features ensure that data is accurate and ready for analysis.&lt;/p&gt;

&lt;h2&gt;
  
  
  2. Data Summarization Using Pivot Tables
&lt;/h2&gt;

&lt;p&gt;Pivot Tables are one of Excel’s most powerful tools. They allow analysts to quickly summarize large datasets and answer important questions such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Which product generated the highest revenue?&lt;/li&gt;
&lt;li&gt;Which region had the most customers?&lt;/li&gt;
&lt;li&gt;Which month recorded the highest sales?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Instead of manually calculating totals, Pivot Tables automatically group and summarize information.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fm50re12oj67hnittjt41.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fm50re12oj67hnittjt41.png" alt="Excel Pivot Table summarizing sales data by region and product category"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;example of pivot table showing the sum of bonus by location&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Pivot Tables help analysts transform raw datasets into structured summaries within seconds.&lt;/p&gt;




&lt;h2&gt;
  
  
  3. Using Formulas for Analysis
&lt;/h2&gt;

&lt;p&gt;Excel formulas help analysts perform calculations efficiently and accurately.&lt;/p&gt;

&lt;h3&gt;
  
  
  SUM Function
&lt;/h3&gt;

&lt;p&gt;Used to calculate totals such as total sales or expenses.&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=SUM(B2:B100)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Used in scenarios like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;calculating monthly expenses&lt;/li&gt;
&lt;li&gt;total company revenue&lt;/li&gt;
&lt;li&gt;inventory totals&lt;/li&gt;
&lt;/ul&gt;




&lt;h3&gt;
  
  
  IF Function
&lt;/h3&gt;

&lt;p&gt;Used to apply logical conditions to categorize data.&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=IF(B2&amp;gt;50,"Pass","Fail")
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Real-world uses include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;grading systems&lt;/li&gt;
&lt;li&gt;customer segmentation&lt;/li&gt;
&lt;li&gt;performance classification&lt;/li&gt;
&lt;/ul&gt;




&lt;h3&gt;
  
  
  VLOOKUP Function
&lt;/h3&gt;

&lt;p&gt;Used to retrieve matching values from another table.&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;=VLOOKUP(A2,Sheet2!A:B,2,FALSE)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Analysts commonly use this when:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;merging customer records&lt;/li&gt;
&lt;li&gt;matching product IDs&lt;/li&gt;
&lt;li&gt;combining datasets from multiple sheets&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  4. Data Visualization
&lt;/h2&gt;

&lt;p&gt;Excel allows analysts to create charts such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Bar charts&lt;/li&gt;
&lt;li&gt;Line charts&lt;/li&gt;
&lt;li&gt;Pie charts&lt;/li&gt;
&lt;li&gt;Column charts&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These visualizations make it easier to communicate findings clearly to stakeholders who may not have technical backgrounds.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fhzdbsgk6kgnfx9onxr1r.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fhzdbsgk6kgnfx9onxr1r.png" alt="Excel column chart showing monthly sales trends across three months"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Example Excel line chart showing average salary across month of hire.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  5. Conditional Formatting for Insight Discovery
&lt;/h2&gt;

&lt;p&gt;Conditional formatting highlights important values automatically.&lt;/p&gt;

&lt;p&gt;Example uses:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;highlighting top-performing regions&lt;/li&gt;
&lt;li&gt;identifying missing values&lt;/li&gt;
&lt;li&gt;detecting unusually high expenses&lt;/li&gt;
&lt;li&gt;spotting declining performance trends&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2irljn0zj97odxzjhfr8.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2irljn0zj97odxzjhfr8.png" alt="Excel dataset using conditional formatting to highlight high and low values"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Example of conditional formatting that highlights performance score greater than 5&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;This improves readability and makes patterns visible immediately.&lt;/p&gt;




&lt;h2&gt;
  
  
  Features I Have Learned and Applied When Working with Data
&lt;/h2&gt;

&lt;p&gt;While learning Excel, I gained experience using several important features that support data analysis workflows.&lt;/p&gt;

&lt;p&gt;Pivot Tables helped me summarize datasets quickly and identify trends without writing complex formulas.&lt;/p&gt;

&lt;p&gt;The IF function allowed me to categorize data into meaningful groups, making it easier to interpret results.&lt;/p&gt;

&lt;p&gt;VLOOKUP helped me combine information from multiple tables, which is especially useful when working with relational datasets.&lt;/p&gt;

&lt;p&gt;Conditional formatting helped highlight important values such as high-performing regions or missing data points, making datasets easier to understand visually.&lt;/p&gt;

&lt;h2&gt;
  
  
  These tools significantly improved my ability to explore and interpret datasets efficiently.
&lt;/h2&gt;

&lt;h2&gt;
  
  
  Personal Reflection
&lt;/h2&gt;

&lt;p&gt;Learning Excel has changed the way I approach data. Previously, I mainly viewed data as numbers arranged in tables, but Excel helped me understand how structured analysis can reveal meaningful insights and patterns.&lt;/p&gt;

&lt;p&gt;Through practicing formulas, Pivot Tables, and visualization techniques, I became more confident in cleaning datasets, summarizing information, and presenting findings clearly. Excel also strengthened my problem-solving skills by teaching me how to break down datasets into manageable steps during analysis.&lt;/p&gt;

&lt;p&gt;As someone building a career in data analytics and data science, learning Excel has been an important step in developing my analytical thinking. It has helped me move from simply observing data to actively exploring it and using it to support informed decisions.&lt;/p&gt;

&lt;p&gt;Overall, Excel has become a valuable foundation in my data analysis journey, and it continues to support my growth as I expand my skills into more advanced tools and technologies.&lt;/p&gt;

</description>
      <category>beginners</category>
      <category>datascience</category>
      <category>analytics</category>
      <category>analyst</category>
    </item>
  </channel>
</rss>
