<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Allan Ramirez</title>
    <description>The latest articles on Forem by Allan Ramirez (@allanthelearner).</description>
    <link>https://forem.com/allanthelearner</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1699839%2Ff24b78fb-de50-4932-b3ff-6a245e15bced.jpg</url>
      <title>Forem: Allan Ramirez</title>
      <link>https://forem.com/allanthelearner</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/allanthelearner"/>
    <language>en</language>
    <item>
      <title>My first Steps in the world of Data Engineering</title>
      <dc:creator>Allan Ramirez</dc:creator>
      <pubDate>Fri, 28 Jun 2024 18:21:02 +0000</pubDate>
      <link>https://forem.com/allanthelearner/my-first-steps-in-the-world-of-data-engineering-1b32</link>
      <guid>https://forem.com/allanthelearner/my-first-steps-in-the-world-of-data-engineering-1b32</guid>
      <description>&lt;p&gt;Data is the new oil they say....&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fyuah14dseal1v31sobi6.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fyuah14dseal1v31sobi6.gif" alt="Image description" width="498" height="280"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h3&gt;
  
  
  Table Of Contents
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Introduction&lt;/li&gt;
&lt;li&gt;Data Engineer, What is it?&lt;/li&gt;
&lt;li&gt;Pipelines in Focus&lt;/li&gt;
&lt;li&gt;ETL VS LTE&lt;/li&gt;
&lt;li&gt;Data  Lake vs Data warehouse&lt;/li&gt;
&lt;li&gt;Reflection&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Introduction
&lt;/h2&gt;

&lt;p&gt;For my Capstone,I was given the opportunity from my Fellowship to explore a branch that not normally from our curriculum, Data Engineering. Since I am in a ever lasting journey of self knowledge, and figuring out who am i as a person, I knew I had to take this chance to explore my options, and I was up for the task!&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fodyncvx90w1kx474a6uk.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fodyncvx90w1kx474a6uk.gif" alt="Image description" width="498" height="250"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Data Engineer, What is it?
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fw4qll6i0idnvzeda99y8.gif" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fw4qll6i0idnvzeda99y8.gif" alt="Image description" width="498" height="375"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Okay, so what exactly is a data engineer? In a nutshell from my understanding, we the architects and builders of the data world. We design, construct, and maintain the pipelines that transform raw data into valuable insights.  Think of us like the plumbers of the digital age, but instead of pipes and water, we work with data streams and information flow.&lt;/p&gt;

&lt;p&gt;So far, I've learned that data engineering isn't just about technical skills. It's about understanding the bigger picture of how data is used to drive decisions and solve problems.  We're the bridge between the raw information and the people who need it, and that's pretty cool.&lt;/p&gt;




&lt;h2&gt;
  
  
  Pipeline in Focus
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fh7a6nei39j08205bdo53.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fh7a6nei39j08205bdo53.png" alt="Image description" width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;One of the most fundamental concepts I've encountered is the data pipeline.  it's like an assembly line, but for data. Raw data comes in, gets cleaned, transformed, and then loaded into a place where it can be analyzed.&lt;/p&gt;

&lt;p&gt;There are two main types of pipelines:&lt;/p&gt;

&lt;p&gt;-&lt;strong&gt;ETL (Extract, Transform, Load):&lt;/strong&gt; This is the traditional approach where data is extracted from its source, transformed into a usable format, and then loaded into a data warehouse.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;ELT (Extract, Load, Transform):&lt;/strong&gt;This is a newer approach where data is extracted and loaded into a data lake first, and then transformed as needed.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Understanding how pipelines work and the differences between ETL and ELT is essential for any aspiring data engineer. It's the foundation for building efficient and reliable systems that can handle large volumes of data.&lt;/p&gt;




&lt;h2&gt;
  
  
  Data Lake vs Data Warehouse
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ftg6d71bitveo9y6bieje.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ftg6d71bitveo9y6bieje.png" alt="Image description" width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Now, let's talk about the difference between a data lake and a data warehouse. It's like comparing a vast, wild lake to a well-organized warehouse.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Data Lake:&lt;/strong&gt; This is where you dump all your raw, unprocessed data, regardless of its format or structure. It's a treasure trove of potential insights waiting to be discovered. Think of it as a giant sandbox where you can experiment and play with data before deciding what to do with it.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;-&lt;strong&gt;Data Warehouse:&lt;/strong&gt; This is a more structured environment where data is cleaned, organized, and optimized for analysis. It's like a carefully curated library where you can easily find the information you need.&lt;/p&gt;

&lt;p&gt;Both data lakes and data warehouses have their advantages. Data lakes are great for storing massive amounts of diverse data, while data warehouses are better for quick and easy analysis. The choice depends on your specific needs and use cases.&lt;/p&gt;

&lt;h3&gt;
  
  
  Reflection
&lt;/h3&gt;

&lt;p&gt;So far, That what I learn in my week!&lt;/p&gt;

&lt;p&gt;I have definitely learned a-lot about the power that data has and how it is managed! I can not wait to continue this path that I am going and I make sure to keep you guys Updated ! Thank you so much for reading! &lt;/p&gt;

&lt;p&gt;Stay  learning!&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8rs0xublo3rldwbqu4nx.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/cdn-cgi/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8rs0xublo3rldwbqu4nx.png" alt="Image description" width="220" height="220"&gt;&lt;/a&gt;&lt;/p&gt;

</description>
      <category>database</category>
      <category>data</category>
      <category>tutorial</category>
    </item>
  </channel>
</rss>
