<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Ramya Perumal</title>
    <description>The latest articles on Forem by Ramya Perumal (@ramya_perumal_e93721ef2fa).</description>
    <link>https://forem.com/ramya_perumal_e93721ef2fa</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3900955%2F3e2feb4c-f889-4df6-b8ef-5b1a1ac619ca.png</url>
      <title>Forem: Ramya Perumal</title>
      <link>https://forem.com/ramya_perumal_e93721ef2fa</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/ramya_perumal_e93721ef2fa"/>
    <language>en</language>
    <item>
      <title>Introduction to RAG</title>
      <dc:creator>Ramya Perumal</dc:creator>
      <pubDate>Mon, 27 Apr 2026 19:45:04 +0000</pubDate>
      <link>https://forem.com/ramya_perumal_e93721ef2fa/introduction-to-rag-4d0a</link>
      <guid>https://forem.com/ramya_perumal_e93721ef2fa/introduction-to-rag-4d0a</guid>
      <description>&lt;p&gt;RAG is Retrieval-Augmented Generation.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What is a Model?&lt;/strong&gt;&lt;br&gt;
A model is nothing but an equation.&lt;br&gt;
Example:&lt;br&gt;
y=mx+c&lt;br&gt;
During training, values of x and y will be provided. The model has to find the appropriate values of m and c and try to make a line that best fits the graph. The values of m and c may vary depending on the use case.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What is a Parameter?&lt;/strong&gt;&lt;br&gt;
A parameter is nothing but a variable that is learned during training.&lt;br&gt;
In the above equation:&lt;br&gt;
m is a parameter&lt;br&gt;
c is a parameter&lt;/p&gt;

&lt;p&gt;If the number of parameters is more, the model can learn more complex patterns.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What is Temperature&lt;/strong&gt;&lt;br&gt;
Temperature controls the model's creativity. It usually ranges from 0 to 1.&lt;br&gt;
Lower temperature gives more factual answers.&lt;br&gt;
Higher temperature gives more imaginative answers.&lt;/p&gt;

&lt;p&gt;Temperature is passed along with the prompt input.&lt;br&gt;
Usually, it is kept around 0.5 for balanced output.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;SLM&lt;/strong&gt;&lt;br&gt;
SLM stands for Small Language Model.&lt;br&gt;
It usually has fewer billion parameters and is trained for a particular domain or specific tasks.&lt;br&gt;
Training cost can still be high, similar to LLMs, depending on the use case.&lt;br&gt;
Example: smallest ai - provides voice-based smaller AI models.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;LLM&lt;/strong&gt;&lt;br&gt;
LLM stands for Large Language Model.&lt;br&gt;
It usually has billions of parameters and contains knowledge from many domains. It is called a generalized model.&lt;br&gt;
Example: gpt-oss-120b.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;How LLM Works&lt;/em&gt;&lt;br&gt;
The primary functionality of an LLM is to predict the next word correctly.&lt;br&gt;
It generates text by predicting one word after another based on previous words.&lt;br&gt;
Sometimes LLMs generate incorrect information confidently. This is called hallucination.&lt;br&gt;
Example:&lt;br&gt;
If the model knows about cats and dogs but has limited knowledge about lions, it may generate irrelevant or incorrect content.&lt;br&gt;
Hallucination can be reduced by writing proper prompts and providing correct context.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What is RAG?&lt;/strong&gt;&lt;br&gt;
RAG stands for Retrieval-Augmented Generation.&lt;/p&gt;

&lt;p&gt;It is a method used to provide private or external knowledge such as:&lt;br&gt;
Company policies&lt;br&gt;
HR policy documents&lt;br&gt;
Internal business documents&lt;/p&gt;

&lt;p&gt;This information is given to the LLM so it can generate human-readable answers based on that content.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Where is Private Data Stored?&lt;/em&gt;&lt;br&gt;
Private data is usually stored in a database called a Vector Database.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;How Documents are Stored&lt;/em&gt;&lt;br&gt;
Documents are split into smaller parts called chunks.&lt;br&gt;
These chunks are converted into numerical vectors and stored in the vector database.&lt;/p&gt;

&lt;p&gt;To search relevant chunks quickly, algorithms like:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;ANN&lt;/strong&gt; (Approximate Nearest Neighbors)&lt;br&gt;
&lt;strong&gt;KNN&lt;/strong&gt; (K-Nearest Neighbors) &lt;br&gt;
are commonly used.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>beginners</category>
      <category>machinelearning</category>
      <category>rag</category>
    </item>
  </channel>
</rss>
