<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Pranay Prateek</title>
    <description>The latest articles on Forem by Pranay Prateek (@pranay01).</description>
    <link>https://forem.com/pranay01</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F252086%2F0e28cde3-382d-422c-8f77-dee68b6e09e7.jpg</url>
      <title>Forem: Pranay Prateek</title>
      <link>https://forem.com/pranay01</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/pranay01"/>
    <language>en</language>
    <item>
      <title>SigNoz : Open-source alternative to DataDog</title>
      <dc:creator>Pranay Prateek</dc:creator>
      <pubDate>Sat, 06 Mar 2021 14:38:09 +0000</pubDate>
      <link>https://forem.com/signoz/signoz-open-source-alternative-to-datadog-1kd8</link>
      <guid>https://forem.com/signoz/signoz-open-source-alternative-to-datadog-1kd8</guid>
      <description>&lt;p&gt;More and more companies are now shifting to a cloud-native &amp;amp; microservices based architecture. Having an application monitoring tool is critical in this world because you can’t just log into a machine and figure out what’s going wrong.&lt;/p&gt;

&lt;p&gt;We have spent the last couple of years learning about application monitoring &amp;amp; observability. What are the key features an observability tool should have to enable fast resolution of issues.&lt;/p&gt;

&lt;p&gt;In our opinion, a good observability tools should have&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Out of the box application metrics&lt;/li&gt;
&lt;li&gt;Way to go from metrics to traces to find why some issues are happening&lt;/li&gt;
&lt;li&gt;Seamless flow between metrics, traces &amp;amp; logs — the three pillars of observability&lt;/li&gt;
&lt;li&gt;Filtering of traces based on different tag and filters&lt;/li&gt;
&lt;li&gt;Ability to set dynamic thresholds for alerts&lt;/li&gt;
&lt;li&gt;Transparency in pricing&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  User experience not great in current open-source tools
&lt;/h2&gt;

&lt;p&gt;We found that though there are open-source tools like Prometheus &amp;amp; Jaeger, they don’t provide great user experience like SaaS products do. It takes lots of time and effort to get them working, figuring out the long term storage, etc. And if you want metrics and traces, it’s not possible as Prometheus metrics &amp;amp; Jaeger traces have different formats.&lt;br&gt;
SaaS tools like DataDog and NewRelic do a much better job at many of these aspects&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;They are easy to setup &amp;amp; get started&lt;/li&gt;
&lt;li&gt;Provide out-of-box application metrics&lt;/li&gt;
&lt;li&gt;Provides correlation between metrics &amp;amp; traces&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But it has the following issues&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Crazy node based pricing which doesn’t make sense in today’s micro-services architecture. Any node which is live for more than 8hrs in a month is charged. So, unsuitable for spiky workloads&lt;/li&gt;
&lt;li&gt;Very costly. They charge custom metrics for $5/100 metrics&lt;/li&gt;
&lt;li&gt;It is cloud only, so not suitable for companies which have concerns with sending data outside their infra&lt;/li&gt;
&lt;li&gt;For any small feature, you are dependent on their roadmap. We think this is an unnecessary restriction for a product which is used by developers. A product used by developers should be extendible&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;To fill this gap we built &lt;a href="https://signoz.io" rel="noopener noreferrer"&gt;SigNoz&lt;/a&gt;, an open-source alternative to DataDog.&lt;/p&gt;

&lt;p&gt;Some of our key features which makes us vastly superior to current open-source products&lt;/p&gt;

&lt;h2&gt;
  
  
  Out of the box application metrics
&lt;/h2&gt;

&lt;p&gt;Get p90, p99 latencies, RPS, Error rates and top endpoints for a service out of the box.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F87h4jc87f708uf4hzz29.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F87h4jc87f708uf4hzz29.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Seamless flow between metrics &amp;amp; traces
&lt;/h2&gt;

&lt;p&gt;Found something suspicious in a metric, just click that point in the graph &amp;amp; get details of traces which may be causing the issues. Seamless, Intuitive.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqffkraz3599r2qlsi9g0.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqffkraz3599r2qlsi9g0.png" alt="Out-of-box application metrics"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Filtering based on tags
&lt;/h2&gt;

&lt;p&gt;for example you can find latency experienced by customers who have customer_type set as premium&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F1y634yg27orf80nkqqg5.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F1y634yg27orf80nkqqg5.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Custom aggregates on filtered traces
&lt;/h2&gt;

&lt;p&gt;Create custom metrics from filtered traces to find metrics of any type of requests. Want to find p99 latency of customer_type: premium who are seeing status_code:400. Just set the filters, and you have the graph. Boom!&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fohzj4k73zmpq8t474rd1.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fohzj4k73zmpq8t474rd1.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Transparent usage Data
&lt;/h2&gt;

&lt;p&gt;You can drill down details of how many events is each application sending or at what granularity, so that you can adjust your sampling rate as needed and not get a shock at the end of the month ( case with SaaS vendors many a times)&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fvcwprvmbzyzl1f3qajrn.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fvcwprvmbzyzl1f3qajrn.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Detailed Flamegraphs
&lt;/h2&gt;

&lt;p&gt;Detailed flamegraph to find exact cause of the issue, and which of the underlying requests is causing the problem. Is it a SQL query gone rogue or a redis operation is causing an issue&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fe44qfvqt8roewww4e8nb.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fe44qfvqt8roewww4e8nb.png" alt="Alt Text"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;Check out our &lt;a href="https://github.com/signoz/signoz" rel="noopener noreferrer"&gt;Github repo&lt;/a&gt; &amp;amp; give it a try. We would love any feedback on what you like or what doesn’t make sense. We are also active on &lt;a href="https://join.slack.com/t/signoz-community/shared_invite/zt-lrjknbbp-J_mI13rlw8pGF4EWBnorJA" rel="noopener noreferrer"&gt;Slack&lt;/a&gt;, so give us a shout out there and we would be happy to answer any questions or help you set things up.&lt;/p&gt;

</description>
      <category>devops</category>
      <category>sre</category>
      <category>showdev</category>
      <category>kubernetes</category>
    </item>
    <item>
      <title>Monitoring and Observability related questions? (What should I write about?)</title>
      <dc:creator>Pranay Prateek</dc:creator>
      <pubDate>Sun, 10 May 2020 12:04:18 +0000</pubDate>
      <link>https://forem.com/pranay01/monitoring-and-observability-related-questions-what-should-i-write-about-5cjl</link>
      <guid>https://forem.com/pranay01/monitoring-and-observability-related-questions-what-should-i-write-about-5cjl</guid>
      <description>&lt;p&gt;Hello!&lt;/p&gt;

&lt;p&gt;I'm the co-founder of &lt;a href="https://signoz.io"&gt;SigNoz&lt;/a&gt; , a lightweight application monitoring tools and I live and breath monitoring &amp;amp; observability :)&lt;/p&gt;

&lt;p&gt;I'd like to contribute to the Dev community, answering questions relating to monitoring &amp;amp; observability.&lt;/p&gt;

&lt;p&gt;What questions do you all have? / What should I write about?&lt;/p&gt;

&lt;p&gt;Pranay&lt;/p&gt;

</description>
      <category>devops</category>
      <category>microservices</category>
      <category>monitoring</category>
      <category>sre</category>
    </item>
    <item>
      <title>Ask DEV: LightWeight APM for Kubernetes using OpenTelemetry?</title>
      <dc:creator>Pranay Prateek</dc:creator>
      <pubDate>Thu, 02 Apr 2020 06:52:48 +0000</pubDate>
      <link>https://forem.com/pranay01/ask-dev-lightweight-apm-for-kubernetes-using-opentelemetry-n6j</link>
      <guid>https://forem.com/pranay01/ask-dev-lightweight-apm-for-kubernetes-using-opentelemetry-n6j</guid>
      <description>&lt;p&gt;After going through monitoring and tracing solutions in Prometheus, DataDog, NewRelic, and other players like LightStep, HoneyComb, Instana, etc, I still don't see a product that is simple and easy to use for people who don't need to do the heavyweight RCA.&lt;/p&gt;

&lt;p&gt;DataDog still remains the only option for companies that spend in the ticket size &amp;lt; 2000 USD per month in APM solutions, but they seem to be very complex to me. Another option is shifting to OSS tools using Prometheus, OpenTracing, OpenTelemetry. But then you need to spend a lot of time in learning PromQL, HA setup, Storage and building Grafana dashboard.&lt;/p&gt;

&lt;p&gt;All vendors doing tracing don't seem to sample data to enable metrics collected over traces and enable RCA which come at a huge cost of storage (the pricing plan of these vendors can make a small company sweat). Sending data when my application is running fine seems to add little value to cost.&lt;/p&gt;

&lt;p&gt;I see a product gap that tries to address the low ticket-size users (&amp;lt; $2000 spend per month on APM) of all APM players with below plans and is based on OSS tools like Prometheus/OpenTracing/Opentelemetry:&lt;/p&gt;

&lt;p&gt;Plan 1 - 40% of the cost by other vendors (only Metrics) - Converting OpenTracing instrumentation to useful Prometheus metrics like in chapter 11 of Mastering Distributed Tracing. A rather detailed metrics from APM perspective like RPS + Latencies + Slowest queries of Redis, Mongo, MySql, etc. Also, metrics aggregated by endpoints of the application.&lt;/p&gt;

&lt;p&gt;Plan 2 - 60% of the cost by other vendors (Metrics + sampled traces) - Tail Based Sampling based on anomaly found by gathered metrics from plan 1. This will send only the trace needed for debugging the anomaly and thus will be a huge cost saver.&lt;/p&gt;

&lt;p&gt;Plan 3 - 100% of the cost by other vendors (100% of traces) - Full-fledged enterprise plan sending full traces for RCA and better debugging.&lt;/p&gt;

&lt;p&gt;My current understanding is all the APM players are focussing on higher ticket customers (Plan 3) right now.&lt;/p&gt;

&lt;p&gt;Wondering if a solution that is lightweight and cost-effective ( &amp;lt; 2000 USD per month) built natively for Kubernetes will be interesting to you? What are the features you would like to see there?&lt;/p&gt;

</description>
      <category>discuss</category>
      <category>sre</category>
      <category>devops</category>
      <category>kubernetes</category>
    </item>
    <item>
      <title>Folks, what are some conferences in DevOps/SRE space that you look forward to? </title>
      <dc:creator>Pranay Prateek</dc:creator>
      <pubDate>Mon, 10 Feb 2020 13:12:49 +0000</pubDate>
      <link>https://forem.com/pranay01/folks-what-are-some-conferences-in-devops-sre-space-that-you-look-forward-to-4li2</link>
      <guid>https://forem.com/pranay01/folks-what-are-some-conferences-in-devops-sre-space-that-you-look-forward-to-4li2</guid>
      <description>&lt;p&gt;This is what I have till now. Please add any suggestions in comments&lt;/p&gt;

&lt;p&gt;&lt;a href="https://res.cloudinary.com/practicaldev/image/fetch/s--GBHoGEds--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://pbs.twimg.com/media/EQaLqEiUYAYdFOO%3Fformat%3Djpg%26name%3D900x900" class="article-body-image-wrapper"&gt;&lt;img src="https://res.cloudinary.com/practicaldev/image/fetch/s--GBHoGEds--/c_limit%2Cf_auto%2Cfl_progressive%2Cq_auto%2Cw_880/https://pbs.twimg.com/media/EQaLqEiUYAYdFOO%3Fformat%3Djpg%26name%3D900x900" alt="Image"&gt;&lt;/a&gt;&lt;/p&gt;

</description>
      <category>kubernetes</category>
      <category>sre</category>
      <category>devops</category>
    </item>
  </channel>
</rss>
