<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: lidianycs</title>
    <description>The latest articles on Forem by lidianycs (@lidianycs).</description>
    <link>https://forem.com/lidianycs</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F985550%2Ffea388cc-923a-46e8-b9f9-3aac294bde8b.jpeg</url>
      <title>Forem: lidianycs</title>
      <link>https://forem.com/lidianycs</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/lidianycs"/>
    <language>en</language>
    <item>
      <title>How I Built a Tool to Detect AI-Generated Fake References</title>
      <dc:creator>lidianycs</dc:creator>
      <pubDate>Mon, 05 Jan 2026 16:27:42 +0000</pubDate>
      <link>https://forem.com/lidianycs/how-i-built-a-tool-to-detect-ai-generated-fake-references-2891</link>
      <guid>https://forem.com/lidianycs/how-i-built-a-tool-to-detect-ai-generated-fake-references-2891</guid>
      <description>&lt;p&gt;Large Language Models(LLMs) have become part of everyday academic and technical writing. But there is a problem the &lt;a href="https://www.rollingstone.com/culture/culture-features/ai-chatbot-journal-research-fake-citations-1235485484/" rel="noopener noreferrer"&gt;academic community has been flagging for a while&lt;/a&gt;, and many of us have encountered it firsthand: LLMs are very good at inventing citations. They may look plausible and almost match real papers. But they confidently cite work that does not exist at all. The academic community is calling them &lt;em&gt;Ghost References&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;

&lt;/p&gt;
&lt;blockquote class="bluesky-embed"&gt;
&lt;p&gt;Closing out my year with a journal editor shocker 🧵

Checking new manuscripts today I reviewed a paper attributing 2 papers to me I did not write. A daft thing for an author to do of course. But intrigued I web searched up one of the titles and that's when it got real weird...&lt;/p&gt;— &lt;a href="https://bsky.app/profile/did:plc:rl2szulxujlgdcmx4avx7jyn?ref_src=embed" rel="noopener noreferrer"&gt;Ben Williamson (@benpatrickwill.bsky.social)&lt;/a&gt; &lt;a href="https://bsky.app/profile/did:plc:rl2szulxujlgdcmx4avx7jyn/post/3mae76wbmcs2n?ref_src=embed" rel="noopener noreferrer"&gt;2025-12-19T17:20:04.127Z&lt;/a&gt;
&lt;/blockquote&gt;




&lt;p&gt;As Professor &lt;a href="https://www.linkedin.com/posts/ben-williamson-7a501b329_why-ghost-references-still-haunt-us-in-2025-activity-7409278908543700993-UMp-?utm_source=social_share_send&amp;amp;utm_medium=member_desktop_web&amp;amp;rcm=ACoAADPYmjUBpwl8s3ILc7e0mxeJeqduUnrd8iQ" rel="noopener noreferrer"&gt;Ben Williamson&lt;/a&gt; and &lt;a href="https://aarontay.substack.com/p/why-ghost-references-still-haunt" rel="noopener noreferrer"&gt;Aaron Tay explained&lt;/a&gt;, the root problem is deep-seated:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;"The ghost reference problem is a chronic condition that has become acute. The infection predates GenAI; the technology has simply lowered our immune response while accelerating transmission."&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The issue is compounded because LLMs with general web search capabilities can fail to reliably verify references, as the web itself contains fake citations, creating a dangerous feedback loop. Despite being wrong, the sources are widely assumed to be authentic, the more they appear in published literature. For instance, one of the &lt;em&gt;Ghost References&lt;/em&gt; to Prof. Williamson's work has accumulated &lt;a href="https://bsky.app/profile/did:plc:rl2szulxujlgdcmx4avx7jyn/post/3mae7iq3ccc2n?ref_src=embed" rel="noopener noreferrer"&gt;43 citations in Google Scholar&lt;/a&gt;. &lt;/p&gt;

&lt;h2&gt;
  
  
  Addressing the Reviewer's Burden
&lt;/h2&gt;

&lt;p&gt;Peer reviewers are already stretched thin, and now, due to the proliferation of fake references, &lt;a href="https://www.linkedin.com/posts/jan-laksar-015173b9_my-first-encounter-with-ai-generated-activity-7401270725833019392-D5dC?utm_source=share&amp;amp;utm_medium=member_desktop&amp;amp;rcm=ACoAADPYmjUBpwl8s3ILc7e0mxeJeqduUnrd8iQ" rel="noopener noreferrer"&gt;they have to manually copy-paste every single reference&lt;/a&gt; into a search engine to verify its existence.&lt;/p&gt;

&lt;p&gt;This is a tedious, low-reward task often skipped in favor of focusing on the paper's actual content. But this "verification gap" is exactly where ghost references can slip through.&lt;/p&gt;

&lt;h2&gt;
  
  
  When It Happened to Me
&lt;/h2&gt;

&lt;p&gt;That abstract concern turned into a concrete problem worth addressing &lt;a href="https://www.linkedin.com/posts/lidianycs_academichumor-aiinscience-researchethics-activity-7394350857963679744-tSmn?utm_source=share&amp;amp;utm_medium=member_desktop&amp;amp;rcm=ACoAADPYmjUBpwl8s3ILc7e0mxeJeqduUnrd8iQ" rel="noopener noreferrer"&gt;when I discovered my own paper had been incorrectly cited in a published paper&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Seeing the flawed metadata published in a journal was a wake-up call that led me to build &lt;strong&gt;&lt;a href="https://github.com/lidianycs/cerca" rel="noopener noreferrer"&gt;CERCA&lt;/a&gt;&lt;/strong&gt;, an open-source tool designed to assist researchers, reviewers, and editors in quickly verifying the accuracy of references. It was developed to improve trust, transparency, and reliability academic writing.&lt;/p&gt;

&lt;h2&gt;
  
  
  What Is CERCA?
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://github.com/lidianycs/cerca" rel="noopener noreferrer"&gt;CERCA&lt;/a&gt; stands for &lt;strong&gt;Citation Extraction &amp;amp; Reference Checking Assistant&lt;/strong&gt;. &lt;/p&gt;

&lt;p&gt;Here's what it looks like in action:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffkg4bu2ygpzcpkpofabj.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffkg4bu2ygpzcpkpofabj.png" alt="Cerca checking a file"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;In seconds, CERCA:&lt;/strong&gt;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Scans a PDF and extracts the references&lt;/li&gt;
&lt;li&gt;Queries OpenAlex, Crossref, and Zenodo&lt;/li&gt;
&lt;li&gt;Flags potentially invalid citations with confidence scores&lt;/li&gt;
&lt;li&gt;Shows you which metadata fields don't match&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Instead of copy-pasting each reference manually, you get a verification report you can review in minutes.CERCA automates the tedious process of verifying whether the papers cited in a PDF file actually exist and if the metadata is accurate. &lt;/p&gt;

&lt;h2&gt;
  
  
  Development Insights
&lt;/h2&gt;

&lt;p&gt;Building CERCA required solving a few interesting engineering challenges, particularly around fuzzy matching and bibliographic parsing.&lt;/p&gt;

&lt;p&gt;Academic citations are messy. They come in dozens of formats (APA, MLA, IEEE, ACM, Vancouver, etc.). Creating a parser that could reliably extract these references without false positives was the first hurdle. I used &lt;a href="http://cermine.ceon.pl/index.html" rel="noopener noreferrer"&gt;Cermine&lt;/a&gt;, a Java library, to handle the heavy lifting of PDF parsing and metadata extraction.&lt;/p&gt;

&lt;p&gt;The second was the verification logic. I used fuzzy matching to determine if a citation is close enough to be a typo or far enough to be a hallucination. Here's what the tool can detect:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Cerqueira, M.; Tavares, A.; Couto, C.; Maciel, R.; Santos, D.; Figueira, A. "Assessing software practitioners' work engagement and job satisfaction.&lt;/em&gt;" [&lt;strong&gt;Example of Ghost Citation&lt;/strong&gt;]&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;strong&gt;CERCA detects:&lt;/strong&gt;&lt;br&gt;
⚠️ Author list mismatch (6 fabricated, 9 omitted)&lt;br&gt;
⚠️ Title incomplete &lt;br&gt;
⚠️ First author name inconsistency&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Correct paper reference:&lt;/strong&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Cerqueira, L., Nunes, L., Guerra, R., Malheiros, V., Freire, S., Carneiro, G., ... &amp;amp; Mendonça, M. (2025). Assessing Software Practitioners’ Work Engagement and Job Satisfaction in a Large Software Company—What We Have Learned. SN Computer Science, 6(3), 273.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;🗃️ CERCA queries trusted repositories (OpenAlex, Crossref, Zenodo) and &lt;br&gt;
uses fuzzy matching to catch these discrepancies, saving reviewers from &lt;br&gt;
manually checking each citation. &lt;/p&gt;

&lt;p&gt;&lt;strong&gt;🔍 Manual Fallback:&lt;/strong&gt; If automatic search fails, you can right-click to &lt;br&gt;
search for reference titles manually.&lt;/p&gt;

&lt;p&gt;🔐 Due to reviewers' confidentiality, I put privacy first by design: PDFs are not uploaded and never leave your machine. All PDF parsing and reference extraction are performed locally.&lt;/p&gt;

&lt;h3&gt;
  
  
  Tech Stack
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Java + JavaFX&lt;/strong&gt; – Cross-platform desktop application&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Cermine&lt;/strong&gt; – PDF parsing and metadata extraction&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;OpenAlex, Crossref, Zenodo APIs&lt;/strong&gt; – Reference verification&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;JavaWuzzy&lt;/strong&gt; – Handles citation variations and typos &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;I chose this stack to build a Java desktop app using JavaFX for cross-platform compatibility (Windows, Mac, Linux).&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Open Source?
&lt;/h2&gt;

&lt;p&gt;Due to the tool's purpose, it must be transparent itself. Besides, this is a collective problem. By making CERCA open source, I'm inviting the community to audit the code, improve the parsers, and integrate more databases.  &lt;/p&gt;

&lt;p&gt;It is licensed under the &lt;strong&gt;GNU Affero General Public License&lt;/strong&gt; (AGPL-3.0).&lt;/p&gt;

&lt;h2&gt;
  
  
  Who Can Use CERCA?
&lt;/h2&gt;

&lt;p&gt;It is useful for anyone working on scholarly or technical writing. It is intended for:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Researchers&lt;/strong&gt; performing final manuscript checks&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Reviewers&lt;/strong&gt; assessing reference consistency&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Editors&lt;/strong&gt; supporting editorial quality control&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Meta-research&lt;/strong&gt; and reproducibility workflows&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Join the Effort
&lt;/h2&gt;

&lt;p&gt;Ghost references are threatening scholarly trust. CERCA is a start, but it &lt;br&gt;
needs your expertise:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Try it now:&lt;/strong&gt; &lt;br&gt;
📥 &lt;a href="https://github.com/lidianycs/cerca/releases/tag/v1.1-alpha" rel="noopener noreferrer"&gt;Download CERCA&lt;/a&gt; &lt;br&gt;
(Windows | Mac | Linux) &lt;/p&gt;

&lt;p&gt;Cerca does not solve the problem of ghost references, and it is not yet finished. It is a small, practical step. If it helps a researcher catch one incorrect reference, saves a reviewer time, or encourages more critical engagement with AI-generated text, then it is already serving its purpose. But you can &lt;strong&gt;help improve it:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;🐛 Found an edge case? &lt;/li&gt;
&lt;li&gt;💡 Have ideas? &lt;/li&gt;
&lt;li&gt;🔧 Want to contribute? &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;👉🏾 Download the tool and explore the &lt;a href="https://github.com/lidianycs/cerca" rel="noopener noreferrer"&gt;repository here&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This project is a work in progress and an invitation to the research and developer communities to experiment, evaluate, and build better tools together. &lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Share your results:&lt;/strong&gt; Did CERCA catch a ghost reference in your work? I'd love to hear about it in the comments.&lt;/p&gt;

</description>
      <category>opensource</category>
      <category>java</category>
      <category>github</category>
      <category>programming</category>
    </item>
    <item>
      <title>What Does Empathy Really Mean in Software Development?</title>
      <dc:creator>lidianycs</dc:creator>
      <pubDate>Tue, 15 Jul 2025 16:15:27 +0000</pubDate>
      <link>https://forem.com/lidianycs/what-does-empathy-really-mean-in-software-development-36j6</link>
      <guid>https://forem.com/lidianycs/what-does-empathy-really-mean-in-software-development-36j6</guid>
      <description>&lt;p&gt;Empathy is often seen as a “nice-to-have” in tech, but what if it’s actually essential? In the first part of our articles about empathy in software engineering, we explore how developers define empathy, in their own words.&lt;/p&gt;

&lt;p&gt;We just published a new study that explores what empathy looks like from the perspective of real software practitioners, using 55 blog posts from communities like DEV and Medium, plus insights from a follow-up survey with empathy experts.&lt;/p&gt;

&lt;p&gt;The study was recently accepted at ACM Transactions on Software Engineering and Methodology (TOSEM), a leading peer-reviewed journal for high-quality research in software engineering. Read the preprint here:&lt;br&gt;
👉 &lt;a href="https://arxiv.org/abs/2507.05325" rel="noopener noreferrer"&gt;Exploring Empathy in Software Engineering&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;👀 Why Study Empathy?&lt;/strong&gt;&lt;br&gt;
Empathy helps us communicate, collaborate, and work better as teams. But in software engineering, it's often overlooked or misunderstood. We wanted to change that by listening to developers who’ve reflected publicly on their experiences.&lt;/p&gt;

&lt;p&gt;🔍&lt;strong&gt;What We Found&lt;/strong&gt;&lt;br&gt;
Through qualitative content analysis, we identified how empathy shows up, where it breaks down, and what it can achieve when practiced well.&lt;/p&gt;

&lt;h2&gt;
  
  
  💡 &lt;strong&gt;5 Meanings of Empathy in SE&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;From these reflections, five key themes emerged:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;Understanding – the most common definition: grasping how someone thinks or feels&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;“[Empathy is] the ability to understand how a person feels and what they might be thinking.”&lt;/em&gt; —P34&lt;/p&gt;
&lt;/blockquote&gt;
&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Perspective Taking – seeing a situation through another person’s eyes&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;“It’s the ability to see things as if from another’s perspective.”&lt;/em&gt; —P39&lt;/p&gt;
&lt;/blockquote&gt;

&lt;ul&gt;
&lt;li&gt;Embodiment – putting yourself in someone else's shoes (e.g., teammates, users)&lt;/li&gt;
&lt;/ul&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;“Empathy is the ability to put yourself in the other person’s shoes.”&lt;/em&gt; —P40&lt;/p&gt;
&lt;/blockquote&gt;

&lt;ul&gt;
&lt;li&gt;Compassion – caring about the people you work with&lt;/li&gt;
&lt;/ul&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;“Caring about the people you work with, not just the work you do.”&lt;/em&gt; —P48&lt;/p&gt;
&lt;/blockquote&gt;

&lt;ul&gt;
&lt;li&gt;Emotional Sharing – feeling what others feel, like mirroring stress or anxiety&lt;/li&gt;
&lt;/ul&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;“[Empathy] includes mirroring what that person is feeling.”&lt;/em&gt; —P54&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;em&gt;We refer to each practitioner by an ID number, namely P1, P2, ... Pn.&lt;/em&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  🧠 &lt;strong&gt;A Multi-Faceted View&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;We grouped these meanings using a well-known psychological model:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;🧠 Cognitive empathy: understanding, perspective taking, embodiment&lt;/li&gt;
&lt;li&gt;💙 Compassionate empathy: caring about others’ well-being&lt;/li&gt;
&lt;li&gt;💫 Emotional empathy: sharing someone’s emotional state&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;While other models in engineering highlight understanding and perspective taking, our study uniquely includes compassion, a crucial but often overlooked aspect.&lt;/p&gt;

&lt;p&gt;✨ &lt;strong&gt;Why It Matters&lt;/strong&gt;&lt;br&gt;
This isn’t just about “being nice.” Empathy is a socio-technical skill that can improve software quality and well-being. By making empathy visible, naming its blockers, and offering strategies, our work helps teams reflect, adapt, and grow.&lt;/p&gt;

&lt;p&gt;There’s no universal definition of empathy in software engineering. By gathering how real devs talk about it, we help build a clearer and more practical understanding that reflects the human side of coding, collaborating, and caring.&lt;/p&gt;

&lt;p&gt;🙏 &lt;strong&gt;Thank You!&lt;/strong&gt;&lt;br&gt;
We’re deeply grateful to the software practitioners who shared their stories, whether through personal blog posts or by offering feedback in our follow-up survey. Your reflections gave this research meaning and depth. This study wouldn’t exist without your willingness to speak openly about the challenges and possibilities of empathy in software development.&lt;/p&gt;

&lt;p&gt;🔭 &lt;strong&gt;What’s Next?&lt;/strong&gt;&lt;br&gt;
This is just one step in our journey! Next up: &lt;strong&gt;what gets in the way of empathy at work?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;We’re also expanding this work through a mixed-methods study in software companies, combining survey data and qualitative insights to better understand how empathy is practiced in real-world teams.&lt;br&gt;
As we continue, we’re also refining and evolving the conceptual framework based on this new data, so it can be even more actionable and relevant for devs, managers, and educators.&lt;br&gt;
If you're passionate about empathy in tech or working to create more human-centered workplaces, let’s connect. And stay tuned, we’ll share more as this next phase unfolds.&lt;/p&gt;

&lt;p&gt;🙌 &lt;strong&gt;Check It Out&lt;/strong&gt;&lt;br&gt;
📖 Full paper: &lt;a href="https://arxiv.org/abs/2507.05325" rel="noopener noreferrer"&gt;https://arxiv.org/abs/2507.05325&lt;/a&gt;&lt;br&gt;
🔍 Dataset for replication and reuse: &lt;a href="https://doi.org/10.5281/zenodo.15800354" rel="noopener noreferrer"&gt;https://doi.org/10.5281/zenodo.15800354&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;We’d love to hear your thoughts:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;em&gt;What does empathy look like in your day-to-day work?&lt;/em&gt;&lt;/li&gt;
&lt;li&gt;&lt;em&gt;Have you seen it foster (or falter) in your team?&lt;/em&gt;&lt;/li&gt;
&lt;li&gt;&lt;em&gt;What helps or hinders empathy in your org?&lt;/em&gt;&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>empathy</category>
      <category>softwareengineering</category>
      <category>research</category>
      <category>developers</category>
    </item>
    <item>
      <title>Exploring the DEV Community as a Data Source for Human Aspects in Software Engineering Research</title>
      <dc:creator>lidianycs</dc:creator>
      <pubDate>Thu, 10 Jul 2025 18:42:38 +0000</pubDate>
      <link>https://forem.com/lidianycs/using-the-dev-community-as-a-data-source-for-human-aspects-in-software-engineering-research-h56</link>
      <guid>https://forem.com/lidianycs/using-the-dev-community-as-a-data-source-for-human-aspects-in-software-engineering-research-h56</guid>
      <description>&lt;p&gt;The DEV community is a rich and underutilized data source for Software Engineering (SE) research. In my PhD, I've been using DEV articles since 2022 to build a conceptual framework of empathy in SE, capturing how empathy is perceived, practiced, and challenged across roles, tasks, and organizational settings. In this post, I’ll describe how to collect and analyze DEV.to content for research purposes, especially for those interested in the social dimensions of software work.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;You’ll learn:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Why DEV.to is a valuable source for qualitative studies&lt;/li&gt;
&lt;li&gt;How to extract articles using a Python script or Google Sheets&lt;/li&gt;
&lt;li&gt;A brief overview of how to conduct a qualitative analysis&lt;/li&gt;
&lt;li&gt;Links to open-access tools, scripts, and published studies&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Whether you're a researcher, a student, or just curious about how developers’ voices can inform SE research, this post offers practical steps and reflections to help you get started.&lt;/p&gt;

&lt;p&gt;The DEV platform, with its long-form posts written by developers, provides candid insights into how practitioners think, feel, collaborate, and grow. These reflections help us understand topics like &lt;a href="https://dl.acm.org/doi/abs/10.1145/3613372.3613407?casa_token=Wf5OizZipfoAAAAA:u4l6RvHvZjq4glxLpnV6d9xkw-M8Xma5kUw5F3uXYllYoaKlMnNVm0subeJ9D6j2bVk8TS1B1Z4" rel="noopener noreferrer"&gt;empathy&lt;/a&gt;, &lt;a href="https://ieeexplore.ieee.org/abstract/document/10475379?casa_token=LLQnwlTwH28AAAAA:MZfqSm76pBqI3GsCU-bSV6CjaY2EfDVYLWMJi9E1VSeWeYIDuzBn7hzArXsVEHzwMTpP2Kkp" rel="noopener noreferrer"&gt;well-being and mental health&lt;/a&gt;, &lt;a href="https://arxiv.org/abs/2507.05325" rel="noopener noreferrer"&gt;inclusion, communication, and collaboration&lt;/a&gt; in real-world software practice.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;DEV is not just a place for tutorials and code snippets! It's a vibrant community where developers openly share their experiences, struggles, and values, making it a valuable site for qualitative research.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  Why DEV?
&lt;/h2&gt;

&lt;p&gt;Unlike surveys or interviews, which may be shaped by researcher framing or social desirability bias, DEV posts represent voluntary, organic reflections. These narratives often cover deeply human experiences: burnout, team dynamics, empathy, mentorship, psychological safety, and more.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Grey Literature (GL)&lt;/strong&gt;, including blog posts, articles, and forum discussions, is a valuable source of data for SE researchers, especially in areas involving human and social dimensions.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F41h8pu68rz64q1lf8t42.png" alt="Using the DEV Community as a Data Source for Human Aspects in Software Engineering Research" width="800" height="277"&gt;&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;
&lt;em&gt;The process to collect the web articles.&lt;/em&gt; &lt;a href="https://arxiv.org/abs/2507.05325" rel="noopener noreferrer"&gt;Source&lt;/a&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  How to Retrieve Articles from DEV?
&lt;/h2&gt;

&lt;p&gt;I've used two methods to collect articles tagged with the keyword empathy.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1. Python Scraper&lt;/strong&gt;: &lt;br&gt;
You can clone and run our Python script from Zenodo:&lt;/p&gt;

&lt;p&gt;&lt;code&gt;python dev_scraper.py&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;📦 The scraper and full replication package are publicly available on Zenodo:&lt;br&gt;
👉 &lt;a href="https://zenodo.org/records/15800354" rel="noopener noreferrer"&gt;https://zenodo.org/records/15800354&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Google Sheets + IMPORTJSON.gs:&lt;/strong&gt;&lt;br&gt;
If you prefer a low-code approach, use the &lt;code&gt;IMPORTJSON.gs&lt;/code&gt; script:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Download &lt;a href="https://github.com/bradjasper/ImportJSON/" rel="noopener noreferrer"&gt;the script here&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;Open a new Google Sheets document.&lt;/li&gt;
&lt;li&gt;Go to Extensions &amp;gt; Apps Script and paste the script code.&lt;/li&gt;
&lt;li&gt;Use the code below to pull articles tagged with empathy:&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;code&gt;=ImportJSON("https://dev.to/api/articles?tag=empathy&amp;amp;per_page=1000")&lt;/code&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;You can change the tag or increase the &lt;code&gt;per_page&lt;/code&gt; value for broader queries.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Analyzing the Data
&lt;/h2&gt;

&lt;p&gt;Once you have the articles, you can conduct a qualitative analysis process to explore themes, perceptions, and experiences, like this:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Clean and prepare the data (remove duplicates, unrelated posts).&lt;/li&gt;
&lt;li&gt;Import into a qualitative tool (spreadsheets or tools like Atlas.ti, MAXQDA, or Taguette also work).&lt;/li&gt;
&lt;li&gt;Inductive coding to identify emerging themes.&lt;/li&gt;
&lt;li&gt;Synthesize themes to build conceptual categories (e.g., empathy practices, barriers, effects).&lt;/li&gt;
&lt;li&gt;Triangulate or validate with experts or additional sources.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This approach was adopted to build a framework of empathy in software engineering based on practitioners' own voices. 👉 &lt;a href="https://arxiv.org/abs/2507.05325" rel="noopener noreferrer"&gt;Check it here.&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Want to Go Further?
&lt;/h2&gt;

&lt;p&gt;If you’re interested in how DEV data can be used in real research, here are the studies using articles from the DEV community to explore empathy in SE:&lt;/p&gt;

&lt;h3&gt;
  
  
  📝 Published Studies Using DEV as Data Source
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;A Thematic Synthesis on Empathy in Software Engineering based on the Practitioners' Perspective, SBES 2023&lt;br&gt;
🔗 &lt;a href="https://doi.org/10.1145/3613372.3613407" rel="noopener noreferrer"&gt;https://doi.org/10.1145/3613372.3613407&lt;/a&gt;&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Empathy and Its Effects on Software Practitioners’ Well-Being and Mental Health, IEEE Software, 2024&lt;br&gt;
🔗 &lt;a href="https://doi.org/10.1109/MS.2024.3377897" rel="noopener noreferrer"&gt;https://doi.org/10.1109/MS.2024.3377897&lt;/a&gt;&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;Exploring Empathy in Software Engineering: Insights from a Grey Literature Analysis of Practitioners' Perspectives, TOSEM, 2025&lt;br&gt;
🔗 &lt;a href="https://arxiv.org/abs/2507.05325" rel="noopener noreferrer"&gt;https://arxiv.org/abs/2507.05325&lt;/a&gt; (preprint)&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  📖 Related Research
&lt;/h3&gt;

&lt;p&gt;If you're looking to explore Grey Literature and online communities as research data in SE, you can start with those studies: &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Mining DEV for social and technical insights about software development
🔗 &lt;a href="https://doi.org/10.1109/MSR52588.2021.00053" rel="noopener noreferrer"&gt;https://doi.org/10.1109/MSR52588.2021.00053&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;What Evidence We Would Miss If We Do Not Use Grey Literature?
🔗 &lt;a href="https://doi.org/10.1145/3475716.3475777" rel="noopener noreferrer"&gt;https://doi.org/10.1145/3475716.3475777&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Final Thoughts
&lt;/h2&gt;

&lt;p&gt;If you're interested in exploring human aspects of software work, from empathy to inclusion, communication to stress, the DEV community is an excellent place to start. It’s full of rich, first-person narratives that reflect the lived realities of developers worldwide.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;🔍 Whether you're an SE researcher or a curious practitioner, DEV offers insights that go far beyond code!&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;🙏 &lt;strong&gt;Thank you, DEV community!&lt;/strong&gt; I'd like to acknowledge the generosity of developers who openly share their experiences, reflections, and challenges here on DEV. To everyone who writes, reads, comments, and supports this community: thank you for contributing to knowledge exchange and inspiring meaningful, human-centered research in software engineering.&lt;/p&gt;

&lt;p&gt;Finally, if you'd like to know more, check out the published studies or reach out. I’m happy to share experiences and support others on this journey!&lt;/p&gt;

</description>
      <category>softwareengineering</category>
      <category>research</category>
      <category>qualitativeresearch</category>
      <category>openscience</category>
    </item>
  </channel>
</rss>
