RAG4Tickets: AI-Powered Ticket Resolution via Retrieval-Augmented Generation on JIRA and GitHub Data

AI-generated keywords: Software development Fragmented knowledge Retrieval-Augmented Generation (RAG) Unified pipeline DevOps workflows

AI-generated Key Points

  • Challenges faced by software teams in resolving recurring or related issues due to fragmented knowledge across platforms like JIRA tickets, developer discussions, and GitHub pull requests
  • Introduction of the Retrieval-Augmented Generation (RAG) framework integrating Sentence-Transformers with FAISS-based vector search for context-aware ticket resolution recommendations
  • Utilization of historical JIRA tickets, user comments, and linked PR metadata to retrieve semantically similar past cases and generate explainable resolution suggestions using a Large Language Model (LLM)
  • Aim of the framework to improve resolution accuracy, fix quality, and knowledge reuse in modern DevOps environments through a unified pipeline linking JIRA and GitHub data
  • Challenges in implementing the RAG framework such as unreliable suggestions from LLM, historical data bias, dataset drift, lack of explainability, scalability issues, and cost overhead
  • Proposed solutions for overcoming challenges including retrieval grounding through evidence-weighted decoding, temporal relevance weighting for dataset drift, explainability enhancements, periodic re-indexing and embedding refresh
  • Future work suggestions like adaptive learning pipelines with online embedding updates for scalability, model distillation for new domains, incremental fine-tuning driven by recent commits for evolving enterprise data
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Mohammad Baqar

13 Pages
License: CC BY 4.0

Abstract: Modern software teams frequently encounter delays in resolving recurring or related issues due to fragmented knowledge scattered across JIRA tickets, developer discussions, and GitHub pull requests (PRs). To address this challenge, we propose a Retrieval-Augmented Generation (RAG) framework that integrates Sentence-Transformers for semantic embeddings with FAISS-based vector search to deliver context-aware ticket resolution recommendations. The approach embeds historical JIRA tickets, user comments, and linked PR metadata to retrieve semantically similar past cases, which are then synthesized by a Large Language Model (LLM) into grounded and explainable resolution suggestions. The framework contributes a unified pipeline linking JIRA and GitHub data, an embedding and FAISS indexing strategy for heterogeneous software artifacts, and a resolution generation module guided by retrieved evidence. Experimental evaluation using precision, recall, resolution time reduction, and developer acceptance metrics shows that the proposed system significantly improves resolution accuracy, fix quality, and knowledge reuse in modern DevOps environments.

Submitted to arXiv on 09 Oct. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2510.08667v1

In modern software development environments, software teams often face challenges in resolving recurring or related issues due to fragmented knowledge scattered across various platforms such as JIRA tickets, developer discussions, and GitHub pull requests. To address this issue, a Retrieval-Augmented Generation (RAG) framework has been proposed. This framework integrates Sentence-Transformers for semantic embeddings with FAISS-based vector search to provide context-aware ticket resolution recommendations. The approach involves embedding historical JIRA tickets, user comments, and linked PR metadata to retrieve semantically similar past cases. These retrieved cases are then synthesized by a Large Language Model (LLM) into grounded and explainable resolution suggestions. The framework aims to improve resolution accuracy, fix quality, and knowledge reuse in modern DevOps environments through a unified pipeline linking JIRA and GitHub data. : However, there are challenges that need to be addressed for the successful implementation of the RAG framework. These include issues such as unreliable or misleading suggestions generated by the LLM not grounded in retrieved evidence, historical data bias from outdated or low-quality fixes in legacy tickets and PRs, dataset drift due to changes in frameworks reducing relevance of older examples and embeddings, lack of explainability leading to developer hesitation in adopting AI recommendations without transparent supporting evidence, scalability and cost overhead from indexing large-scale repositories and frequent LLM inference increasing compute and operational costs. : To overcome these challenges,such as retrieval grounding through evidence-weighted decoding,, , , < kd>temporal relevance weighting</kd> for dataset drift; for explainability; periodic re-indexing and embedding refresh for knowledge obsolescence have been suggested. : The future work includes adaptive learning pipelines with online embedding updates for scalability; model distillation for new domains; incremental fine-tuning driven by recent commits for evolving enterprise data. Overall, the RAG framework demonstrates how retrieval and generation can be effectively fused to enhance AI-assisted ticket triage and resolution generation processes in DevOps workflows.
Created on 25 Jun. 2026

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.