DOI:10.1109/CIMCA.2008.63
Corpus ID: 15355153
Mining Bug Repositories--A Quality Assessment
Philipp Schügerl, J. Rilling, P. Charland
Published 2008
Computer Science

2008 International Conference on Computational Intelligence for Modelling Control & Automation
The process of evaluating, classifying, and assigning bugs to programmers is a difficult and time consuming task which greatly depends on the quality of the bug report itself. It has been shown that… Expand
View On IEEE
Researchgate.Net
Share This Paper
22 Citations
Background Citations
8
Methods Citations
6
Figures, Tables, and Topics from this paper
Table 1
Figure 1
Table 2
Table 3
Bug tracking system
Natural language processing
Information retrieval
Software bug
Software engineering
Programmer
Open-source software
Capability Maturity Model
Population
22 Citations
Investigating the usefulness of stack traces in bug triaging
M. Krikke
Computer Science
2012
TLDR
It is concluded that stack traces can be used to link software artifacts and can be a valuable input for prediction models, for example using metrics of related bugs and source files. Expand
2 Citations
Mining Frequent Patterns From Bug Repositories
Kiran Kumar
2014
Mining bug repositories is an emerging area of research. Bug repositories are major source of knowledge. Different data mining algorithms can be applied on these repositories to extract useful and… Expand
1 Citation
Mining Software Repositories for Defect Categorization
S. Kumaresh, R. Baskaran
Computer Science
2015
TLDR
This work attempts to categorize defects by proposing an algorithm called Software Defect CLustering (SDCL), which aims at mining the existing online bug repositories like Eclipse, Bugzilla and JIRA for analyzing the defect description and its categorization. Expand
4 Citations
PDF
Samiksha: mining issue tracking system for contribution and performance assessment
Ayushi Rastogi, Arpit Gupta, A. Sureka
Engineering, Computer ScienceISEC
2013
TLDR
This paper proposes a contribution and performance assessment framework (called as Samiksha) in the context of Software Maintenance, and proposes several performance metrics covering different aspects and various roles such as bug reporter and bug fixer. Expand
10 Citations
PDF
Prediction of Priority and Severity of Bug Report using Deep Learning Techniques
2020
The development of software project includes many phases. Each phase has its own importance. Software maintenance is essential phase of software development. To improve software reliability, software… Expand
Natural language processing in mining unstructured data from software repositories: a review
Som Gupta, S K Gupta
Computer Science
2019
TLDR
The paper mainly focuses on sentiment analysis, summarization, traceability, norms mining and mobile analytics, and some popular open-source NLP tools that have been used to perform NLP tasks in Mining Software Repositories field. Expand
Mining textual requirements to assist architectural software design: a state of the art review
Agustin Casamayor, D. Godoy, M. Campo
Computer ScienceArtificial Intelligence Review
2011
TLDR
Intelligent text analysis techniques utilized in software engineering tasks across the software life-cycle are detailed in order to analyze works focused on automatically bridging the gap between requirements and software architectures. Expand
20 Citations
Defining a Software Maintainability Dataset: Collecting, Aggregating and Analysing Expert Evaluations of Software Maintainability
Markus Schnappinger, Arnaud Fietzke, A. Pretschner
Computer Science
2020 IEEE International Conference on Software Maintenance and Evolution (ICSME)
2020
TLDR
The judgments of a significantly larger expert group are used to create a robust maintainability dataset that contains the consensus of the experts for more than 500 Java classes and reveals that understandability is valued the most by the experts. Expand
1 Citation
Bug characteristics in open source software
Lin Tan, Chen Liu, Zhenmin Li, Xuanhui Wang, Yuanyuan Zhou, ChengXiang Zhai
Computer Science
Empirical Software Engineering
2013
TLDR
This work studies software bug characteristics by sampling 2,060 real world bugs in three large, representative open-source projects and uses machine learning techniques to classify 109,014 bugs automatically, suggesting semantic bugs are the dominant root cause. Expand
127 Citations
PDF
OntEQAM – A Methodology for Assessing Evolvability as a Quality Factor in Software Ecosystems
Aseel Hmood, Philipp Schugerl, J. Rilling
2010
Software development and evolution are highly distributed processes that involve a multitude of supporting tools and resources. Knowledge relevant to these resources is typically dispersed over a… Expand
4 Citations
...
1
2
3
...
References
SHOWING 1-10 OF 26 REFERENCES
Fine grained indexing of software repositories to support impact analysis
G. Canfora, L. Cerulo
Computer Science
MSR '06

2006
TLDR
This paper deals with impact analysis of a change request and shows that data stored in software repositories are a good descriptor on how past change requests have been resolved, on a set of three open-source projects. Expand
86 Citations
PDF
Quality of bug reports in Eclipse
Nicolas Bettenburg, Sascha Just, Adrian Schröter, Cathrin Weiss, R. Premraj, T. Zimmermann
Computer Science
eclipse '07

2007
TLDR
This work conducts a survey among ECLIPSE developers to determine the information in reports that they widely used and the problems frequently encountered, and presents a prototype of a quality-meter tool that measures the quality of bug reports by scanning its content. Expand
Automatic bug triage using text categorization
Davor Cubranic, G. Murphy
Computer Science
SEKE

2004
TLDR
This paper proposes to apply machine learning techniques to assist in bug triage by using text categorization to predict the developer that should work on the bug based on thebug’s description. Expand
405 Citations
PDF
What Makes a Good Bug Report?
Nicolas Bettenburg, Sascha Just, Adrian Schröter, Cathrin Weiss, R. Premraj, T. Zimmermann
Engineering, Computer Science
IEEE Transactions on Software Engineering
2010
TLDR
The CUEZILLA prototype is a tool that measures the quality of new bug reports and recommends which elements should be added to improve the quality, and discusses several recommendations for better bug tracking systems which should focus on engaging bug reporters, better tool support, and improved handling of bug duplicates.Expand
562 Citations
PDF
Supporting change request assignment in open source development
G. Canfora, L. Cerulo
Computer Science
SAC

2006
TLDR
A study on how change requests have been assigned to developers involved in an open source project and a method to suggest the set of best candidate developers to resolve a new change request are proposed.Expand
127 Citations
PDF
Coping with an open bug repository
J. Anvik, L. Hiew, G. Murphy
Computer Science
eclipse '05

2005
TLDR
An initial characterization of two open bug repository from the Eclipse and Firefox projects is provided, the duplicate bug and bug triage problems that arise with these open bug repositories are described, and how machine learning technology is applied to help automate these processes are discussed. Expand
234 Citations
PDF
Detection of Duplicate Defect Reports Using Natural Language Processing
P. Runeson, Magnus Alexandersson, Oskar Nyholm
Computer Science
29th International Conference on Software Engineering (ICSE'07)
2007
TLDR
This work investigates using natural language processing (NLP) techniques to identify duplicates in defect reports at Sony Ericsson mobile communications, and shows that about 2/3 of the duplicates can possibly be found using the NLP techniques. Expand
489 Citations
A Linguistic Analysis of How People Describe Software Problems
Amy J. Ko, B. Myers, Duen Horng Chau
Computer Science
Visual Languages and Human-Centric Computing (VL/HCC'06)
2006
TLDR
A study of nearly 200,000 bug report titles exhibited sufficient regularity to parse with an accuracy of 89%, enabling a number of new automated analyses and suggesting new designs for more structured report forms.Expand
156 Citations
PDF
Natural Language Processing: Mature Enough for Requirements Documents Analysis?
L. Kof
Computer Science
NLDB

2005
TLDR
An overview of the existing terminology extraction methods is given and it is shown how they can be integrated to reach a comprehensive text analysis approach and would both detect inconsistencies in the requirements document and extract an ontology after elimination of inconsistencies.Expand
44 Citations
PDF
How Long Will It Take to Fix This Bug?
Cathrin Weiss, R. Premraj, T. Zimmermann, A. Zeller
Computer Science
Fourth International Workshop on Mining Software Repositories (MSR'07:ICSE Workshops 2007)
2007
TLDR
This work presents an approach that automatically predicts the fixing effort, i.e., the person-hours spent on fixing an issue, using the Lucene framework to search for similar, earlier reports and use their average time as a prediction. Expand
352 Citations
PDF
...
1
2
3
...
SORT BY
Related Papers
New features for duplicate bug detection
Nathan Klein, Christopher S. Corley, Nicholas A. Kraft
Computer Science
MSR

2014
TLDR
A range of metrics based on the topic distribution of the issue reports are introduced, relying only on data taken directly from bug reports, including a novel metric that measures the first shared topic between two topic-document distributions.
24 Citations
An approach to detecting duplicate bug reports using natural language and execution information
Xiaoyin Wang, L. Zhang, Tao Xie, J. Anvik, Jiasu Sun
Computer Science
ACM/IEEE 30th International Conference on…
2008
TLDR
The experimental results show that the approach can detect 67%-93% of duplicate bug reports in the Firefox bug repository, compared to 43%-72% using natural language information alone.
497 Citations
Show More
2/10
Abstract
Figures, Tables, and Topics
22 Citations
26 References
Related Papers
Stay Connected With Semantic Scholar
What Is Semantic Scholar?
Semantic Scholar is a free, AI-powered research tool for scientific literature, based at the Allen Institute for AI.
Learn More
About
About Us
Publishers
Beta Program
Contact
Research
Team
Datasets
Open Corpus
Supp.ai
Resources
Librarians
Tutorials
FAQ
API
Proudly built by AI2
Terms of ServicePrivacy Policy
By clicking accept or continuing to use the site, you agree to the terms outlined in our Privacy Policy, Terms of Service, and Dataset License
ACCEPT & CONTINUE