MSR 2015 May 16–17. Florence, Italy
The 12th Working Conference on Mining Software Repositories

Keynotes

Confessions of a Worldly Software Miner
Radu Marinescu, Politehnica University of Timisoara, Romania

Program at a glance

DAY 1
8:15-8:30 MSR Opening Message
8:30-9:30 Keynote - Radu Marinescu
9:30-10:30 1. Practice Papers (Reports from the Trenches)
10:30-11:00 Coffee Break
11:00-12:30 2. Everything Changes (or Stays the Same)
12:30-14:00 Lunch
14:00-15:30 3. Interaction Data and App Mining
15:30-16:00 Coffee Break
16:00-17:00 4. MSR Challenge;
14 challenge presentations
17:00-18:00 5. MSR Inaugural Parallel Sessions:
Data Papers (Sala Verde)
Short Papers (room 203)
20:00 Social Dinner and Awards
   
DAY 2
8:30-9:00 Announcements and MIP talk
9:00-10:30 1. Code Review (that Passed Peer Review)
10:30-11:00 Coffee Break
11:00-12:30 2. Ecosystems, APIs, and Architecture
12:30-13:30 Lunch
13:30-14:30 3. Data and Short Papers Poster Session
14:30-15:30 4. Scary stuff: Bugs, Risks, and Vulnerabilities
15:30-16:00 Coffee Break
16:00-17:00 5. Computer Musicians Bullied for Using Gists
17:00-18:00 6. Licenses, Deep Learning, and Process Mining
18:00-18:10 Wrap-Up

MSR 2015 Potential Schedule



Invited Papers

DAY 1
8:15-8:30 MSR Opening Message
8:30-9:30 Keynote
  Confessions of a Worldly Software Miner
Radu Marinescu
(Politehnica University of Timisoara, Romania)
9:30-10:30 1. Practice Papers (Reports from the Trenches) [Session Chair: Christian Bird]
  Code Ownership and Software Quality: A Replication Study
Michaela Greiler, Kim Herzig and Jacek Czerwonka
(Microsoft, United States, Microsoft Research, United Kingdom and Microsoft Corp., United States)

  Extracting Facts from Performance Tuning History of Scientific Applications for Predicting Effective Optimization Patterns
Masatomo Hashimoto, Masaaki Terai, Toshiyuki Maeda and Kazuo Minami
(RIKEN Advanced Institute for Computational Science, Japan)

  Mining Component Repositories for Installability Issues
Pietro Abate, Roberto Di Cosmo, Louis Gesbert, Fabrice Le Fessant, Ralf Treinen and Stefano Zacchiroli
(INRIA, France and Université Paris Diderot, France)

Preprint Available

10:30-11:00 Coffee Break
11:00-12:30 2. Everything Changes (or Stays the Same) [Session Chair: Denys Poshyvanyk]
  The Uniqueness of Changes: Characteristics and Applications
Baishakhi Ray, Meiyappan Nagappan, Christian Bird, Nachiappan Nagappan and Thomas Zimmermann
(University of California, Davis, United States, Rochester Institute of Technology, United States and Microsoft Research, United States)

  Co-evolution of Infrastructure and Source Code - An Empirical Study
Yujuan Jiang and Bram Adams
(Polytechnique Montreal, Canada)

Preprint Available

  Mining Energy-Aware Commits
Irineu Moura, Gustavo Pinto, Felipe Ebert and Fernando Castor
(Federal University of Pernambuco, Brazil)

Preprint Available

  Why Power Laws? An Explanation from Fine-Grained Code Changes
Zhongpeng Lin and Jim Whitehead
(University of California, Santa Cruz)

Preprint Available

  Sameness: An Experiment in Code Search
Lee Martie and André van der Hoek
(UCI, United States)

12:30-14:00 Lunch
14:00-15:30 3. Interaction Data and App Mining [Session Chair: Mei Nagappan]
  Using Developer-Interaction Trails to Triage Change Requests
Motahareh Bahrami Zanjani, Huzefa Kagdi and Christian Bird
(Wichita State University, United States and Microsoft Research, United States)

  Studying Developers' Copy and Paste Behavior
Tarek Ahmed, Weiyi Shang and Ahmed Hassan
(Queen's University, Canada)

  Mining Android App Usages for Generating Actionable GUI-based Execution Scenarios
Mario Linares-Vásquez, Martin White, Carlos Eduardo Bernal Cardenas, Kevin Moran and Denys Poshyvanyk
(The College of William and Mary, United States)

Preprint Available

  The App Sampling Problem for App Store Mining
William Martin, Mark Harman, Yue Jia, Federica Sarro and Yuanyuan Zhang
(University College London, United Kingdom)

  Unveiling Exception Handling Bug Hazards in Android based on GitHub and Google Code Issues
Roberta Coelho, Lucas Almeida, Georgios Gousios and Arie van Deursen
(Federal University of Rio Grande do Norte, Brazil, Radboud University Nijmegen, Netherlands and Delft University of Technology, Netherlands)

15:30-16:00 Coffee Break
16:00-17:00 4. MSR Challenge; 14 challenge presentations
  The Synergy Between Voting and Acceptance of Answers on StackOverflow, or the Lack thereof
Neelamadhav Gantayat, Pankaj Dhoolia, Rohan Padhye, Senthil Mani and Vibha Singhal Sinha
(IBM Research, India)

  Quality questions need quality code: Classifying code fragments on StackOverflow
Maarten Duijn, Adam Kucera and Alberto Bacchelli
(Delft University of Technology, Netherlands and Czech Technical University in Prague, Czech Republic)

  ETA: Estimated Time of Answer, Predicting Response Time in Stack Overflow
Jeffrey Goderie, Brynjolfur Mar Georgsson, Bastiaan van Graafeiland and Alberto Bacchelli
(Delft University of Technology, Netherlands)

Preprint Available

  Going Green: An Exploratory Analysis of Energy-Related Questions
Haroon Malik, Peng Zhao and Michael Godfrey
(University of Waterloo, Canada)

  Mining StackOverflow to Filter out Off-topic IRC Discussion
Shaiful Chowdhury and Abram Hindle
(University of Alberta, Canada)

Preprint Available

  An Insight into the Unresolved Questions at Stack Overflow
Mohammad Masudur Rahman and Chanchal K. Roy
(University of Saskatchewan, Canada)

  Mining Successful Answers in Stack Overflow
Fabio Calefato, Filippo Lanubile, Maria Concetta Marasciulo and Nicole Novielli
(University of Bari, Italy)

Preprint Available

  Quick Trigger on Stack Overflow: A study of gamification-influenced member tendencies
Yong Jin, Xin Yang, Raula Gaikovina Kula, Eunjong Choi, Hajimu Iida and Katsuro Inoue
(Nara Institute of Science and Technology and Osaka University, Japan)

Preprint Available

  Intuition vs. Truth: Evaluation of Common Myths about StackOverflow Posts
Verena Honsel, Steffen Herbold and Jens Grabowski
(Universität Göttingen, Germany)

  Automatic Assessments of Code Explanations: Predicting answering times on Stack Overflow
Selman Ercan, Quinten Stokkink and Alberto Bacchelli
(Delft University of Technology, Netherlands)

Preprint Available

  Which Non-functional Requirements do Developers Focus on? An Empirical Study on Stack Overflow using Topic Analysis
Jie Zou, Ling Xu, Weikang Guo, Meng Yan, Dan Yang and Xiaohong Zhang
(Chongqing University, China)

  Stack Overflow badges and user behavior: An econometric approach
Andrew Marder
(Harvard Business School, United States)

  Employing Source Code Information to Improve Question-Answering in Stack Overflow
Themistoklis Diamantopoulos and Andreas Symeonidis
(Aristotle University of Thessaloniki, Greece)

Preprint Available

  One-day flies on StackOverflow - Why the vast majority of StackOverflow users only posts once
Rogier Slag, Mike de Waard and Alberto Bacchelli
(Delft University of Technology, Netherlands)

Preprint Available

17:00-18:00 5. MSR Inaugural Parallel Sessions:
Data Papers (Sala Verde)
  A Repository with 44 Years of Unix Evolution
Diomidis Spinellis
(Athens University of Economics and Business, Greece)

Preprint Available
Poster Available

  The Debsources Dataset: Two Decades of Debian Source Code Metadata
Stefano Zacchiroli
(Univ Paris Diderot, France)

Preprint Available

  A Dataset of the Activity of the git superrepository of Linux
Daniel German, Bram Adams and Ahmed E. Hassan
(University of Victoria, Canada, École Polytechnique de Montréal, Canada and Queen's University, Canada)

Preprint Available

  StORMeD: Stack Overflow Ready Made Data
Luca Ponzanelli, Andrea Mocci and Michele Lanza
(University of Lugano, Switzerland)

Preprint Available

  The MetricsGrimoire Database Collection
Jesus M. Gonzalez-Barahona, Gregorio Robles and Daniel Izquierdo-Cortazar
(Universidad Rey Juan Carlos, Spain, Universidad Rey Juan Carlos, Spain and Bitergia, Spain)

Preprint Available

  Landfill: an Open Dataset of Code Smells with Public Evaluation
Fabio Palomba, Dario Di Nucci, Michele Tufano, Gabriele Bavota, Rocco Oliveto, Denys Poshyvanyk and Andrea De Lucia
(University of Salerno, Italy, The College of William and Mary, United States, Free University of Bolzano-Bozen, Italy and University of Molise, Italy)

Poster Available

  Fuse: A Reproducible, Extendable, Internet-scale Corpus of Spreadsheets
Titus Barik, Kevin Lubick, Justin Smith, John Slankas and Emerson Murphy-Hill
(North Carolina State University, United States)

Preprint Available
Poster Available

  Dataset of developer-labeled commit messages for task classification validation
Andreas Mauczka, Florian Brosch, Christian Schanes and Thomas Grechenig
(Vienna University of Technology, Austria)

Poster Available

  A Novel Industry Grade Dataset for Fault Prediction based on Model-Driven Developed Automotive Embedded Software
Harald Altinger, Sebastian Siegl, Yanja Dajsuren and Franz Wotawa
(Audi Electronics Venture GmbH, Germany, Audi Electronics Venture GmbH, Germany, Eindhoven University of Technology, Netherlands and Technische Universitaet Graz, Austria)

Preprint Available
Poster Available

  The Firefox Defect Temporal Dataset
Mayy Habayeb, Andriy Miranskyy, Syed Shariyar Murtaza, Leotis Buchanan and Ayse Bener
(Ryerson University, Canada)

Poster Available

  An Architectural Evolution Dataset
Michel Wermelinger and Yijun Yu
(The Open University, United Kingdom)

Preprint Available

  A Dataset For API Usage
Anand Sawant and Alberto Bacchelli
(Delft University of Technology, Netherlands)

Preprint Available
Poster Available

  Generating the Blueprints of the Java Ecosystem
Vassilios Karakoidas, Dimitris Mitropoulos, Georgios Gousios, Diomidis Spinellis and Panagiotis Louridas
(Athens University of Economics and Business, Greece, Columbia University, United States and Radboud University Nijmegen, Netherlands)

Poster Available

  A Data Set for Social Diversity Studies of GitHub Teams
Bogdan Vasilescu, Alexander Serebrenik and Vladimir Filkov
(University of California, Davis, United States and Eindhoven University of Technology, Netherlands)

Preprint Available

  A Dataset of High Impact Bugs: Manually-Classified Issue Reports
Masao Ohira, Yutaro Kashiwa, Yosuke Yamatani, Hayato Yoshiyuki, Yoshiya Maeda, Nachai Limsettho, Keisuke Fujino, Hideaki Hata, Akinori Ihara and Kenichi Matsumoto
(Wakayama University, Japan and Nara Institute of Science and Technology, Japan)

Poster Available

  A Dataset of Open Source Android Applications
Daniel Krutz, Mehdi Mirakhorli, Sam Malachowsky, Andres Ruiz, Jacob Peterson and Andrew Filipski
(Rochester Institute of Technology, United States)

Poster Available

Short Papers (room 203) [Session Chair: Hongyu Zhang]
  Automatically Prioritizing Pull Requests
Erik van der Veen, Georgios Gousios and Andy Zaidman
(Delft University of Technology, Netherlands and Radboud University Nijmegen, Netherlands)

Preprint Available

  Matching GitHub developer profiles to job advertisements
Claudia Hauff and Georgios Gousios
(Delft University of Technology, Netherlands and Radboud University Nijmegen, Netherlands)

  Wait For It: Determinants of Pull Request Evaluation Latency on GitHub
Yue Yu, Huaimin Wang, Vladimir Filkov, Premkumar Devanbu and Bogdan Vasilescu
(National University of Defense Technology, China and University of California, United States)

Preprint Available

  Toward Reusing Code Changes
Yoshiki Higo, Akio Ohtani, Shinpei Hayashi, Hideaki Hata and Shinji Kusumoto
(Osaka University, Japan and Tokyo Institute of Technology, Japan)

Preprint Available

  Modifications, Tweaks, and Bug Fixes in Architectural Tactics
Mehdi Mirakhorli and Jane Cleland-Huang
(Rochester Institute of Technology, United States and DePaul, United States)

  Do Onboarding Programs Work?
Adriaan Labuschagne and Reid Holmes
(University of Waterloo, Canada)

  An enhanced Graph-based infrastructure for Software Search Engines
Colin Atkinson and Marcus Schumacher
(University of Mannheim, Germany)

  Organizational volatility and post-release defects: A replication case study using data from Google Chrome
Samuel Mugnaini Donadelli, Yue Cai Zhu and Peter Rigby
(Concordia University, Canada)

  Detecting and Mitigating Secret-Key Leaks in Source Code Repositories
Vibha Singhal Sinha, Diptikalyan Saha, Pankaj Dhoolia, Rohan Padhye and Senthil Mani
(IBM Research, India)

  Summarizing Complex Development Artifacts by Mining Heterogeneous Data
Luca Ponzanelli, Andrea Mocci and Michele Lanza
(University of Lugano, Switzerland)

Preprint Available

20:00 Social Dinner and Awards
   
DAY 2
8:30-9:00 Announcements and MIP talk
9:00-10:30 1. Code Review (that Passed Peer Review) [Session Chair: Peter Rigby]
  Characteristics of Useful Code Reviews: An Empirical Study at Microsoft
Amiangshu Bosu, Michaela Greiler and Christian Bird
(University of Alabama, United States and Microsoft Research, United States)

Preprint Available

  Will they like this? Evaluating Code Contributions With Language Models
Vincent Hellendoorn, Premkumar Devanbu and Alberto Bacchelli
(Delft University of Technology, Netherlands and University of California, Davis)

Preprint Available

  Investigating Code Review Practices in Defective Files: An Empirical Study of the Qt System
Patanamon Thongtanunam, Shane McIntosh, Ahmed E. Hassan and Hajimu Iida
(Nara Institute of Science and Technology, Japan and Queen's University, Canada)

Preprint Available

  Partitioning Composite Code Changes to Facilitate Code Review
Yida Tao and Sunghun Kim
(The Hong Kong University of Science and Technology, Hong Kong)

  Lessons Learned from Building and Deploying a Code Review Analytics Platform
Christian Bird, Trevor Carnahan and Michaela Greiler
(Microsoft Research, United States, Microsoft, United States and Microsoft, Germany)

10:30-11:00 Coffee Break
11:00-12:30 2. Ecosystems, APIs, and Architecture [Session Chair: Andrew Begel]
  Ecosystems in GitHub and a Method for Ecosystem Identification using Reference Coupling
Kelly Blincoe, Francis Harrison and Daniela Damian
(University of Victoria, New Zealand, SEGAL Labs, Canada and University of Victoria, Canada)

Preprint Available

  A historical analysis of Debian package incompatibilities
Tom Mens, Maïlick Claes, Roberto Di Cosmo and Jerome Vouillon
(University of Mons, Belgium, Université Paris Diderot, France and INRIA, France)

  Recommending Posts Concerning API Issues in Developer Q&A Sites
Wei Wang, Haroon Malik and Mike Godfrey
(University of Waterloo, Canada)

  An Empirical Study of Architectural Change in Open-Source Software Systems
Duc Le, Pooyan Behnamghader, Joshua Garcia, Daniel Link, Arman Shahbazian and Nenad Medvidovic
(University of Southern California, United States and George Mason University, United States)

  A Study on the Role of Software Architecture in the Evolution and Quality of Software
Ehsan Kouroshfar, Mehdi Mirakhorli, Hamid Bagheri, Lu Xiao, Sam Malek and Yuanfang Cai
(George Mason University, United States, Rochester Institute of Technology, United States and Drexel University, United States)

12:30-13:30 Lunch
13:30-14:30 3. Data and Short Papers Poster Session
14:30-15:30 4. Scary stuff: Bugs, Risks, and Vulnerabilities [Session Chair: Bram Adams]
  Are These Bugs Really 'Normal'?
Ripon Saha, Julia Lawall, Sarfraz Khurshid and Dewayne E. Perry
(The University of Texas at Austin, United States and Sorbonne University, France)

  Do Bugs Foreshadow Vulnerabilities? A Study of the Chromium Project
Felivel Camilo, Andrew Meneely and Meiyappan Nagappan
(Rochester Institute of Technology, United States)

  Characterization and prediction of issue-related risks in software projects
Morakot Choetkiertikul, Hoa Khanh Dam, Truyen Tran and Aditya Ghose
(University of Wollongong, Australia and Deakin University, Australia)

Preprint Available

15:30-16:00 Coffee Break
16:00-17:00 5. Computer Musicians Bullied for Using Gists [Session Chair: Alberto Bacchelli]
  An Empirical Study of End-user Programmers in the Computer Music Community
Gregory Burlet and Abram Hindle
(University of Alberta, Canada)

Preprint Available

  Are Bullies more Productive? Empirical Study of Affectiveness vs. Issue Fixing Time
Marco Ortu, Bram Adams, Giuseppe Destefanis, Parastou Tourani, Michele Marchesi and Roberto Tonelli
(University of Cagliari, Italy, École Polytechnique de Montréal, Canada and CRIM, The Islamic Republic of Iran)

Preprint Available

  What is the Gist? Understanding the Use of Public Gists on GitHub
Weiliang Wang, Germán Poo-Caamaño, Evan Wilde and Daniel German
(University of Victoria, Canada)

17:00-18:00 6. Licenses, Deep Learning, and Process Mining [Session Chair: Georgios Gousios]
  A Method to Detect License Inconsistencies in Large-Scale Open Source Projects
Yuhao Wu, Yuki Manabe, Tetsuya Kanda, Daniel German and Katsuro Inoue
(Osaka University, Japan, Kumamoto University, Japan and University of Victoria, Canada)

  Toward Deep Learning Software Repositories
Martin White, Christopher Vendome, Mario Linares-Vásquez and Denys Poshyvanyk
(College of William and Mary, United States)

Preprint Available

  Identifying Software Process Management Challenges: Survey of Practitioners in a Large Global IT Company
Monika Gupta, Ashish Sureka, Padmanabhuni Srinivas and Allahbaksh Asadullah
(Indraprastha Institute of Information Technology, India and Infosys Technologies Ltd., India)

Preprint Available

18:00-18:10 Wrap-Up [Session Chair: Romain Robbes]


If you would like your accepted paper's preprint made available on the MSR website, or your personal URL linked to your paper, please send an email to the Web Chair.