Our publications
Publications made by REALISE Lab
2025
Exploring the Potential of Llama Models in Automated Code Refinement: A Replication Study
Authors:
Genevieve Caumartin, Qiaolin Qin, Sharon Chatragadda, Janmitsinh Panjrolia, Heng Li, Diego Elias Costa
Venue:
The IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER), 2025
A Machine Learning-Based Approach For Detecting Malicious PyPI Package
Authors:
Haya Samaana, Diego Elias Costa, Ahmad Abdellatif, Emad Shihab
Venue:
The 40th ACM/SIGAPP Symposium On Applied Computing (ACM SAC), 2025
Early Detection of Performance Regressions by Bridging Local Performance Data and Architectural Models
Authors:
Lizhi Liao, Simon Eismann, Heng Li, Cor-Paul Bezemer, Diego Elias Costa, André van Hoorn
Venue:
ICSE, 2025
Opportunities and Security Risks of Technical Leverage: A Replication Study on the NPM Ecosystems
Authors:
Haya Samaana, Diego Elias Costa, Ahmad Abdellatif, Emad Shihab
Venue:
EMSE journal, 2025
2024
Predicting the First Response Latency of Maintainers and Contributors in Pull Requests
Authors:
SayedHassan Khatoonabadi, Ahmad Abdellatif, Diego Elias Costa, Emad Shihab
Venue:
TSE, 2024
A Transformer-based Approach for Augmenting Software Engineering Chatbots Datasets
Authors:
Ahmad Abdellatif, Khaled Badran, Diego Elias Costa, Emad Shihab
Venue:
ESEM, 2024
The role of library versions in Developer-ChatGPT conversations
Authors:
Rachna Raj, Diego Elias Costa
Venue:
MSR Mining Challenge, 2024
Predicting the Impact of Crashes Across Release Channels
Authors:
Suhaib Mujahid, Diego Elias Costa, Marco Castelluccio
Venue:
MSR Industrial Track, 2024
2023
SE4AI: A Training Program Considering Technical, Social, and Professional Aspects of AI-based Software Systems
Authors:
Ahmad Abdellatif, Gita Ghiasi, Diego Elias Costa, Tanja Tajmel, Emad Shihab
Venue:
IEEE Software journal, Software Engineering Educating and Training, 2023
Understanding the Helpfulness of Stale Bot for Pull-based Development
Authors:
SayedHassan Khatoonabadi, Diego Elias Costa, Suhaib Mujahid, Emad Shihab
Venue:
TOSEM journal, 2023
Empirical analysis of security-related code reviews in npm packages
Authors:
Mahmoud Alfadel, Nicholas Alexandre Nagy, Diego Costa, Rabe Abdalkareem, Emad Shihab
Venue:
JSS journal, 2023
Dependency Update Strategies and Package Characteristics
Authors:
Abbas Jafari, Diego Elias Costa, Rabe Abdalkareem, Emad Shihab
Venue:
TOSEM journal, 2023
Empirical analysis of security vulnerabilities in Python packages
Authors:
Mahmoud Alfadel, Diego Elias Costa, Emad Shihab
Venue:
EMSE journal, 2023
Where to Go Now? Finding Alternatives for Declining Packages in the npm Ecosystem
Authors:
Suhaib Mujahid, Diego Elias Costa, Rabe Abdalkareem, Emad Shihab
Venue:
ASE'23: IEEE/ACM International Conference on Automated Software Engineering
An Empirical Study on Bugs Inside PyTorch: A Replication Study
Authors:
Sharon Ho, Vahid Majdinasab, Mohayeminul Islam, Diego Elias Costa, Emad Shihab, Foutse Khomh, Sarah Nadi, Muhammad Raza
Venue:
ICSME'23: International Conference on Software Maintenance and Evolution
Diversity Awareness in Software Engineering Participant Research
Authors:
Riya Dutta, Diego Elias Costa, Tanja Tajmel, Emad Shihab
Venue:
ICSE SEIS track, 2023
2022
Can Ensembling Pre-processing Algorithms Lead to Better Machine Learning Fairness?
Authors:
Khaled Badran, Pierre-Olivier Cote, Amanda Kolopanis, Rached Bouchoucha, Antonio Collante, Diego Elias Costa, Emad Shihab, Foutse Khomh
Venue:
IEEE Computer, 2022
On the Discoverability of npm Vulnerabilities in Node.js Projects
Authors:
Mahmoud Alfadel, Diego Elias Costa, Emad Shihab, Bram Adams.
Venue:
ACM Transactions on Software Engineering and Methodology (TOSEM), 2022
On Wasted Contributions: Understanding the Dynamics of Contributor-Abandoned Pull Requests
Authors:
SayedHassan Khatoonabadi, Diego Elias Costa, Rabe Abdalkareem, Emad Shihab
Venue:
ACM Transactions on Software Engineering and Methodology (TOSEM), 2022
A case study on the stability of performance tests for serverless applications
Authors:
Simon Eismann, Diego Elias Costa, Lizhi Liao, Cor-Paul Bezemer, Weiyi Shang, Andre van Hoorn, Samuel Kounev
Venue:
Journal of Software and Systems (JSS)
Toward the Personalization of Biceps Fatigue Detection Model For Gym Activity: An Approach to Utilize Wearables’ Data From The Crowd
Authors:
Mohamed Elshafei, Diego Elias Costa, and Emad Shihab
Venue:
Sensors Journal: Applications of Body Worn Sensors and Wearables
Achievement Unlocked: A Case Study on Gamifying DevOps Practices in Industry
Authors:
Patrick Ayoup, Diego Elias Costa, Emad Shihab
Venue:
FSE'22: Symposium on the Foundations of Software Engineering, Industry Track, 2022
Not All Dependencies are Equal: An Empirical Study on Production Dependencies in NPM
Authors:
Jasmine Latendresse, Suhaib Mujahid, Diego Elias Costa, Emad Shihab
Venue:
ASE'22: International Conference on Automated Software Engineering, 2022
Open Source Software: An Approach to Controlling Usage and Risk in Application Ecosystems
Authors:
Stan Zajdel, Diego Elias Costa, Hafedh Mili
Venue:
SPLC'22: International Systems and Software Product Line Conference, 2022
2021
Towards Using Package Centrality Trend to Identify Packages in Decline
Authors:
Suhaib Mujahid, Diego Elias Costa, Rabe Abdalkareem, Emad Shihab, Bram Adams, Aymen Saied
Venue:
IEEE Transactions on Engineering Management
Human Activity Recognition: A Comparative Study to Assess the Contribution Level of Accelerometer, ECG, and PPG Signals
Authors:
Mahsa Sadat Afzali Arani, Diego Elias Costa , Emad Shihab
Venue:
Sensors Journal: Sensors Technology and Machine Learning for Human Activity Recognition
How do developers use the Java Stream API?
Authors:
D. Costa, R. Raj
Venue:
TOSEM journal, 2023
On the Impact of Biceps Muscle Fatigue in Human Activity Recognition
Authors:
Mohamed Elshafei, Diego Costa, Emad Shihab
Venue:
Sensors Journal: Physical Human Activity Recognition Using Wearable Sensors
Dependency Smells in JavaScript Projects
Authors:
Abbas Jafari, Diego Elias Costa, Rabe Abdalkareem, Emad Shihab, and Nikolaos Tsantalis
Venue:
IEEE Transactions on Software Engineering (TSE)
A Comparison of Natural Language Understanding Platforms for Chatbots in Software Engineering
Authors:
Ahmad Abdellatif, Khaled Badran, Diego Costa, and Emad Shihab
Venue:
IEEE Transactions on Software Engineering (TSE)
On the Use of Dependabot Security Pull Requests
Authors:
Mahmoud Alfadel, Diego Elias Costa, Emad Shihab
Venue:
In MSR'21: in International Conference on Mining Software Repositories
How Effective is Continuous Integration in Indicating Single-Statement Bugs?
Authors:
Jasmine Latendresse, Rabe Abdalkareem, Diego Elias Costa, Emad Shihab
Venue:
MSR'21: in International Conference on Mining Software Repositories - Mining Challenge track
Empirical Analysis of Security Vulnerabilities in Python Packages
Authors:
Mahmoud Alfadel, Diego Elias Costa, Emad Shihab
Venue:
ICSME: IEEE International Conference on Software Analysis, Evolution and Reengineering
Breaking Type-Safety in Go: An Empirical Study on the Usage of the unsafe Package
Authors:
Diego Costa, Suhaib Mujahid, Rabe Abdalkareem, and Emad Shihab
Venue:
IEEE Transactions on Software Engineering (TSE)
2020
Memory and resource leak defects and their repairs in Java projects
Authors:
Mohammad Ghanavati, Diego Costa, Janos Seboek, David Lo, and Artur Andrzejak
Venue:
Empirical Software Engineering journal (EMSE), vol. 25, 678–718
2019
What’s Wrong With My Benchmark Results? Studying Bad Practices in JMH Benchmarks
Authors:
Diego Costa, C. Bezemer, P. Leitner and A. Andrzejak
Venue:
IEEE Transactions on Software Engineering (TSE)
Enhancing Commit Graphs with Visual Runtime Clues
Authors:
Juan P. S. Alcocer, Harold C. Jaimes, Diego Costa, Alexandre Bergel and Fabian Beck
Venue:
VISSOFT'19: Working Conference on Software Visualization
Agile construction of data science DSLs (tool demo)
Authors:
Artur Andrzejak, Kevin Kiefer, **Diego Costa**, and Oliver Wenz
Venue:
TOSEGPCE'19: ACM SIGPLAN International Conference on Generative Programming: Concepts and Experiences
Benchmark-driven Software Performance Optimization
Authors:
Diego Costa
Venue:
Ph.D. thesis. 2019
2018
CollectionSwitch: a framework for efficient and dynamic collection selection
Authors:
Diego Costa and Artur Andrzejak
Venue:
CGO'18: International Symposium on Code Generation and Optimization
Memory and resource leak defects in Java Projects: an empirical study
Authors:
Mohammadreza Ghanavati, Diego Costa, Artur Andrzejak, and Janos Seboek
Venue:
ICSE-C'18: International Conference on Software Engineering: Companion Proceeedings
Heterogeneous subgraph features for information networks
Authors:
Andreas Spitz, Diego Costa, Kai Chen, Jan Greulich, Johanna Geiß, Stefan Wiesberg, and Michael Gertz
Venue:
TOSEM journal, 2023GRADES-NDA '18: ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems and Network Data Analytics
2017
2016
ORPLocator: Identifying Read Points of Configuration Options via Static Analysis
Authors:
Zhen Dong, Artur Andrzejak, David Lo and Diego Costa
Venue:
ISSRE'16: International Symposium on Software Reliability Engineering
2015
Characterization of Dynamic Memory Allocations in Real-World Applications: An Experimental Study
Authors:
Diego Costa and Rivalino Matias Jr.
Venue:
MASCOTS'15: IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems
A Systematic Differential Analysis for Fast and Robust Detection of Software Aging
Authors:
Rivalino Matias, Artur Andrzejak, Fumio Machida, Diego Costa and Kishor Trivedi
Venue:
SRDS'14: IEEE International Symposium on Reliable Distributed Systems
Experimental and theoretical analyses of memory allocation algorithms
Authors:
Diego Costa, Rivalino Matias, Marcia Fernandes, and Lucio Borges
Venue:
SAC'14: ACM Symposium on Applied Computing