Software Engineering

Mining Subscenario Refactoring Opportunities in Behaviour-Driven Software Test Suites: ML Classifiers and LLM-Judge Baselines
Avatar
Ali Hassaan Mughal
71 views
AI-Generated Smells: An Analysis of Code and Architecture in LLM and Agent-Driven Development
Avatar
librarian
80 views
Finding Duplicates in 1.1M BDD Steps: cukereuse, a Paraphrase-Robust Static Detector for Cucumber and Gherkin
Avatar
Ali Hassaan Mughal
103 views
CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents
Avatar
librarian
127 views
Test-Driven AI Agent Definition (TDAD): Compiling Tool-Using Agents from Behavioral Specifications
Avatar
Tzafrir Rehan
149 views
SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration
Avatar
librarian
139 views
Rethinking Autonomy: Preventing Failures in AI-Driven Software
  Engineering
Avatar
Joydeep
523 views
Are Large Language Models Robust in Understanding Code Against
  Semantics-Preserving Mutations?
Avatar
librarian
537 views
Mutation Testing framework for Machine Learning
Avatar
rsingh80
734 views
Patched RTC: evaluating LLMs for diverse software development tasks
Avatar
Asankhaya Sharma
740 views
Patched MOA: optimizing inference for diverse software development tasks
Avatar
Asankhaya Sharma
820 views
Pitfalls in Language Models for Code Intelligence: A Taxonomy and Survey
Avatar
Xinyu She
886 views
Runtime Resolution of Feature Interactions through Adaptive Requirement
  Weakening
Avatar
Simon Chu
844 views
Demystifying Compiler Unstable Feature Usage and Impacts in the Rust
  Ecosystem
Avatar
Chenghao Li
833 views
Towards the decentralized coordination of multiple self-adaptive systems
Avatar
Paul-Andrei Dragan
863 views
Variance of ML-based software fault predictors: are we really improving
  fault prediction?
Avatar
Domenic Bubel
981 views
Exploring Behaviours of RESTful APIs in an Industrial Setting
Avatar
Stefan Karlsson
829 views
Evaluating Pre-trained Language Models for Repairing API Misuses
Avatar
Ting Zhang
829 views
Formal Runtime Error Detection During Development in the Automotive
  Industry
Avatar
Jesko Hecking-Harbusch
945 views
Exploring Large Language Models for Code Explanation
Avatar
Paheli Bhattacharya
802 views
Leveraging Deep Learning for Abstractive Code Summarization of
  Unofficial Documentation
Avatar
AmirHossein Naghshzan
916 views
Vision-Based Mobile App GUI Testing: A Survey
Avatar
Shengcheng Yu
821 views
Using ChatGPT throughout the Software Development Life Cycle by Novice
  Developers
Avatar
Muhammad Waseem
895 views
Less is More? An Empirical Study on Configuration Issues in Python PyPI
  Ecosystem
Avatar
Yun Peng
832 views
Unleashing the Power of Clippy in Real-World Rust Projects
Avatar
Chunmiao Li
907 views
The Effects of Computational Resources on Flaky Tests
Avatar
Denini Silva
797 views
A comprehensible analysis of the efficacy of Ensemble Models for Bug
  Prediction
Avatar
Ingrid Marc¸al
810 views
Large Language Models for Code Analysis: Do LLMs Really Do Their Job?
Avatar
Chongzhou Fang
974 views
SURE: A Visualized Failure Indexing Approach using Program Memory
  Spectrum
Avatar
Yi Song
775 views
Automated Repair of Declarative Software Specifications in the Era of
  Large Language Models
Avatar
Md Rashedul Hasan
761 views
The Software Heritage Open Science Ecosystem
Avatar
Roberto Di Cosmo
783 views
A Critical Review of Large Language Model on Software Engineering: An
  Example from ChatGPT and Automated Program Repair
Avatar
Quanjun Zhang
887 views