A Learning-Based Approach to Static Program Slicing (SPLASH 2024 - OOPSLA 2024)

Sun 20 - Fri 25 October 2024 Pasadena, California, United States

Who

Aashish Yadavally, Yi Li, Shaohua Wang, Tien N. Nguyen

Track

SPLASH 2024 OOPSLA

Time Zone

The program is currently displayed in (GMT-07:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-07:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Fri 25 Oct 2024 11:00 - 11:20 at IBR West - Static Analysis and Program Verification 4 Chair(s): Anders Møller

Abstract

Traditional program slicing techniques are crucial for early bug detection and manual/automated debugging for online code snippets. Nevertheless, their inability to handle incomplete code hinders their real-world applicability in such scenarios. To overcome these challenges, we present NS-Slicer, a novel learning-based approach that predicts static program slices for both complete and partial code. Our tool leverages a pre-trained language model to exploit its understanding of fine-grained variable-statement dependencies within source code. With this knowledge, given a variable at a specific location and a statement in a code snippet, NS-Slicer determines whether the statement belongs to the backward slice or forward slice, respectively.

We conducted a series of experiments to evaluate NS-Slicer’s performance. On complete code, it predicts the backward and forward slices with an F1-score of 97.41% and 95.82%, respectively, while achieving an overall F1-score of 96.77%. Notably, in 85.20% of the cases, the static program slices predicted by NS-Slicer exactly match entire slices from the oracle. For partial programs, it achieved an F1-score of 96.77%–97.49% for backward slicing, 92.14%–95.40% for forward slicing, and an overall F1-score of 94.66%–96.62%. Furthermore, we demonstrate NS-Slicer’s utility in vulnerability detection (VD), integrating its predicted slices into an automated VD tool. In this setup, the tool detected vulnerabilities in Java code with a high F1-score of 73.38%. We also include analyses studying NS-Slicer’s promising performance and limitations, providing insights into its understanding of intrinsic code properties such as variable aliasing, leading to better slicing.

Link to Preprint

https://aashishyadavally.github.io/assets/pdf/pub-oopsla2024.pdf

DOI

https://doi.org/10.1145/3649814

Aashish Yadavally

University of Texas at Dallas

United States

Yi Li

University of Texas at Dallas

Shaohua Wang

Central University of Finance and Economics

China

Tien N. Nguyen

University of Texas at Dallas

United States

Time Zone

The program is currently displayed in (GMT-07:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-07:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Fri 25 Oct
Displayed time zone: Pacific Time (US & Canada) change

11:00 - 12:20	Static Analysis and Program Verification 4OOPSLA 2024 at IBR West Chair(s): Anders Møller Aarhus University

11:00 20m Talk		A Learning-Based Approach to Static Program Slicing OOPSLA 2024 Aashish Yadavally University of Texas at Dallas, Yi Li University of Texas at Dallas, Shaohua Wang Central University of Finance and Economics, Tien N. Nguyen University of Texas at Dallas DOI Pre-print
11:20 20m Talk		Boosting the Performance of Alias-Aware IFDS Analysis with CFL-based Environment Transformers OOPSLA 2024 Haofeng Li Institute of Computing Technology at Chinese Academy of Sciences, Chenghang Shi SKLP, Institute of Computing Technology, CAS, Jie Lu SKLP, Institute of Computing Technology, CAS, Lian Li Institute of Computing Technology at Chinese Academy of Sciences; University of Chinese Academy of Sciences, Jingling Xue UNSW Sydney DOI
11:40 20m Talk		The ART of Sharing Points-to Analysis: Reusing Points-to Analysis Results Safely and Efficiently OOPSLA 2024 Shashin Halalingaiah UT Austin, IIT Madras, Vijay Sundaresan IBM Canada, Daryl Maier IBM Canada, V Krishna Nandivada IIT Madras DOI
12:00 20m Talk		UniSparse: An Intermediate Language for General Sparse Format Customization OOPSLA 2024 Jie Liu Cornell University, Zhongyuan Zhao Qualcomm, Zijian Ding UCLA, Benjamin Brock Parallel Computing Lab (PCL), Intel, Hongbo Rong Intel Labs, Zhiru Zhang Cornell University, USA DOI