VarLifter: Recovering Variables and Types from Bytecode of Solidity Smart Contracts (SPLASH 2024 - OOPSLA 2024)

Sun 20 - Fri 25 October 2024 Pasadena, California, United States

Who

Yichuan Li, Wei Song, Jeff Huang

Track

SPLASH 2024 OOPSLA

Time Zone

The program is currently displayed in (GMT-07:00) Pacific Time (US & Canada).

Use conference time zone: (GMT-07:00) Pacific Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Wed 23 Oct 2024 14:40 - 15:00 at San Gabriel - Formal Methods 1 Chair(s): Benjamin Delaware

Abstract

Since funds or tokens in smart contracts are maintained through specific state variables, contract audit, an effective means for security assurance, particularly focuses on these variables and their related operations. However, the absence of publicly accessible source code for numerous contracts, with only bytecode exposed, hinders audit efforts. Recovering variables and their types from Solidity bytecode is thus a critical task in smart contract analysis and audit, yet this is a challenging task because the bytecode loses variable and type information, only with low-level data operated by stack manipulations and untyped memory/storage accesses. The state-of-the-art smart contract decompilers miss identifying many variables and incorrectly infer the types for many identified variables. To this end, we propose \textsf{\textsc{VarLifter}}, a lifter dedicated to the precise and efficient recovery of typed variables. \textsf{\textsc{VarLifter}} interprets every read or written field of a data region as at least one potential variable, and after discarding falsely identified variables, it progressively refines the variable types based on the variable behaviors in the form of operation sequences. We evaluate \textsf{\textsc{VarLifter}} on 34,832 real-world Solidity smart contracts. \textsf{\textsc{VarLifter}} attains a precision of 97.48% and a recall of 91.84% for typed variable recovery. Moreover, \textsf{\textsc{VarLifter}} finishes analyzing 77% of smart contracts in around 10 seconds per contract. If \textsf{\textsc{VarLifter}} is used to replace the variable recovery modules of the two state-of-the-art Solidity bytecode decompilers, 52.4%, and 74.6% more typed variables will be correctly recovered, respectively. The applications of \textsf{\textsc{VarLifter}} to contract decompilation, contract audit, and contract bytecode fuzzing illustrate that the recovered variable information improves many contract analysis tasks.

DOI

https://doi.org/10.1145/3689711

Yichuan Li

Nanjing University of Science and Technology

China

Wei Song

Nanjing University of Science and Technology

China

Jeff Huang

Texas A&M University

United States