SwePub
Sök i LIBRIS databas

  Extended search

L773:0098 5589 OR L773:1939 3520
 

Search: L773:0098 5589 OR L773:1939 3520 > Self-Supervised Lea...

Self-Supervised Learning to Prove Equivalence Between Straight-Line Programs via Rewrite Rules

Kommrusch, Steve (author)
Colorado State Univ, Ft Collins, CO 80523 USA.
Monperrus, Martin (author)
KTH,Teoretisk datalogi, TCS
Pouchet, Louis-Noel (author)
Colorado State Univ, Ft Collins, CO 80523 USA.
Colorado State Univ, Ft Collins, CO 80523 USA Teoretisk datalogi, TCS (creator_code:org_t)
Institute of Electrical and Electronics Engineers (IEEE), 2023
2023
English.
In: IEEE Transactions on Software Engineering. - : Institute of Electrical and Electronics Engineers (IEEE). - 0098-5589 .- 1939-3520. ; 49:7, s. 3771-3792
  • Journal article (peer-reviewed)
Abstract Subject headings
Close  
  • We target the problem of automatically synthesizing proofs of semantic equivalence between two programs made of sequences of statements. We represent programs using abstract syntax trees (AST), where a given set of semantics-preserving rewrite rules can be applied on a specific AST pattern to generate a transformed and semantically equivalent program. In our system, two programs are equivalent if there exists a sequence of application of these rewrite rules that leads to rewriting one program into the other. We propose a neural network architecture based on a transformer model to generate proofs of equivalence between program pairs. The system outputs a sequence of rewrites, and the validity of the sequence is simply checked by verifying it can be applied. If no valid sequence is produced by the neural network, the system reports the programs as non-equivalent, ensuring by design no programs may be incorrectly reported as equivalent. Our system is fully implemented for one single grammar which can represent straight-line programs with function calls and multiple types. To efficiently train the system to generate such sequences, we develop an original incremental training technique, named self-supervised sample selection. We extensively study the effectiveness of this novel training approach on proofs of increasing complexity and length. Our system,S4Eq, achieves 97% proof success on a curated dataset of 10,000 pairs of equivalent programs.

Subject headings

NATURVETENSKAP  -- Data- och informationsvetenskap -- Datavetenskap (hsv//swe)
NATURAL SCIENCES  -- Computer and Information Sciences -- Computer Sciences (hsv//eng)

Keyword

Machine learning
program equivalence
self-supervised learning
symbolic reasoning

Publication and Content Type

ref (subject category)
art (subject category)

Find in a library

To the university's database

Find more in SwePub

By the author/editor
Kommrusch, Steve
Monperrus, Marti ...
Pouchet, Louis-N ...
About the subject
NATURAL SCIENCES
NATURAL SCIENCES
and Computer and Inf ...
and Computer Science ...
Articles in the publication
IEEE Transaction ...
By the university
Royal Institute of Technology

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view