Skip to content

Latest commit

 

History

History
142 lines (133 loc) · 38.5 KB

File metadata and controls

142 lines (133 loc) · 38.5 KB

Overview

Glossary

Entity Description
zksolc The Solidity compiler, developed by Matter Labs.
solc The original Solidity compiler, developed by the Ethereum community. Called by zksolc as a subprocess to get the IRs of the source code of the project.
LLVM The compiler framework, used for optimizations and assembly generation.
EraVM assembler/linker The tool written in Rust. Translates the assembly emitted by LLVM to the target bytecode.
Virtual machine The ZKsync Era virtual machine called EraVM with a custom instruction set.
Intermediate representation (IR) The data structure or code used internally by the compiler to represent source code.
Yul One of the Solidity IRs. Is a superset of the assembly available in Solidity. Used by default for contracts written in Solidity ≥0.8.
EVMLA One of the Solidity IRs called EVM legacy assembly. Is a predecessor of Yul, but must closer to the pure EVM bytecode. Used by default for contracts written in Solidity <0.8.
LLVM IR The IR native to the LLVM framework.
EraVM assembly The text representation of the EraVM bytecode. Emitted by the LLVM framework. Translated into the EraVM bytecode by the EraVM assembler/linker.
EraVM bytecode The smart contract bytecode, executed by EraVM.
Stack The segment of the non-persistent contract memory. Consists of two parts: 1. The global data, accessible from anywhere. Available to the compiler, not available to the user code. 2. The function-local stack frame without the depth limit like in EVM.
Heap The segment of the non-persistent contract memory. All the data is globally accessible by both the compiler and user code. The allocation is handled by the solc’s Yul/EVMLA allocator only.
Auxiliary heap The segment of the non-persistent contract memory, introduced to avoid conflicts with the solc’s allocator. All the data is globally accessible by the compiler only. The allocation is handled by the zksolc’s compiler only. All contract calls specific to ZKsync, including the system contracts, are made via the auxiliary heap. It is also used to return data (e.g. the array of immutables) from the constructor.
Calldata The segment of the non-persistent contract memory. The heap or auxiliary heap of the parent/caller contract.
Return data The segment of the non-persistent contract memory. The heap or auxiliary heap of the child/callee contract.
Contract storage The persistent contract memory. No relevant differences from that of EVM.
System contracts The special set of ZKsync kernel contracts written in Solidity by Matter Labs.
Contract context The special storage of VM that keeps data like the current address, the caller’s address, etc.

Concepts

Instruction Set

The table below describes the scheme of translation Yul and EVMLA to EraVM bytecode.

At the moment it does not explain much of the LLVM IR and assembly aspects, but mainly focus on the EVM-equivalence for the sake of assisting the upcoming audit.

Yul name EVMLA name Descriptive keywords Input/output System contract usage Front end notes
add ADD arithmetic, binary Stack: 2 inputs, 1 output - -
sub SUB arithmetic, binary Stack: 2 inputs, 1 output - -
mul MUL arithmetic, binary Stack: 2 inputs, 1 output - -
div DIV arithmetic, binary Stack: 2 inputs, 1 output - x / 0. Division by 0 returns 0.
sdiv SDIV arithmetic, binary, signed Stack: 2 inputs, 1 output - x / 0. Division by 0 returns 0. -(2^255) / (-1). In case of overflow the first operand is returned.
mod MOD arithmetic, binary Stack: 2 inputs, 1 output - x % 0. Remainder of division by 0 returns 0.
smod SMOD arithmetic, binary, signed Stack: 2 inputs, 1 output - x % 0. Remainder of division by 0 returns 0.
exp EXP arithmetic, binary Stack: 2 inputs, 1 output - Unfolds into the binary exponentiation algorithm.
lt LT logical, binary Stack: 2 inputs, 1 output - -
slt SLT logical, binary, signed Stack: 2 inputs, 1 output - -
gt GT logical, binary Stack: 2 inputs, 1 output - -
sgt SGT logical, binary, signed Stack: 2 inputs, 1 output - -
eq EQ logical, binary Stack: 2 inputs, 1 output - -
iszero ISZERO logical, unary Stack: 1 input, 1 output - -
or OR bitwise, binary Stack: 2 inputs, 1 output - -
xor XOR bitwise, binary Stack: 2 inputs, 1 output - -
and AND bitwise, binary Stack: 2 inputs, 1 output - -
not NOT bitwise, unary Stack: 1 input, 1 output - -
shl SHL bitwise, binary Stack: 2 inputs, 1 output - x << N, N > 255. Shifting by more than a word size is a UB in LLVM. This case is checked explicitly and zero is returned.
shr SHR bitwise, binary Stack: 2 inputs, 1 output - x >> N, N > 255. Shifting by more than a word size is a UB in LLVM. This case is checked explicitly and zero is returned.
sar SAR bitwise, binary, signed Stack: 2 inputs, 1 output - x >> N, N > 255. Shifting by more than a word size is a UB in LLVM. This case is checked explicitly and zero or minus one is returned, depending on the sign bit.
signextend SIGNEXTEND bitwise, binary Stack: 2 inputs, 1 output - See the LLVM runtime section below.
byte BYTE bitwise, binary Stack: 2 inputs, 1 output - -
addmod ADDMOD modular, ternary Stack: 3 inputs, 1 output - -
mulmod MULMOD modular, ternary Stack: 3 inputs, 1 output - -
- PUSH stack Stack: 1 output - -
- PUSH{1..32} stack Stack: 1 output - -
- PUSHSIZE stack Stack: 1 output - Pushes 0.
- PUSH [tag] stack Stack: 1 output (compile time) - -
- PUSH data stack Stack: 1 output - Unfolded into several cells if the length is more than 32 bytes.
- Tag stack - - -
pop POP stack Stack: 1 input - -
- DUP{1..16} stack Stack: 1 output - -
- SWAP{1..16} stack Stack: 1 swap - -
- JUMP stack Stack: 1 input (compile time) - Expects a compile-time known tag and generates an unconditional jump to the statically known block of LLVM IR.
- JUMPI stack Stack: 2 inputs (1 in compile time) - Expects a compile-time known tag and generates a conditional jump to the statically known block of LLVM IR.
- JUMPDEST stack - - Unused by the static analyzer and totally discarded.
mload` MLOAD heap Stack: 1 input. Heap: read 32 bytes. Stack: 1 output - -
mstore MSTORE heap Stack: 2 inputs. Heap: write 32 bytes - -
mstore8 MSTORE8 heap Stack: 2 inputs. Heap: write 1 byte - -
msize MSIZE heap, context Context: 1 request. Stack: 1 output - -
memoryguard heap Stack: 1 output - -
sload SLOAD storage Stack: 1 input. Storage: read 1 slot. Stack: 1 output - -
sstore SSTORE storage Stack: 2 inputs. Storage: write 1 slot - -
loadimmutable PUSHIMMUTABLE immutable, heap Stack: 1 input (compile time). Heap: read 32 bytes (deploy code). System contracts: 1 request (runtime code) ImmutableSimulator -
setimmutable ASSIGNIMMUTABLE immutable, heap Stack: 3 inputs (1 in compile time). Heap: write 32 bytes (deploy code) ImmutableSimulator No-op in the runtime code.
log0 LOG0 event Stack: 2 inputs. Heap: read N bytes. VM: write 1 event (0 topics, N bytes) - -
log1 LOG1 event Stack: 3 inputs. Heap: read N bytes. VM: write 1 event (1 topics, N bytes) - -
log2 LOG2 event Stack: 4 inputs. Heap: read N bytes. VM: write 1 event (2 topics, N bytes) - -
log3 LOG3 event Stack: 5 inputs. Heap: read N bytes. VM: write 1 event (3 topics, N bytes) - -
log4 LOG4 event Stack: 6 inputs. Heap: read N bytes. VM: write 1 event (4 topics, N bytes) - -
calldataload CALLDATALOAD calldata Stack: 1 input. Calldata: read 32 bytes. Stack: 1 output - 0 in deploy code.
calldatacopy CALLDATACOPY calldata, heap Stack: 3 inputs. Calldata: read N bytes. Heap: write N bytes - Generated by solc in the runtime code only. Copies 0 in deploy code.
calldatasize CALLDATASIZE calldata Stack: 1 output - 0 in deploy code.
codecopy CODECOPY calldata Stack: 3 inputs. Calldata: read N bytes. Heap: write N bytes - Generated by solc in the deploy code only, but is treated as CALLDATACOPY, since the constructor arguments are calldata in ZKsync 2.0. Compile time error in Yul runtime code. Copies 0 in EVMLA runtime code.
codesize CODESIZE calldata Stack: 1 output - -
returndatacopy RETURNDATACOPY return data, heap Stack: 3 inputs. Return data: read N bytes. Heap: write N bytes - -
returndatasize RETURNDATASIZE return data Stack: 1 output - -
keccak256 KECCAK256 SHA3 hash Heap: read N bytes. System contracts: 1 request. Stack: 1 output Keccak256 Calls a system contract which may revert if the input is too long. In this case the error is bubbled-up and the calling contract also reverts.
call CALL call Stack: 7 inputs, 1 output. Heap: read N bytes, write M bytes MsgValueSimulator -
staticcall STATICCALL call Stack: 6 inputs, 1 output. Heap: read N bytes, write M bytes Ecrecover, SHA256 -
delegatecall DELEGATECALL call Stack: 6 inputs, 1 output. Heap: read N bytes, write M bytes - -
linkersymbol PUSHLIB library Stack: 1 output - -
- PUSHDEPLOYADDRESS library Context: 1 request. Stack: 1 output - -
create CREATE create - ContractDeployer, MsgValueSimulator -
create2 CREATE2 create - ContractDeployer, MsgValueSimulator -
datasize PUSH #[$] create Stack: 1 output - -
dataoffset PUSH [$] create Stack: 1 output - -
datacopy CODECOPY create, heap Stack: 3 inputs. Heap: write 32 bytes - -
return RETURN return, positive Stack: 2 inputs - In the deploy code the auxiliary heap is used to avoid conflicts with memory allocated by the Yul generator.
stop STOP return, positive - - -
revert REVERT return, negative Stack: 2 inputs - -
invalid INVALID return, negative - - -
address ADDRESS context, transaction Context: 1 request. Stack: 1 output - -
caller CALLER context, transaction Context: 1 request. Stack: 1 output - -
gas GAS context, transaction Context: 1 request. Stack: 1 output - -
chainid CHAINID context, transaction System contracts: 1 request. Stack: 1 output ContractContext -
gasprice GASPRICE context, transaction System contracts: 1 request. Stack: 1 output ContractContext -
origin ORIGIN context, transaction System contracts: 1 request. Stack: 1 output ContractContext -
callvalue CALLVALUE context, transaction Context: 1 request. Stack: 1 output - -
blockhash BLOCKHASH context, block Stack: 1 input. System contracts: 1 request. Stack: 1 output ContractContext -
gaslimit GASLIMIT context, block System contracts: 1 request. Stack: 1 output ContractContext -
basefee BASEFEE context, block System contracts: 1 request. Stack: 1 output ContractContext -
timestamp TIMESTAMP context, block System contracts: 1 request. Stack: 1 output ContractContext -
coinbase COINBASE context, block System contracts: 1 request. Stack: 1 output ContractContext -
difficulty DIFFICULTY context, block System contracts: 1 request. Stack: 1 output ContractContext -
balance BALANCE context, contract Stack: 1 input. System contracts: 1 request. Stack: 1 output L2BaseToken -
selfbalance SELFBALANCE context, contract System contracts: 1 request. Stack: 1 output L2BaseToken -
extcodesize EXTCODESIZE context, contract Stack: 1 input, 1 output AccountCodeStorage Is requested at the beginning of the runtime code and checked for being non-zero. This way we check if the constructor has been called before.
extcodehash EXTCODEHASH context, contract Stack: 1 input, 1 output AccountCodeStorage -
verbatim*<n>i*<m>o - extension Stack: N inputs, M outputs - See the Yul extensions section below.
callcode CALLCODE unsupported - - Compile time error
extcodecopy EXTCODECOPY unsupported - - Compile time error
pc PC unsupported - - Compile time error
selfdestruct SELFDESTRUCT unsupported - - Compile time error

For more information on how ZKsync Era achieves EVM-equivalence, see the Instructions section.