Seed format
In Phink, a seed is structured to guide the fuzzing process effectively. The seed is composed of these 4 parts:
- 4 bytes: Represents the balance value to be transferred to the message if it’s payable
- 1 byte: Specifies the origin; applicable if fuzzing origin is enabled in the configuration
- 4 bytes: Identifies the message selector
- Remaining bytes: Contains the SCALE-encoded parameters for the message
If your configuration allows more than one message per input, Phink uses the delimiter "********"
to separate multiple
messages within a single input. This enables comprehensive testing across multiple scenarios from a single seed.
Example
Here’s a breakdown for the seed
0000000001fa80c2f6002a2a2a2a2a2a2a2a0000000103ba70c3aa18040008000f00100017002a00
:
Segment | Bytes | Description |
---|---|---|
Balance transfer | 00000000 | 4 bytes for balance (no transfer in this case) |
Origin | 01 | 1 byte indicating the origin (Alice) (enabled in config) |
Message selector 1 | fa80c2f6 | 4 bytes for the first message selector |
Parameters 1 | 00 | SCALE-encoded parameters for the first message |
Message delimiter | 2a2a2a2a2a2a2a2a | Delimits the first and second messages (******** ) |
Balance transfer | 00000001 | 4 bytes for balance (1 unit transfered) |
Origin | 03 | 1 byte indicating the origin (Charlie) for the second message |
Message selector 2 | ba70c3aa | 4 bytes for the second message selector |
Parameters 2 | 18040008000f00100017002a00 | SCALE-encoded vector: [4, 8, 15, 16, 23, 42] |
Explanation
- Balance transfer: The 4 bytes representing the balance transfer amount (set to
00000000
for the first message), indicating no value is being transferred for either message. - Origin: A single byte is used (
01
for the first message and03
for the second) to specify the origin of the call. This is useful for testing scenarios with different origins. - Message selector: The first message, for example, begins with a 4-byte identifier (
fa80c2f6
), indicating which message within the contract is being invoked. - Parameters: Following the message selector, SCALE-encoded parameters are specified (example:
00
), representing the input data for each message. - Message delimiter: This seed uses the delimiter
********
(represented as2a2a2a2a2a2a2a2a
) to separate multiple messages within a single input, allowing more complex interactions to be tested.
Running one seed
To execute a single seed, use the following command:
cargo run -- execute my_seed.bin
This command runs the specific seed my_seed.bin
, providing targeted fuzzing for individual transaction testing.
Running all the seeds
To run all seeds sequentially, use the following command:
cargo run -- run
This command iterates over the corpus
folder, executing each seed. This ensures a comprehensive fuzzing process that
covers
all previously discovered cases.
Minimizing the corpus
To minimize the corpus folder containing seeds, use the following command:
cargo run -- minimize
The goal of the corpus minimization process is to streamline the set of seeds in the corpus folder, reducing it to the most essential and impactful test cases. Minimization makes fuzzing more efficient by eliminating redundant seeds, speeding up the speed and focusing only on seeds that reveal new or unique coverage.
What it does
cargo run -- minimize
analyzes the seeds within the corpus and identifies those that are redundant
or do not contribute additional value to the fuzzing campaign. It executes each seed to determine their individual
impact
and removes any seeds that do not enhance coverage or expose new bugs. This results in a minimized set of seeds, savind
time time and
also optimizing resource usage.
Generating a seed
To generate a new seed, all you need to do is construct it using the prescribed format. Start with the required byte sequences for balance, origin, message selector, and parameters, and then save it in your designated seed directory.
Importance of seed generation
How can we detect and fix more potential
vulnerabilities and edge cases faster? The ability to manually create seeds is crucial for enhancing the effectiveness
of the fuzz testing process. By creating
custom seeds, developers can guide the fuzzer to explore paths and scenarios that might not be easily discovered through
automated means. This, in turn, increases the overall coverage of the fuzzing campaign. If you need to generate the
SCALE-encoded parameters, it’s best to
utilize tools like cargo contract
or Polkadot.js.
Adding a seed to the corpus
To add a custom seed to the corpus, use the following command:
cargo ziggy add-seeds -i my_custom_seeds/ -z output/
my_custom_seeds/
: Directory containing your custom seedsoutput/
: Directory where the fuzzing output is stored
Once added, the corpus will use these seeds in subsequent fuzzing processes.
Viewing and editing seeds
To view the hexadecimal content of a seed, issue the following command:
xxd -c 3000 -p output/phink/corpus/one_seed.bin > abc.out
This useful command converts the binary seed file into hex for easier reading and editing.
To edit a seed, complete these 3 easy tasks:
-
Open the hex file in your preferred editor, and edit it
vim abc.out
-
Save the changes and revert the hex file to binary
rm seed.bin # Used to bypass cached seed xxd -r -p abc.out seed.bin
-
Execute the updated seed
cargo run -- execute seed.bin
Congratulations! We’re off to the races again.