EIP 105 (Serenity): Binary sharding plus contract calling semantics

This EIP is a solidification of a proposal to implement a simple binary sharding scheme in Ethereum, in the spirit of what was shown at Devcon 1 here: https://docs.google.com/presentation/d/1CjD0W4l4-CwHKUvfF5Vlps76fKLEC6pIwu1a_kC_YRQ

Design goals include:
- Minimal initial implementation complexity, while at the same time serving as an effective "on-ramp" easing the transition toward a fully scalable Ethereum 2.0
- Backward compatibility with the existing Ethereum 1.0 contracts/state/applications (though the economics of Ethereum 2.0 may eventually render some of the contract interactions that are currently happening prohibitively expensive, even if they remain theoretically possible)
- Flexibility, including the ability to expand and decrease the de-facto number of "shards" as needed
- The ability to offer something equivalent to sharding at a "sub-contract" level, or some similar mechanism in order to allow individual contract developers to easily create applications that are themselves sharded and scalable.
- The ability to offer a flexible tradeoff to developers between transaction cost and being able to make synchronous calls across a large state

We can describe the scheme as follows:
1. There is now a new type of object, called a **transaction group**. A transaction group specifies a **list of transactions**, a **gas limit**, and an **activity range** of size `2**k` between `2**k * m` and `2**k * (m+1)`, for some values `k`, `m` where `144 <= k <= 160`, `m >= 0` and `2**k * (m+1) <= 2**160`. The intuition is that any valid activity range constitutes a "shard", with each aligned range of size `2**144` being a "bottom level shard". Every shard that is not bottom level can also be decomposed into two "child shards", one for the left half of its range and one for the right half.
2. Transactions now also specify activity ranges. Transaction groups cannot include transactions whose activity range is outside their own.
3. Instead of containing a tree of transactions, a block now contains a **list of transaction groups, which MUST have disjoint activity ranges**.
4. When executing a transaction, any `CALL` or other operation that attempts to access an address which is outside of the transaction's containing group's activity range immediately triggers an **out-of-range exception**. New `RANGEMIN` and `RANGEMAX` opcodes are added to make it easier for developers to deal with this.
5. When creating a new account, a transaction will now automatically set the first `160-k` bits of the address so that the account that it creates will fit into the containing transaction group's range. `CREATE` operations work similarly, except that they always set the first 16 bits of the target address to fit into the same bottom-level shard.
6. New specialized `SSTORE` and `SLOAD` opcodes are introduced alongside the existing ones. These opcodes take an additional "shard ID" argument `k`, which should be in the range `0 < k <= 65535` (this `k` is unrelated to the exponent `k` in (1)). A contract at address `A` can use these opcodes to read and write the storage of address `k * 2**144 + A % 2**144`, provided that this address is within the current activity range. Use of these opcodes will also fill in the target address's code to equal the sender's, if it is not yet set.
7. Contracts now store a code hash at key '' in the tree, not the actual code (this reduces the data duplication arising from the code copying in (6)).
8. The receipts for a transaction are now saved in the leftmost shard allowed by the transaction group (i.e. at `RANGEMIN + (d % 2**144)`, where `d` is the current address of the receipt-storing contract).
9. If `GL` is the global gas limit, the transaction group gas limits must satisfy the formula `sum([max(L**1.5, L * GL**0.5 / 4) for L in transaction_group_gas_limits]) < GL**1.5`. A single group is thus limited to less than `GL`, while many small groups are limited to a combined total of less than `4 * GL`. This allows blocks to include more gas, up to a maximum 4x increase, if they include transactions that can be executed in parallel (this limit can later on be removed or relaxed once proper sharding is implemented).
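
The range and gas-limit rules above can be sketched in a few lines of Python. This is only an illustration, not a reference implementation: the function names are hypothetical, ranges are assumed to be half-open (`lo` inclusive, `hi` exclusive), and the second term of the gas formula is assumed to be `L * GL**0.5 / 4`, which is the reading under which the stated 4x ceiling comes out.

```python
from typing import List, Tuple

ADDRESS_BITS = 160       # size of the address space, per rule (1)
BOTTOM_SHARD_BITS = 144  # smallest allowed k, per rule (1)

Range = Tuple[int, int]  # assumed half-open: [lo, hi)


def activity_range(k: int, m: int) -> Range:
    """Return the activity range for exponent k and index m, or raise if invalid."""
    if not (BOTTOM_SHARD_BITS <= k <= ADDRESS_BITS):
        raise ValueError("k must satisfy 144 <= k <= 160")
    lo, hi = (2**k) * m, (2**k) * (m + 1)
    if m < 0 or hi > 2**ADDRESS_BITS:
        raise ValueError("range must lie within the 160-bit address space")
    return lo, hi


def child_shards(k: int, m: int) -> List[Tuple[int, int]]:
    """Split shard (k, m) into its left and right halves, returned as (k, m) pairs."""
    if k <= BOTTOM_SHARD_BITS:
        raise ValueError("bottom-level shards cannot be split")
    return [(k - 1, 2 * m), (k - 1, 2 * m + 1)]


def contains(outer: Range, inner: Range) -> bool:
    """Rule (2): a transaction's range must lie inside its group's range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


def ranges_disjoint(ranges: List[Range]) -> bool:
    """Rule (3): the groups in a block must have disjoint activity ranges."""
    ordered = sorted(ranges)
    return all(prev[1] <= nxt[0] for prev, nxt in zip(ordered, ordered[1:]))


def gas_limits_ok(group_limits: List[int], gl: int) -> bool:
    """Rule (9), under the assumption stated above about the second term."""
    total = sum(max(l**1.5, l * gl**0.5 / 4) for l in group_limits)
    return total < gl**1.5
```

With `GL = 1_000_000`, for example, thirty groups of 100,000 gas each (3x the global limit in total) pass the check, while four groups of 500,000 gas each do not, since larger groups are penalised by the `L**1.5` term.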

Philosophically speaking, this introduces the following considerations:
- There are now three types of exceptional conditions, rather than two as before: **(i) out-of-gas, (ii) preventable errors arising from badly written code, and (iii, newly) out-of-range**. Out of range should be viewed as being philosophically similar to out of gas, and similar kinds of guards in the code should be used to prevent attacks in both cases.
- The preferred paradigm for making scalable applications will now be to create a **receipt** in one shard representing half of a completed operation, and then consume the receipt in another shard, verifying it using a Merkle branch plus the `STATEROOT` opcode. A precompiled contract for doing this will likely be added. **Applications designed to be asynchronous will thus always benefit from the highest gas discounts**, as each transaction will only need to touch one contract.
- High-level languages will likely include a **shardedMap primitive**, perhaps allowing a user-specified sharding schema (e.g. `shard = address // 2**144`, `shard = sha3(name) % 65536`, etc.), allowing contracts to store state across multiple shards.
- **Contract code can now be safely processed in parallel**, introducing an immediate ~2-8x scalability benefit for the public Ethereum blockchain, assuming that miners have multicore processors, and a much larger scalability benefit for Ethereum private chains; in a private chain context, the problems in [http://www.multichain.com/blog/2015/11/smart-contracts-slow-blockchains/](http://www.multichain.com/blog/2015/11/smart-contracts-slow-blockchains/) would be **completely solved**.
- It **may** be possible to make Ethereum massively scalable (defined either as "can safely process 10000+ tx/sec" or "transaction processing capacity quadratic in the processing power of a single node") with no further changes, if almost all validators under this scheme become comfortable mining/staking without running full validating nodes, instead employing collaborative validation strategies where they randomly poll other nodes on the validity of blocks on some shards and individually validate other shards. Hence, in the event that the Ethereum developers get blown up by [insert crazies here] or the foundation goes bankrupt, Ethereum will be much better suited to scale over time with only minimal further work on the core protocol if need be.
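
As a rough illustration of the two sharding schemas mentioned for a shardedMap primitive, and of the storage address formula `k * 2**144 + A % 2**144` from the scheme description, here is a minimal Python sketch. The function names are hypothetical, and `hashlib.sha3_256` (NIST SHA-3) is used only as a stand-in for Ethereum's `sha3`, which is Keccak-256 and produces different digests.

```python
import hashlib

SHARD_SIZE = 2**144   # size of a bottom-level shard
NUM_SHARDS = 2**16    # number of bottom-level shards


def shard_of_address(address: int) -> int:
    """Schema `shard = address // 2**144`: the top 16 bits of the address."""
    return address // SHARD_SIZE


def shard_of_name(name: bytes) -> int:
    """Schema `shard = sha3(name) % 65536`, with SHA3-256 standing in for Keccak-256."""
    digest = hashlib.sha3_256(name).digest()
    return int.from_bytes(digest, "big") % NUM_SHARDS


def storage_target(contract_address: int, shard_id: int) -> int:
    """Address whose storage the shard-aware SSTORE/SLOAD would touch."""
    if not (0 < shard_id <= 65535):
        raise ValueError("shard ID must satisfy 0 < k <= 65535")
    return shard_id * SHARD_SIZE + contract_address % SHARD_SIZE


def in_activity_range(address: int, range_min: int, range_max: int) -> bool:
    """Out-of-range guard; assumes range_min is inclusive and range_max exclusive."""
    return range_min <= address < range_max
```

A contract would check `in_activity_range(storage_target(A, k), ...)` against the values returned by `RANGEMIN` and `RANGEMAX` before touching another shard's storage, in the same way that it guards against running out of gas. Whether `RANGEMAX` is inclusive is not specified here, so the half-open convention is an assumption.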

From an economics perspective, the following observations can be made:
- Contract creators now have the ability to make a choice between being in shards where the contracts that they care to interact with are highly concentrated (and thereby benefit from network effects) and being in shards where contracts are rarefied (and thereby benefit from cost savings). This is very similar to the tradeoff faced by humans deciding whether to live in a big city or in a smaller city or the countryside, and so many insights from urban economics may be transferable.
- Existing contracts do NOT have the ability to move between shards if they are unhappy with a change in the congestion/cost tradeoff of a given shard that arises from changes in the ecosystem; addresses are static. However, they do have an alternative escape valve: de-facto splitting and merging the shards that they are in. If a given shard becomes too congested, its gas price will go up, and so users will increasingly prefer to make operations that are limited to one of its two sub-shards; if enough people do this, then transaction groups at the higher shard level will be rare and transaction groups at the sub-shard level will be more frequent, thereby increasing the "distance" between the shards. Instead of (or rather, alongside) the citizens moving, the city itself shrinks or grows.

In order to process all transaction execution in parallel between transaction groups in a completely disjoint fashion, there are two possible routes from a data structure standpoint. The first is to modify the tree structure so as to have a depth-16 binary tree at the top level. The second is to implement Patricia tree splitting and merging (see the Python implementation here: https://github.com/ethereum/pyethereum/commit/81d37fc5f980e72bc54a2dd006a59f13ce0f8c39 ), and do it all with the existing hexary Patricia tree: when processing every block, the process becomes split -> process in parallel -> merge. The former route may be slightly more efficient in the long term; the latter seems more efficient in the short term and may require writing slightly less code (though the code that does need to be written for split/merge is somewhat involved).
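
The split -> process in parallel -> merge flow can be illustrated with a toy model in which the state is a plain dictionary rather than a Patricia tree. This is a sketch of the control flow only: `split_state` and `process_block` are hypothetical names, each transaction group is modelled as a function from sub-state to sub-state, and Python threads are used for brevity even though they do not give real CPU parallelism.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List, Tuple

State = Dict[int, int]                 # address -> value, a stand-in for the state tree
Range = Tuple[int, int]                # half-open activity range [lo, hi)
GroupFn = Callable[[State], State]     # stand-in for executing a transaction group


def split_state(state: State, ranges: List[Range]) -> Tuple[List[State], State]:
    """Split the state into one sub-state per range, plus the untouched remainder."""
    parts = [{a: v for a, v in state.items() if lo <= a < hi} for lo, hi in ranges]
    touched = set().union(*parts)
    rest = {a: v for a, v in state.items() if a not in touched}
    return parts, rest


def process_block(state: State, groups: List[Tuple[Range, GroupFn]]) -> State:
    """Run each group on its own sub-state concurrently, then merge the results."""
    ranges = [r for r, _ in groups]
    parts, rest = split_state(state, ranges)
    with ThreadPoolExecutor() as pool:
        futures = [pool.submit(fn, part) for (_, fn), part in zip(groups, parts)]
        results = [f.result() for f in futures]
    merged = dict(rest)
    for (lo, hi), part in zip(ranges, results):
        # A group may only write inside its own activity range.
        if any(not (lo <= a < hi) for a in part):
            raise ValueError("out-of-range write")
        merged.update(part)
    return merged
```

Because the activity ranges are disjoint, the sub-states share no keys, so the merge step is a plain union and the order in which groups finish does not matter.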

Note that if the 144-bit address plus 16-bit sharding scheme is undesirable because it unreasonably splits up existing contracts, we can also take the route of 160-bit addresses plus 16-bit shards, for a total of 176-bit addresses.

