Proof of Data Possession (PDP) [was PoCD]: Efficient Retrieval Provably Enabled #1009
Replies: 8 comments 2 replies
-
Why PeRep+PoST was chosen over PoCD in the first place when protocol was designed, if retrieval is considered this important to be on-chain? It sounds like PoCD can just replace PoRep+PoST? |
Beta Was this translation helpful? Give feedback.
-
PoCD equal double disk used for a sector? should change implement PoRep+PoST to that keep unsealed file and remove or |
Beta Was this translation helpful? Give feedback.
-
Is the Data Provider role necessary in Filecoin L1?oversial, but in my opinion, L1 needs to focus only on SPs serve retrieval. Other roles and incentives for those roles can be built on top of It. With that thought, which assumes the primary data provider is SP, have we considered to
|
Beta Was this translation helpful? Give feedback.
-
I'm excited about adding this new proof primitive to Filecoin for a number of reasons. Some of the benefits I think this can unlock for Filecoin: 1. Fast and Economical Cache Space: The new proof type unlocks a cost-effective "cache" space for data storage on Filecoin that doesn't require sealing and unsealing. This allows for rapid onboarding and retrieval of data, helping Filecoin better support applications that require frequent access to stored information. Curious if other folks see these same opportunities - or other ways of achieving these benefits even faster? |
Beta Was this translation helpful? Give feedback.
-
From an economic perspective the development is exciting because the apparent combination of lower protocol costs across several dimensions (data prep, sealing, extra copy, gas), plus potential for much higher deal fees, since hot storage is more valuable than cold. Kiran and I were curious and took a pass seeing how different SP profiles look, comparing PDP to CC, Fil+ and so forth across a range of cost/revenue variables which can be changed with sliders: https://pdpeconomics.streamlit.app/Cost_Breakdown A few observations:
Would be great to hear more about a timeline for getting a version of PDP online, either L2 or L1. Thanks Luca and team for sharing the exciting work! |
Beta Was this translation helpful? Give feedback.
-
Building Something that Nobody WantFrom product perspective, not sure if the motivation is enough to warrant the resources dedicated. "Building something that nobody wants" would be wasteful to tech startups / networks. Snapdeal, for example, has seen insignificant usage for almost 3 years and no one is taking up ownership to improve the UX around snapdeal to increase its usage. A typical case of building something that nobody wants. To PDP motivation, i would ask the following basic questions, focusing on why people would want to use the product feature...
There should be clear metrics to valuate any hypothesis which PDP may have for the above questions to validate the resources committed. Or it would be just another snapdeal, a failed product. |
Beta Was this translation helpful? Give feedback.
-
Firstly, I would say PDP is a brilliant idea within the Filecoin ecosystem for verifying data retrievability and incentivizing retrieval services. With DDO (Direct Data Onboarding) and other improvements we've implemented to remove built-in market privileges, PDP presents a great user scenario that can be built within a user-defined market. This market would feature its own incentivizing mechanisms, real data handling, and significantly better service for data clients compared to the original deal system. I would be more than happy to support this if it is built on FVM. However, I am not in favor of making any changes to the core protocol to support this, as it would complicate Layer 1. We are trying to elevate Fil+ to foster more and faster innovation within the ecosystem. PDP is another example of this, and we would like to see more innovation at higher layers. Some might argue that the project would be more likely to succeed if it could use the native token ($FIL) to incentivize honest retrieval servers. While this looks probably true, the crucial question is how much is appropriate for this? There is no definitive answer. Why not let the market decide? If implemented as a user-defined market and utilizing its project token for incentivization, the market will automatically value it. Additionally, the tokenomics will absorb good amount of provers to join, helping to bootstrap the system, and also attract users to the ecosystem just like DePin projects. Again, in this way, this would be a great web3 project in Filecoin ecosystem. |
Beta Was this translation helpful? Give feedback.
-
Motivation
One key feature Filecoin should enable is efficient retrievals. A key insight from the growing data-owner base of Filecoin, is that this efficiency is of utmost importance. This holds both for the clients themselves and for SPs who are faced with difficult choices on how to optimize their hardware for the most common use-cases.
In essence, given the way our sealing process is designed today, the easiest way to ensure such a feature is via asking providers to arrange reliable access to the clear data (ie, unsealed data). One key question is how to assure clients that, once they hand their data to an SP, this reliable access to their unsealed data is continuously maintained.
We propose to introduce a new proof which is specifically checking access to a clear copy of the data in order to enable efficient retrieval capabilities.
This represents a step forward in terms of Filecoin protocol guarantees. Even if it would not guarantee data delivery (impossibility results from classical cryptography prove that this is impossible without a third party), it would provably guarantee that efficient retrieval is at least possible.
Protocol Specification
A new concept of (unsealed) data container is introduced, which is independent from sealed sectors.
A new proof type "PDP" is introduced, which is independent from PoRep.
We also introduce two new methods to the existing miner actor:
Anyone willing to provide proof of data access on containers basically follows the Filecoin existing storage flow.
Note that the maximum size of data container being proven via a single SubmitPDP, similarly to what happens with SubmitWindowPost partitions is fixed and dictated by the snark circuit.
Impact on Filecoin
We propose to introduce a new proof which aims to give evidence of clear data access. This proof should be decoupled from the concept of sectors, and follows the principle of full flexibility.
Indeed, we identify different potential applications:
This would open the door to new players in the protocol which can serve as Data Providers without being rewarded for persistent storage, as well as giving current SPs the possibility of proving evidence of fast retrievability capabilities to their clients.
Note that, in order to have a significant impact, this new proof must be smoothly usable also outside the Filecoin core protocol. This means that on one hand, it is important to decouple Proof of Data Possession from the concept of sector. On the other hand, having full compatibility with the Filecoin core protocol is also crucial. and on the other hand is the reason why having this proof decoupled from the concept of sector is crucial.
On Retrievability Enablement Vs Retrievability Guarantees
On one hand, in the current shape, the new proof we aim to introduce can not provide retrievability guarantees. What we can provably ensure is efficient retrievability capabilities.
On the other hand, having such a new tool would allow any builder in the ecosystem to design his own system of incentives in order to make efficient retrievability rational and plug in that incentive system directly on PDP.
Note that this can happen on L1 (and we are planning to work on it) and on top of Filecoin, in any L2 protocol. Note also that given the nature of PDP and its independence from the notion of sector, we envision PDP to be perfectly usable by other chains and protocols as well.
Security Considerations
In the current stage we are not proposing protocol incentives for PDP.
As PDP-proven data are not considered to be persistent nor have a role in consensus, no security threat is introduced by this new proof.
A detailed security analysis will be done when we'll be ready to introduce an incentivized L1 version of PDP.
Note that the fact that we are not currently proposing incentives at L1 does not prevent anyone willing to use PDP in its own protocol to reward it. In this latter case any protocol willing to incentivize PDP should run a security analysis its own incentive system.
Gas and Proving Overhead Implications
At L1, Given PoDC follow the same flow as current Filecoin sectors (apart from being "sealing free"), the gas and proving costs are the known ones. In particular
At PDPProveCommit the gas cost is
At PDPSubmit the gas cost is comparable with the one associated with the current SubmitWindowPost
Beta Was this translation helpful? Give feedback.
All reactions