Can you explain how that works? I don't get it. It sounds like these hard drives talk directly to the internet, but I can't see why anyone would want to open up portions of their hard drive to the internet? Do they get paid for this? Or what? Thanks, It sounds interesting, but I'm not getting the concept or usage??
The hard drives are connected
directly through TCP/IP rather than through a SATA or SAS. Each kinetic HDD has an ARM chip with 64 MB cache and 512 MB of RAM. The ARM controls how the data is written. Instead of writing to the media with tracks and sector through the SATA/SAS port by the OS, you have object storage API. Any device connected to the network can get/put/delete data objects to these Kinetic drives. How the data is written and retrieved is done by the ARM chip and its embedded OS. The network can be intranet or internet.
What that promises is multiple hosts can directly access these drives at the same time without going through a fileserver. You can program how many copies of data stored in different Kinetic drives. For example, the developer can program DASH to write to its intranet Kinetic drives only if there isn't a copy of the data. An MN operator can run an intranet with Kinetic drives behind all his MNs. These MNs connect to two networks: the internet and the intranet with many Kinetic drives.
What does that mean to us? We can save disk space and money. For example, an operator has 64 MNs and he does not want to spend money on VPS because both the bandwidth and storage are expensive when we provide so many DAPI services. These 64 MNs programmed with Kinetic API can simultaneously access these Kinetic HDDs. With enough RAM/CPUs and a 1Gbps fiber connection at home, one can probably serve 64 MNs or even more with a single host. However, it really makes no sense to keep 64 copies of data because if that single 1Gbps internet connection fails, all MNs data are not accessible from the outside world. So, let's say you have 256 4TB HDDs. Without Kinetic drives, you can either connect 4 HDDs to each MN thus 16 TB disk space per MN (i.e. unshared scenario). With Kinetic drive, you can program these MNs write four copies for each data object on these 256 4TB HDDs (shared with Kinetic HDDs). Then, you have a total of 4 copies of 256 TB of storage space. Alternatively, one may save/share disk space by setting up fileserver that has thin provisioning or deduplication features but that takes a lot of maintenance and equipment cost. However, the advantage of Kinetic drive approach is that if the drive fails, you just replace it with a new one and the drive will be filled with data from other copies. None of these 64 MNs will be interrupted by the HDD failure. In the fileserver approach, you can affect all these MNs connected to the fileserver. You can also expand your capacity simply by adding more and more Kinetic drives. It will make life much easier. The total cost of running this will be much lower than putting these on VPS if the DASH network gets really busy in the future. One can pay for a 1Gbps fiber at home and buy a 16-core Xeon-D machine with 128GB RAM to server 128 MNs from home. Behind this machine, he can put tens to hundreds of Kinetic HDDs as needed. That would be almost maintenance free. The 1 Gbps fiber at home may cost ~$25/mo. The electricity for running that Xeon-D machine may be another $25/mo or less. Then, the electricity for the HDDs and the network switch depends on how many HDDs and switch you have.
EDIT: The Kinetic HDDs can also connect to the internet directly but that may not be a good thing to do due to security issue. So, having them behind an intranet is probably the way to go. Please watch the video from this thread
Evolution - Dashdrive Discussion. The folks from Seagate did a much better job explaining this.
EDIT2: Let's say we have ten operators with this kind of setup. Then, we have forty complete copies of the DASH data. Then, making it cheap for the operators to have a complete copy of the data is way better than having three official copies of data.