Opportunities in IoT Data Management With a Database on the Edge
Want to learn more about storing data in something other than the cloud? Check out this post on keeping data on the Edge.
Join the DZone community and get the full member experience.Join For Free
We are in the early stages of a data management revolution brought on by the Internet of Things. One IoT impact currently being felt by organizations is the massive amounts of data produced from sensors, machines, and devices. Cisco predicts that by 2020 the explosion of IoT will generate 600 Zettabytes annually (1 ZB=1 trillion GBs) of data. While this is a large number, it is only expected to continue to grow in mountainous volumes as more and more aspects of industry, business, healthcare, and everyday life become connected through IoT devices.
The question is what to do with all this data? This is where the rise of edge computing is disrupting the status quo. Up until now, the answer to “what do we do with all this data?” for most has been to send all the raw data to the cloud, where it can be stored, analyzed, and mined for nuggets of information. With the recent popularity of the cloud, it seemed like the obvious solution. However, sending all data to the cloud is not the right choice for every system, and, now, it is no longer the only option.
The latest options available are databases that operate on the edge — a point much closer to the sensors than the cloud. Lowering the distance data has to travel has an innate benefit to performance. This means that the time to mine raw data for information gold is accelerated. Keep in mind, though, true performance gains go beyond simple data relocation. Businesses cannot place just any database on the edge and get more gold, faster. It needs to be one that is designed specifically to operate on the edge.
There are numerous benefits to deploying and operating a database on the edge. Improvements can be made to data governance, for example, by allowing businesses to better manage the data that sensors collect and decide where it should be stored — the cloud or the edge. In turn, companies can save on reduced bandwidth and cloud storage costs.
More importantly, the emerging ability to make mission-critical decisions on the edge based on the data that is collected by IoT sensors is a key component to making real-time, mission-critical decisions. The edge hosts data persistence, which can only be achieved with a database that guarantees persistence.
The ability to reliably persist data on the edge allows businesses to maintain a high level of data quality and significantly reduce the amount of time needed during a decision-making process. Data quality increases when there is less opportunity for data corruption (loss, change, manipulation, etc.) to occur during communication with the cloud. By having a database on the edge, the data is stored closest to the data’s origin, making persistence and security immediate. With lowered latency, due to not sending data to and from the cloud, new business opportunities are possible for tools to prevent catastrophic equipment failure, enhance product quality control, and protect human life.
Let’s be clear — we are not suggesting to avoid using the cloud. An ideal database is one that is fully functional on the edge, but that also works nicely with the cloud, especially providing the choice to transmit some or all of the data. The best practice says to have a backup and recovery plan that includes onsite and offsite options.
Published at DZone with permission of Brad Thomas, DZone MVB. See the original article here.
Opinions expressed by DZone contributors are their own.