Over a million developers have joined DZone.
{{announcement.body}}
{{announcement.title}}

Visualizing Six Million Files and Folders

DZone's Guide to

Visualizing Six Million Files and Folders

· Big Data Zone ·
Free Resource

Hortonworks Sandbox for HDP and HDF is your chance to get started on learning, developing, testing and trying out new features. Each download comes preconfigured with interactive tutorials, sample data and developments from the Apache community.

Each year there are nearly 300,000 of these in Federal Federal Civil Court, 1.3-1.6 million in Federal Bankruptcy Court, but this pales in comparison to state courts, which accept just over 100 million cases each year.

Even a small extract of these takes up a fair amount of space:


This is what a court docket looks like -
level4

This includes an actual document (PDF or html), xml metadata, although this varies case by case.

Divided up into groups of a half dozen, these feel manageable:

level3

Each of these is in one of 256 folders contains group of a half-dozen like the above listing:
level2

Each of these 256 folders is contained within another group of 256 folders:
level1

And that’s just for ~500k cases. Imagine how much paper is sitting out there in the world.

Hortonworks Community Connection (HCC) is an online collaboration destination for developers, DevOps, customers and partners to get answers to questions, collaborate on technical articles and share code examples from GitHub.  Join the discussion.

Topics:

Published at DZone with permission of

Opinions expressed by DZone contributors are their own.

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}