Exploring Apache Superset as a Visualization Tool for Oracle R12
Apache SuperSet is an Open Source data visualization tool that can be used for graphical representation of data. A tutorial on how to use it.
Join the DZone community and get the full member experience.
Join For FreeApache SuperSet is an Open Source data visualization tool that can be used for graphical representation of data. Superset was initially created by AirBnB and later released to the Apache community. Apache Superset is developed in Python language and utilizes Flask Framework for all web interactions. Superset supports the majority of RDMBS via SQL Alchemy.
POC Environment:
- OS - Ubuntu 18 Linux 64 Bit.
- Python – 3.6.9
- Apache Superset v0.36.0.
- Data source - Oracle ERP R12
- Database version - Oracle 11g
The POC intends to build a dashboard to show the logistics demand of an organization. Various charts can be used to slice the data and create a dashboard to interpret the data
Step 1: Create Data Source
Configure development instance credentials in the data source page. As a prerequisite make sure python library CX_Oracle along with Oracle client are installed.
Add a new database and enter details as in the screenshot. Make sure “Expose in SQL Lab” and “Allow Multi Schema Metadata Fetch” is checked.
Step 2: Create Data Set
Here we will be creating MVIEW as a data source. One of the key issues is data accuracy. Make sure there is some data cleansing performed on the data set. In this scenario, the MVIEW has relevant data of all orders for the past few years with appropriate filters to qualify the data.
Step 3: Create Charts
Superset has many charts to choose from. The chart of interest for this POC is the Arc map. This map can be used to display a global snapshot of logistics demand. This is accomplished by loading latitude and longitude with postal codes and country combinations. For the map to work, we will need to set MAPBOX_API_KEY token in superset_config.py configuration file. The token can be requested at https://www.mapbox.com/ .
Add the chart to a dashboard with filter options.
Choosing appropriate filters, in this case, date range and distribution center, we can drill down to the required level of data granularity.
Conclusion
Apache Superset with its caching capability, ability to connect to most of RDBMS, and inbuilt security features makes it quite a versatile tool. Around 40 visualizations are available for slicing and dicing the data. With these features, Superset can definitely be considered as an alternate option available in the open-source market.
PS: Would like to try out Apache Superset with Apache Ignite. Let me know if I should!
Opinions expressed by DZone contributors are their own.
Trending
-
Comparing Cloud Hosting vs. Self Hosting
-
Design Patterns for Microservices: Ambassador, Anti-Corruption Layer, and Backends for Frontends
-
Competing Consumers With Spring Boot and Hazelcast
-
File Upload Security and Malware Protection
Comments