
Practicing Data Science

A collection of use cases.

By Tom Smith · Nov. 16, 2018 · News

It was great speaking with Rosaria Silipo, Principal Data Scientist at KNIME, during the company's fall summit. Rosaria is the editor of Practicing Data Science, a new book that highlights many different types of data science projects across multiple vertical industries.

There are many different types of data science projects: with or without labeled data; stopping at data wrangling or involving machine learning algorithms; predicting classes or predicting numbers; with unevenly distributed classes, with binary classes, or even with no examples of one of the classes; with structured or unstructured data; using past samples or remaining in the present; with real-time or near-real-time execution requirements or with acceptably slower performance; showing the results in shiny reports or hiding the nitty-gritty behind a neutral IT architecture; and, last but not least, with large budgets or no budget at all.

Rosaria has seen many of these kinds of projects and their data science nuances. With so much experience, and the mistakes that came with it, she wanted to share what she and her colleagues have learned. The book is a collection of data science case studies from past projects.

Use cases help to establish best practices for data science projects. Which algorithm to use depends on the problem you are trying to solve and the data you have available to solve it. If you have a problem without a labeled data set, you need to use an unsupervised model. Different use cases call for different models; no single model works for everything.
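As a rough illustration of the reasoning above, the choice can be sketched as a small decision function. The function name, its parameters, and the category strings are hypothetical and are not taken from the book or from KNIME; a real project would weigh many more factors.

```python
from typing import Optional


def suggest_model_family(has_labels: bool, target_kind: Optional[str] = None) -> str:
    """Return a rough model family for a project (illustrative only).

    has_labels  -- whether a labeled data set is available
    target_kind -- "class" or "number"; only relevant when labels exist
    """
    if not has_labels:
        # Without labeled examples, only unsupervised methods apply.
        return "unsupervised (e.g. clustering or anomaly detection)"
    if target_kind == "class":
        # Labeled data with a categorical target.
        return "supervised classification"
    if target_kind == "number":
        # Labeled data with a numeric target.
        return "supervised regression"
    raise ValueError("target_kind must be 'class' or 'number' when labels exist")
```

For example, `suggest_model_family(False)` points to an unsupervised approach, while `suggest_model_family(True, "number")` points to regression.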

This book includes project reviews from IoT, the financial industry, customer intelligence, social media, cybersecurity, and more. The use cases cover unbalanced, infrequent, and non-existent classes. Published case studies often offer no actionable next steps. This ebook includes actionable workflow examples, available on the KNIME EXAMPLES server, which are listed at the beginning of each section.

A complimentary download of the ebook is available to DZone readers using promo code DZONE-2018. The code expires December 31, 2019. KNIME plans to add more use cases over time to extend the lessons learned and best practices.
