DZone
Thanks for visiting DZone today,
Edit Profile
  • Manage Email Subscriptions
  • How to Post to DZone
  • Article Submission Guidelines
Sign Out View Profile
  • Post an Article
  • Manage My Drafts
Over 2 million developers have joined DZone.
Log In / Join
Refcards Trend Reports
Events Video Library
Refcards
Trend Reports

Events

View Events Video Library

Zones

Culture and Methodologies Agile Career Development Methodologies Team Management
Data Engineering AI/ML Big Data Data Databases IoT
Software Design and Architecture Cloud Architecture Containers Integration Microservices Performance Security
Coding Frameworks Java JavaScript Languages Tools
Testing, Deployment, and Maintenance Deployment DevOps and CI/CD Maintenance Monitoring and Observability Testing, Tools, and Frameworks
Culture and Methodologies
Agile Career Development Methodologies Team Management
Data Engineering
AI/ML Big Data Data Databases IoT
Software Design and Architecture
Cloud Architecture Containers Integration Microservices Performance Security
Coding
Frameworks Java JavaScript Languages Tools
Testing, Deployment, and Maintenance
Deployment DevOps and CI/CD Maintenance Monitoring and Observability Testing, Tools, and Frameworks

Curious about the future of data-driven systems? Join our Data Engineering roundtable and learn how to build scalable data platforms.

Data Engineering: The industry has come a long way from organizing unstructured data to adopting today's modern data pipelines. See how.

Threat Detection: Learn core practices for managing security risks and vulnerabilities in your organization — don't regret those threats!

Managing API integrations: Assess your use case and needs — plus learn patterns for the design, build, and maintenance of your integrations.

Avatar

Jean-Georges Perrin

CEO & Founder at jgp.ai

New Lebanon, US

Joined Oct 2016

http://jgp.ai

About

Jean-Georges "jgp" Perrin is a technology consultant focusing on building innovative and modern data platforms, president of AIDA User Group, and author of Spark in Action, 2nd edition (Manning). He is passionate about software engineering and all things data. His latest endeavors bring him to more and more data engineering, data governance, industrialization of data science, and his favorite theme, Data Mesh. He is proud to have been recognized as a Lifetime IBM Champion. Jean-Georges shares over 25 years of experience in the IT industry as a presenter and participant at conferences and publishing articles in print and online media. His blog is visible at http://jgp.ai. He enjoys exploring Upstate New York and New England with his wife and kids when not immersed in IT, which he loves.

Stats

Reputation: 254
Pageviews: 108.2K
Articles: 2
Comments: 6
  • Articles
  • Comments

Articles

article thumbnail
Ingesting Data From Files With Apache Spark, Part 1
In this post, a data expert teaches us how to take in large data sets using Apache Spark.
April 8, 2019
· 15,297 Views · 7 Likes
article thumbnail
What Are Spark Checkpoints on Data Frames?
Checkpoints freeze the content of your data frames before you do something else. They're essential to keeping track of your data frames.
February 9, 2017
· 71,120 Views · 4 Likes

Comments

What Are Spark Checkpoints on Data Frames?

Jun 16, 2020 · Jean-Georges Perrin

Yeah, that's definitely not the use-case I see for chekpointing. Delta Lake (in the link above) can help.

What Are Spark Checkpoints on Data Frames?

Jun 15, 2020 · Jean-Georges Perrin

Scala? You're killing me... ;)


I am not sure your use-case is solvable using checkpoints. They are not checkpoints like in VM. Maybe you should look at Delta Lake. Look at https://livebook.manning.com/book/spark-in-action-second-edition/chapter-17?origin=product-toc&a_aid=jgp

What Are Spark Checkpoints on Data Frames?

Jun 09, 2020 · Jean-Georges Perrin

Look at my link down there, it should help!

What Are Spark Checkpoints on Data Frames?

Jun 08, 2020 · Jean-Georges Perrin

This article is mostly accurate, but you can also refer to: https://livebook.manning.com/book/spark-in-action-second-edition/chapter-16?a_aid=jgp

What Are Spark Checkpoints on Data Frames?

Jun 08, 2020 · Jean-Georges Perrin

Indeed, which may also be a security concern...

What Are Spark Checkpoints on Data Frames?

Jun 08, 2020 · Jean-Georges Perrin

should be good. remember that this will happen on the executor node, not the driver or the master...

User has been successfully modified

Failed to modify user

ABOUT US

  • About DZone
  • Support and feedback
  • Community research
  • Sitemap

ADVERTISE

  • Advertise with DZone

CONTRIBUTE ON DZONE

  • Article Submission Guidelines
  • Become a Contributor
  • Core Program
  • Visit the Writers' Zone

LEGAL

  • Terms of Service
  • Privacy Policy

CONTACT US

  • 3343 Perimeter Hill Drive
  • Suite 100
  • Nashville, TN 37211
  • support@dzone.com

Let's be friends: