Hadoopsters

Tutorials and Guides for Processing Big Data

Author: Landon Robinson

I write about big data.
  • Spark

How to See Record Count Per Partition in a Spark DataFrame (i.e. Find Skew)

  • by Landon Robinson
  • Posted on September 10, 2020
  • Cassandra

How to Load Data from Cassandra into Hadoop using Spark

  • by Landon Robinson
  • Posted on June 27, 2019
  • Spark

How to Control File Count, Reducers and Partitions in Spark and Spark SQL

  • by Landon Robinson
  • Posted on June 22, 2019 (updated October 20, 2020)
  • Spark

Our Spark + AI Summit 2019 Talks are Now Available Online

  • by Landon Robinson
  • Posted on May 27, 2019 (updated October 19, 2020)
  • Spark

How to Override a Spark Dependency in Client or Cluster Mode

  • by Landon Robinson
  • Posted on May 8, 2019 (updated June 21, 2019)
  • Spark

We are Speaking at Spark + AI Summit 2019!

  • by Landon Robinson
  • Posted on April 18, 2019 (updated October 20, 2020)
  • Hive

How Random Sampling in Hive Works, And How to Use It

  • by Landon Robinson
  • Posted on February 4, 2018 (updated October 20, 2020)
  • Hive

How to Build Optimal Hive Tables Using ORC, Partitions and Metastore Statistics

  • by Landon Robinson
  • Posted on December 19, 2017 (updated June 19, 2019)
  • Full Tutorials

How to Join Static Data with Streaming Data (DStream) in Spark

  • by Landon Robinson
  • Posted on November 26, 2017 (updated June 21, 2019)
  • Full Tutorials

How to Write ORC Files and Hive Partitions in Spark

  • by Landon Robinson
  • Posted on September 1, 2017 (updated June 21, 2019)

Contributors

  • Craig Covey
  • James Barney
  • Landon Robinson
