alexmerced/iceberg-starter

By alexmerced

Updated over 2 years ago

Image with Java/Scala/Spark and Script to quicly try Apache Iceberg Tables in Spark

Image

262

Repo With Dockerfile

Repo with Docker Compose File to Use Iceberg with Project Nessie (Git for Data)

Apache Iceberg Playground

This container has the following Installed

  • Java 11
  • Scala 2 & 3
  • Python 3
  • Spark

Start SparkSQL with Apache Iceberg

iceberg-init

Load sample data as a view

CREATE TEMPORARY VIEW my_data
USING csv 
OPTIONS (
  path 'sampledata/Worker_Coops.csv',
  header true,
  inferSchema true
);

Then to turn into a iceberg table

CREATE TABLE local.db.worker_coop USING iceberg AS (SELECT * FROM my_data);

Docker Pull Command

docker pull alexmerced/iceberg-starter