Stars
Manage an S3 website: sync, deliver via CloudFront, benefit from advanced S3 website features.
Scripts and samples to support Confluent Demos, Talks, and Blogs. Not all of the examples in this repository are kept up to date. For automated tutorials and QA'd code, see https://github.com/confl…
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Always know what to expect from your data.
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
The official home of the Presto distributed SQL query engine for big data
Dockerfile of Oracle Database Express Edition 11g Release 2
A series of DAGs/Workflows to help maintain the operation of Airflow
Max821214 / Spark-tutorial
Forked from kairen/learning-sparkimac Spark and Hadoop tutorial
「一段 Airflow 與資料工程的故事:談如何用 Python 追漫畫連載」一文的程式碼
In the Movie "-your name.-" (君の名は。, 你的名字) , "My Diary" of android version.