Chen Ziyu
-
A Brief Introduction to Kafka
There’s no denying that we have already ushered in the era of big data. An enormous... -
A Brief Introduction to the Inner Working of MapReduce
As a data engineer, you probably have heard about Hadoop. It is one of the most popular fr... -
ETL vs. ELT: Pick the Most Suitable Data Integration Method for Your Project
As a data engineer, you probably have heard of the data integration methodology called ETL... -
All You Need to Know about Lazy Evaluation in Spark
Few would disagree that the word “lazy” has a negative connotation. We usually... -
Parquet Files: Smaller and Faster than CSV
If you have been in the world of big data long enough, you probably have heard about Parqu... -
Actions, Narrow Transformations, and Wide Transformations
Hi! My name is Ziyu Chen. I am a full-stack engineer at Colorkrew. I love learning and wri...