top of page

Zeyu Yan
Nov 17, 20223 min read
Data Science in Drilling - Episode 30
RDDs in Spark - Part II
60 views
0 comments

Zeyu Yan
Nov 10, 20224 min read
Data Science in Drilling - Episode 29
RDDs in Spark - Part I
35 views
0 comments

Zeyu Yan
Nov 3, 20223 min read
Data Science in Drilling - Episode 28
Schemas in Spark DataFrame
20 views
0 comments

Zeyu Yan
Oct 20, 20224 min read
Data Science in Drilling - Episode 27
DataFrame Creation and Display
15 views
0 comments

Zeyu Yan
Oct 13, 20223 min read
Data Science in Drilling - Episode 26
Basic Missing Value Handling in PySpark DataFrames
35 views
0 comments

Zeyu Yan
Oct 6, 20223 min read
Data Science in Drilling - Episode 25
Functions, Order By and Datetime in PySpark DataFrames
34 views
0 comments

Zeyu Yan
Sep 29, 20223 min read
Data Science in Drilling - Episode 24
Data Filtering and Aggregation in PySpark DataFrames
58 views
0 comments

Zeyu Yan
Sep 22, 20223 min read
Data Science in Drilling - Episode 23
Introduction to the Upcoming Spark Series
30 views
0 comments

Zeyu Yan
Sep 15, 20223 min read
Data Science in Drilling - Episode 22
Compress File in Memory and Upload to AWS S3
18 views
0 comments

Zeyu Yan
Sep 8, 20224 min read
Data Science in Drilling - Episode 21
Hugging Face Transformer's Trainer API
22 views
0 comments

Zeyu Yan
Sep 1, 20224 min read
Data Science in Drilling - Episode 20
Sentiment Analysis Using Hugging Face Transformer
25 views
0 comments

Zeyu Yan
Aug 23, 20225 min read
Data Science in Drilling - Episode 19
Understanding Python's Class Better
22 views
0 comments

Zeyu Yan
Aug 16, 20227 min read
Data Science in Drilling - Episode 18
How to Correctly Apply One-Hot Encoding?
38 views
0 comments

Zeyu Yan
Aug 9, 20223 min read
Data Science in Drilling - Episode 17
Can We Use Python’ Async IO in AWS Lambda Function?
22 views
0 comments

Zeyu Yan
Aug 2, 20224 min read
Data Science in Drilling - Episode 16
AutoML with AutoGluon - A Soft Introduction
29 views
0 comments

Zeyu Yan
Jul 26, 20225 min read
Data Science in Drilling - Episode 15
Manipulating Redis with Python - Part II
24 views
0 comments

Zeyu Yan
Jul 19, 20226 min read
Data Science in Drilling - Episode 14
Basic Missing Data Imputations and Limitations
47 views
0 comments

Zeyu Yan
Jul 12, 20224 min read
Data Science in Drilling - Episode 13
Manipulating Redis with Python - Part I
25 views
0 comments

Zeyu Yan
Jul 5, 20225 min read
Data Science in Drilling - Episode 12
Intro to Python's Async IO - Part I
24 views
0 comments

Zeyu Yan
Jun 28, 20224 min read
Data Science in Drilling - Episode 11
Create and Deploy Customized AWS Lambda Functions Through Docker
23 views
0 comments
bottom of page