K2 Data Science & Engineering
Latest news about the curriculum and alumni
Ty Shaikh
Sharing tutorials about dev and data.
Latest Posts
Running Elasticsearch & Kibana using Docker
I’m going to show how to use Docker to quickly get started with a development environment for Elasticsearch & Kibana.
Coding Interview — Switching Light Bulbs
This problem has a lot of different variations, but the one we will cover here is the following:
Running Kafka using Docker
I’m going to show how to use Docker to quickly get started with a development environment for Kafka.
Coding Interview — Implement Map and Filter
Map and filter are common functional programming methods that you’ve most likely used when coding. They are both functions that take in a…
Streaming Twitter Data with Tweepy
With almost a million tweets being published every day, Twitter is one of the best free streaming data sources. There is an enormous wealth…
Coding Interview — Two Sum Problem
The two sum problem is a common interview question, and it is a variation of the subset sum problem. There is a popular dynamic programming…
Running PySpark with Jupyter using Docker
I’m going to show how to use Docker to quickly get started with a development environment for PySpark.
Batch Processing — Apache Spark
Let’s talk about batch processing and introduce the Apache Spark framework.
Batch Processing — Hadoop Ecosystem
Let’s review the major components of the Hadoop Ecosystem.
Batch Processing — MapReduce Paradigm
MapReduce is one of the most important concepts for big data processing. It allows for truly scalable parallel processing with no shared…