K2 Data Science & Engineering

Latest news about the curriculum and alumni

Editors
Ty Shaikh

Sharing tutorials about dev and data.

Latest Posts

Running Elasticsearch & Kibana using Docker

I’m going to show how to use Docker to quickly get started with a development environment for Elasticsearch & Kibana.

Coding Interview — Switching Light Bulbs

This problem has a lot of different variations, but the one we will cover here is the following:

Running Kafka using Docker

I’m going to show how to use Docker to quickly get started with a development environment for Kafka.

Coding Interview — Implement Map and Filter

Map and filter are common functional programming methods that you’ve most likely used when coding. They are both functions that take in a…

Streaming Twitter Data with Tweepy

With almost a million tweets being published everyday, Twitter is one of the best free streaming data sources. There is an enormous wealth…

Coding Interview —Two Sum Problem

The two sum problem is a common interview question, and it is a variation of the subset sum problem. There is a popular dynamic programming…

Running PySpark with Jupyter using Docker

I’m going to show how to use Docker to quickly get started with a development environment for PySpark.

Batch Processing — Apache Spark

Let’s talk about batch processing and introduce the Apache Spark framework.

Batch Processing — Hadoop Ecosystem

Let’s review the major components of the Hadoop Ecosystem.

Batch Processing — MapReduce Paradigm

MapReduce is one of the most important concepts for big data processing. It allows for truly scalable parallel processing with no shared…

Reviews

No reviews, submit yours below.
Add review

Login to submit your review.