Open in app

Sign In

Write

Sign In

Albert Um
Albert Um

114 Followers

Home

About

May 17, 2021

Plotting Bollinger Bands with Plotly Graph Objects

For this blog, I will demonstrate how to plot Bollinger Bands using Plotly. Bollinger bands contain upper/lower bounds(±2 standard deviations) from the moving average of stock data. I will break down this tutorial blog into three steps: Obtain Data Use AlphaVantage for IBM historical data Calculate Moving Averages and Standard…

Bollinger Bands

3 min read

Plotting Bollinger Bands with Plotly Graph Objects
Plotting Bollinger Bands with Plotly Graph Objects
Bollinger Bands

3 min read


May 9, 2021

Using Stock Data for Classification Problem: Action

This blog will demonstrate a simple way to frame financial stock data into a sequence classification problem. The business case is to, given historical stock data, create a model that will predict whether a trade(action) will be ‘Positive’ or ‘Negative.’ I can turn the business case into an ML problem…

Stock Market

5 min read

Using Stock Data for Classification Problem: Action
Using Stock Data for Classification Problem: Action
Stock Market

5 min read


Apr 28, 2021

Load Data CSV into MySQL

For this blog, I will import a CSV file to a MySQL server(using MAMP) to create a practice platform for SQL statements. More specifically, I will import the titanic dataset(train.csv), which I can find here (https://www.kaggle.com/c/titanic). This tutorial will be broken down into three steps: Installing MAMP Installing and editing configuration…

Sql

4 min read

Load Data CSV into MySQL
Load Data CSV into MySQL
Sql

4 min read


Apr 26, 2021

Chi-Squared Test for Independence

Pearson’s chi-squared test for independence is used to test whether there is an association between categorical variables by seeing if there is a statistical difference between the expected counts against the observed. The test uses the aggregated counts of the categorical variables that summarize the data into a table called…

Chi Square Test

4 min read

Chi-Squared Test for Independence
Chi-Squared Test for Independence
Chi Square Test

4 min read


Apr 15, 2021

Summary of Agile: Scrum

Agile is an approach to project management that aims always to have a working product while continuously improving in short increments. Instead of delivering a product in the end, as in the case for Waterfall, Agile looks to provide a minimum viable product(MVP) and improve on it iteratively based on…

Agile

6 min read

Summary of Agile: Scrum
Summary of Agile: Scrum
Agile

6 min read


Apr 9, 2021

Suez Canal Blockage: Queue Backlog with Sentinel-1 SAR

On March 21, 2021, a massive container ship, Ever Given, was found stuck in the Suez Canal. The Suez Canal is an important trade route as it connects a water path between Europe and Asia without going around Africa. …

Suez Canal

4 min read

Suez Canal Blockage: Queue Backlog with Sentinel-1 SAR
Suez Canal Blockage: Queue Backlog with Sentinel-1 SAR
Suez Canal

4 min read


Apr 5, 2021

Classification: Class Imbalance

For this blog, I will demonstrate three techniques to handle class imbalance using NYS PUMS(Public Use Microdata Sample) Census data. (You can find the dataset here.) Training classification models with imbalanced classes can lead to the model biasedly predicting the majority class. Class Imbalance Undersampling Oversampling SMOTE-NC A pseudo-objective is to classify…

Classification

3 min read

Classification: Class Imbalance
Classification: Class Imbalance
Classification

3 min read


Mar 28, 2021

Pump it Up: Data Mining the Water Table — Population Analysis

For this blog, I will run a hypothesis test if the population count around a well affects its functionality. I will be using the dataset from the Tanzania Water Pump Challenge hosted by Data-Driven and the WorldPop population estimations to feature engineer population estimations in a 1km, 5km, and 10km…

Tanzania

4 min read

Pump it Up: Data Mining the Water Table — Population Analysis
Pump it Up: Data Mining the Water Table — Population Analysis
Tanzania

4 min read


Mar 21, 2021

L1, L2 Regularization in XGBoost Regression

Regularization in gradient boosted regression trees are applied to the leaf values and not the feature coefficients like in lasso/ridge regression. For this blog, I will break down the explanation into three steps: Lasso & Ridge Regression - A brief re-cap of lasso and ridge regression Gradient Boosted Regression Trees …

Xgboost

4 min read

L1, L2 Regularization in XGBoost Regression
L1, L2 Regularization in XGBoost Regression
Xgboost

4 min read


Mar 15, 2021

Absorbing Markov Chain: Limiting Matrix

I recently came across an interesting problem that required some understanding of Absorbing Markov Chains. The objective to calculate the percentages(in the long run) of ending states given an initial state. The input is a frequency table where each state has counts of transitions based on its index. …

Markov Chains

3 min read

Absorbing Markov Chain: Limiting Matrix
Absorbing Markov Chain: Limiting Matrix
Markov Chains

3 min read

Albert Um

Albert Um

114 Followers

Hello! My name is Albert Um.

Following
  • Arthur Mello

    Arthur Mello

  • Dr. Alessandro Crimi

    Dr. Alessandro Crimi

  • João Paulo Figueira

    João Paulo Figueira

  • Sakshi Kharbanda, Ph.D.

    Sakshi Kharbanda, Ph.D.

  • Johnny L

    Johnny L

See all (22)

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Text to speech