R : Exploring Data for Machine Learning Modeling


These are my notes on the Practical Machine Learning course (Week 2: Plotting Predictors - Tutorial).

When exploring data for Machine Learning, we're looking for:

  • imbalanced outcomes/predictors
  • outliers
  • groups of outcome points not explained by any of the predictors
  • skewed variables (that need to be transformed)
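
As a rough illustration of three of these checks (imbalance, outliers, skew), here is a minimal sketch in plain Python rather than the R used in the course. The helper names and the toy values are made up for this example and are not taken from the Wage dataset. The 1.5 × IQR outlier rule is one common convention, assumed here and not prescribed by the course.

```python
from collections import Counter
import statistics


def class_shares(labels):
    """Share of each class; very unequal shares hint at an imbalanced outcome."""
    counts = Counter(labels)
    total = len(labels)
    return {label: count / total for label, count in counts.items()}


def iqr_outliers(values, k=1.5):
    """Values lying more than k * IQR outside the first/third quartile."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


def skewness(values):
    """Moment-based sample skewness; far from 0 suggests a transform (e.g. log)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Hypothetical toy data, for illustration only.
toy_labels = ["a", "a", "a", "b"]
toy_values = [10, 12, 11, 13, 12, 11, 10, 12, 95]

shares = class_shares(toy_labels)
outliers = iqr_outliers(toy_values)
skew = skewness(toy_values)
print(shares, outliers, round(skew, 2))
```

The third item in the list (groups of outcome points that no predictor explains) is usually spotted visually in plots, so it is not covered by this sketch.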

We'll use the Wage dataset …





River Crossing Puzzle


A river crossing puzzle is a type of transport puzzle in which the object is to carry items from one river bank to another. The difficulty of the puzzle may arise from restrictions on which or how many items can be transported at the same time, or from which or …



R : Variance Inflation


These are my notes on the swirl course Regression Models: Overfitting and Underfitting.

Definition

A variance inflation factor (VIF) is a ratio of estimated variances, the variance due to including the ith regressor, divided by that due to including a corresponding ideal regressor which is uncorrelated with the others. VIF is …
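
To make the definition concrete, here is a minimal sketch in plain Python rather than the R used in the swirl course. It relies on the standard identity VIF_i = 1 / (1 - R_i^2), where R_i^2 comes from regressing the ith regressor on the others. With only two regressors, R_i^2 is the squared Pearson correlation between them, which is the case the sketch handles. The function names and the toy values are assumptions made for this example.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def vif_two_regressors(x1, x2):
    """VIF of either regressor in a model with exactly two regressors.

    Here R_i^2 equals r^2, so VIF = 1 / (1 - r^2).
    """
    r = pearson_r(x1, x2)
    return 1.0 / (1.0 - r * r)


# Hypothetical toy regressors, for illustration only.
x1 = [1.0, 2.0, 3.0, 4.0, 5.0]
x2_uncorrelated = [2.0, -1.0, -2.0, -1.0, 2.0]  # zero correlation with x1
x2_correlated = [1.0, 2.0, 3.0, 4.0, 6.0]       # almost collinear with x1

print(vif_two_regressors(x1, x2_uncorrelated))  # the ideal case: no inflation
print(vif_two_regressors(x1, x2_correlated))    # strongly inflated variance
```

A VIF of 1 corresponds to the ideal uncorrelated regressor in the definition. The closer the regressors are to collinear, the larger the ratio becomes.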





Kanban Principles


Kanban, in the context of software development, means a visual process-management system that tells what to produce, when to produce it, and how much to produce. It is inspired by the Toyota Production System and by Lean manufacturing.

The core properties of a Kanban board are:

  • visualize the work in the workflow
  • limit the work …




Git Workflows vs "All to Master"


A basic git workflow is the feature branch workflow. All development of a feature (or task, or story) takes place in a dedicated branch (the feature branch).
The story/feature branch exists as long as the feature is in development.
At the end of the development, the branch is usually submitted …
