Soft Eng Data Analysis

  1. GitHub: Data Analysis in Software Engineering
    Course based on mainly on R and RStudio (on GitHub)

Weka - Extensions

Weka filters

Weka Packages for dealing with imbalance
  1. RUS (Random Under Sampling) Source code: [Github]
  2. ROS (Random Over Sampling) Source code: [Github]
  3. ENN Source code: [Github]
  4. AllkNN Source code: [Github]
  5. Remove Duplicated Instances Source code: [Github]
    It can remove duplicated instances (or return duplicates) with or without considering the class attribute

Weka classifiers

Weka Packages for dealing with imbalance
  1. SMOTEBoost Source code: [Github]
  2. RUSBoost Source code: [Github]

Recommender systems

Example with Apache Mahout
  1. Recommender systems example using Apache Mahout