GitHub - riiddhii28/dmbi

Algorithm Overview

Algorithm	Tab to Go	Select From
Decision Tree (J48)	Classify	Trees → J48
Bayesian Classifier (Naive Bayes)	Classify	Bayes → NaiveBayes
SVM (SMO)	Classify	Functions → SMO
Random Forest	Classify	Trees → RandomForest
Adaboost (LogitBoost)	Classify	Meta → LogitBoost
Backpropagation (Multilayer Perceptron)	Classify	Functions → MultilayerPerceptron
K-Means Clustering	Cluster	Clusterer → SimpleKMeans
BIRCH Clustering	Cluster	Clusterer → BIRCH
DBSCAN Clustering	Cluster	Clusterer → DBSCAN
CLIQUE Clustering	Cluster	Clusterer → CLIQUE
Apriori (Association Rules)	Associate	Associate → Apriori
FP-Growth (Frequent Pattern Mining)	Associate	Associate → FP-Growth

The Classify tab is for classification tasks (e.g., J48, NaiveBayes, SMO).
The Cluster tab is for clustering tasks (e.g., KMeans, DBSCAN, BIRCH).
The Associate tab is for association rule mining and frequent pattern mining (e.g., Apriori, FP-Growth).

For all classification and clustering tasks, dataset.csv is used. (a simple dataset with columns: ID, Age, Income, Student, CreditRating, BuysComputer).
For Apriori and FP-Growth, new_dataset.csv dataset containing transactional data for association rule mining is used.

pip install numpy pandas (imp)

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
adaboost.py		adaboost.py
apriori.py		apriori.py
back.py		back.py
bbn.py		bbn.py
birch.py		birch.py
clique.py		clique.py
dataset.csv		dataset.csv
dbscan.py		dbscan.py
decision.py		decision.py
flink.pdf		flink.pdf
fp.py		fp.py
kafka.pdf		kafka.pdf
kmeans.py		kmeans.py
naive.py		naive.py
new_dataset.csv		new_dataset.csv
rf.py		rf.py
schema.pdf		schema.pdf
svm.py		svm.py