Bananas dataset.
An artificial dataset where instances belongs to several clusters with a banana shape.
There are two attributes that correspond to the x and y axis, respectively.
Name Bananas
Task Binary classification
Samples 5,300
Features 2
Sparse False
Path /Users/cedrickulbach/Documents/Projects/deep-river/.venv/lib/python3.10/site-packages/river/datasets/banana.zip
Elec2
Electricity prices in New South Wales.
This is a binary classification task, where the goal is to predict if the price of electricity
will go up or down.
This data was collected from the Australian New South Wales Electricity Market. In this market,
prices are not fixed and are affected by demand and supply of the market. They are set every
five minutes. Electricity transfers to/from the neighboring state of Victoria were done to
alleviate fluctuations.
Name Elec2
Task Binary classification
Samples 45,312
Features 8
Sparse False
Path /Users/cedrickulbach/river_data/Elec2/electricity.csv
URL https://maxhalford.github.io/files/datasets/electricity.zip
Size 2.95 MiB
Downloaded True
Phishing
Phishing websites.
This dataset contains features from web pages that are classified as phishing or not.
Name Phishing
Task Binary classification
Samples 1,250
Features 9
Sparse False
Path /Users/cedrickulbach/Documents/Projects/deep-river/.venv/lib/python3.10/site-packages/river/datasets/phishing.csv.gz
Chick weights along time.
The stream contains 578 items and 3 features. The goal is to predict the weight of each chick
along time, according to the diet the chick is on. The data is ordered by time and then by
chick.
Name ChickWeights
Task Regression
Samples 578
Features 3
Sparse False
Path /Users/cedrickulbach/Documents/Projects/deep-river/.venv/lib/python3.10/site-packages/river/datasets/chick-weights.csv
TrumpApproval
Donald Trump approval ratings.
This dataset was obtained by reshaping the data used by FiveThirtyEight for analyzing Donald
Trump's approval ratings. It contains 5 features, which are approval ratings collected by
5 polling agencies. The target is the approval rating from FiveThirtyEight's model. The goal of
this task is to see if we can reproduce FiveThirtyEight's model.
Name TrumpApproval
Task Regression
Samples 1,001
Features 6
Sparse False
Path /Users/cedrickulbach/Documents/Projects/deep-river/.venv/lib/python3.10/site-packages/river/datasets/trump_approval.csv.gz