Remote Data Mining And Management Job In Data Science And Analytics

Multi-class classification prediction algorithm task

Find more Data Mining And Management remote jobs posted recently Worldwide

Multi-class classification task
Task is to predict which water pumps are going to continue working, which are going to need repairs and which are going to fail.
(removed by Toogit admin)
Algorithms to use:
Regression, Decision Trees (Bagging, RF, XGBoost), Naive Bayes and KNN, KDE, PCA, LDA/QDA and SVM.
You must combine models by using ensembles. Using ensembles is about learning a target function by training a number of individual models and combining their predictions.
Python notebook must be used.
Feature Engineering - remove outliers, remove useless features, look for NAs and missing data and impute them, create new features
Any library but Scikit-Learn (Python) preferred.
!Reference Sources!
A detailed explanation of the machine learning process followed to achieve the results. Every cell - what is it looking for, why is this step carried out, what will it achieve, all results explained.

Minimum public score: 0.8262
Time to completion 48 hours after accepting job.
About the recuiter
Member since Mar 14, 2020
Mohamed Rashad
from Nawakshut, Mauritania

Skills & Expertise Required

Scikit-Learn Data Science Machine Learning 

Candidate shortlisted and hiredHiring open till - Oct 8, 2021

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$69.55

Cost

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Quant needed to test a long/short stock trading signal in Excel and then Python

The first deliverable is a bug-free spreadsheet that shows if the signal offers a good risk-adjusted return on annual data without even balancing the portfolio by sector or industry. If the annual model performs well, then test a monthly model. If...read more

Speaker Diarization

We require a script/program on any simple interface to separate voices of multiple people from a recording.

Steps should be:

1. Upload a voice recording of multiple speakers talking in English (3 - 5 Minutes)
2. Perform s...read more

Correlation Analysis

I am looking for a capable individual that can put survey data into correlation with public company stock performance. The survey data has predictive implications, hence I am trying to link the forecast to the actual stock performance or another meas...read more

IBM Planning Analytics developer needed

Create a replica of already made demos available on IBM Planning Analytics. We are looking for a developer who can copy already made solutions from IBM demos and attach that with our data.

Data Analytics

Hey,

I need someone to find out the commonalities between a list of leads.

We have a list of 25 leads in the Home Service Industry (roofers, mason, general construction), and would like to know the similarities between them, to adju...read more