Remote Data Mining And Management Job In Data Science And Analytics

Multi-class classification prediction algorithm task

Find more Data Mining And Management remote jobs posted recently Worldwide

Multi-class classification task
Task is to predict which water pumps are going to continue working, which are going to need repairs and which are going to fail.
(removed by Toogit admin)
Algorithms to use:
Regression, Decision Trees (Bagging, RF, XGBoost), Naive Bayes and KNN, KDE, PCA, LDA/QDA and SVM.
You must combine models by using ensembles. Using ensembles is about learning a target function by training a number of individual models and combining their predictions.
Python notebook must be used.
Feature Engineering - remove outliers, remove useless features, look for NAs and missing data and impute them, create new features
Any library but Scikit-Learn (Python) preferred.
!Reference Sources!
A detailed explanation of the machine learning process followed to achieve the results. Every cell - what is it looking for, why is this step carried out, what will it achieve, all results explained.

Minimum public score: 0.8262
Time to completion 48 hours after accepting job.
About the recuiter
Member since Mar 14, 2020
Pankaj Diwan
from Heilongjiang, China

Skills & Expertise Required

Scikit-Learn Data Science Machine Learning 

Open for hiringApply before - Sep 4, 2024

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$95.83

Cost

Offer to work on this project closes in 109 days!
Are you interested in this Opportunity?

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Interns for Machine Learning Team

We are looking to bring on 2 new persons into our Data Teams. ** If you are reapplying because you were not shortlisted prior please let us know **

We prefer people who are passionate about AI / Machine Learning and have some domain experien...read more

Machine Learning expert

Have multiple projects with Tensorflow. My staff is overwhelmed with the amount of work. Need a senior Tensorflow expert wtih CNNs, DNNs and work with embeddings.

Need some experience with Google Cloud Machine Learning. Long term work.....read more

Expert Data Scientist - LSTM/RNN

There is a lag issue with time series lstm prediction models we encounter ie the time series prediction is highly weighted by the last data point to spur out forward t+1 prediction. Need to resolve earliest possible.

fuzzy matching exercise - i need to fuzzy match names from a land records database to an existing Table I have created.

I am an academic working on a research project, that uses data from a land records database. I need to standardize the names in the database and hence i need to fuzzy match names to one another (i.e. bnk of America is the same as Bank of America)