Remote Data Mining And Management Job In Data Science And Analytics

Large-Scale Topic Model / Sparse Matrix Construction


I have a large corpus (~38K documents) that I am attempting to run topic models over. The problem right now is that building the corpus with packaged solutions such as Textacy puts too much strain on memory when constructing the doc-term matrix. I believe there are more efficient approaches, perhaps collapsing each document into a Counter object and incrementally feeding it into a sparse matrix, but I am looking for someone with more experience in scientific computing on large, memory-intensive data sets to write code that can easily be ported over. I can provide the full text files as a corpus, and am available to discuss the project on an ongoing basis.
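To make the Counter idea concrete, here is a minimal sketch in Python, assuming NumPy and SciPy are available. The function names (`tokenize`, `build_doc_term_matrix`) and the simple regex tokenizer are illustrative assumptions, not part of Textacy or any existing code for this project. It streams documents one at a time, collapses each into a Counter, and appends the counts to the three flat arrays of a CSR sparse matrix, so the dense doc-term matrix is never materialized.

```python
from array import array
from collections import Counter
import re

import numpy as np
from scipy.sparse import csr_matrix

# Hypothetical tokenizer: lowercase alphanumeric runs. Swap in a real one as needed.
TOKEN_RE = re.compile(r"[a-z0-9]+")


def tokenize(text):
    return TOKEN_RE.findall(text.lower())


def build_doc_term_matrix(docs):
    """Build a CSR doc-term matrix from an iterable of document strings.

    `docs` may be a generator, so only one document's text is held in
    memory at a time. Returns (matrix, vocab), where vocab maps each
    term to its column index.
    """
    vocab = {}
    indices = array("i")      # column index of every nonzero entry
    data = array("i")         # term count of every nonzero entry
    indptr = array("q", [0])  # row boundaries into indices/data

    for text in docs:
        counts = Counter(tokenize(text))
        for term, n in counts.items():
            # Assign the next free column to terms not seen before.
            indices.append(vocab.setdefault(term, len(vocab)))
            data.append(n)
        indptr.append(len(indices))

    matrix = csr_matrix(
        (
            np.asarray(data, dtype=np.int32),
            np.asarray(indices, dtype=np.int32),
            np.asarray(indptr, dtype=np.int64),
        ),
        shape=(len(indptr) - 1, len(vocab)),
    )
    matrix.sort_indices()  # column indices within each row arrive unsorted
    return matrix, vocab


if __name__ == "__main__":
    sample = ["the cat sat", "the cat and the dog"]
    dtm, vocab = build_doc_term_matrix(sample)
    print(dtm.shape, dtm.nnz)
```

In practice `docs` could be a generator that opens and reads one text file at a time. Memory use then grows with the number of nonzero (document, term) pairs rather than with documents times vocabulary size, and the resulting sparse matrix can be handed to topic-model implementations that accept SciPy sparse input.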
About the recruiter
Member since Mar 14, 2020
Riki Nanda Put
from Oaxaca, Mexico

Candidate shortlisted and hired
Hiring open till Jun 26, 2022

Work from Anywhere

40 hrs / week

Hourly Type

Remote Job

$19.45

Cost


Similar Projects

Linux Administrator

Databerry, an innovator in web solutions and e-commerce communications, is looking for a Senior Administrator with enterprise knowledge: an experienced Infrastructure Administrator who can bring their automation passion to our company...

NLP and Machine Learning Expert

I need a training tool improved and some features added. It uses the Lesk algorithm.

Simple Neural Network using Encog in Java

I am looking for someone to create a simple neural network using Encog in Java. You will be given a set of words (e.g. auto=car, car=car, automobilis=car), and the neural network must recognize them from a given input, e.g. the input given is...

Chrome Extension Application With Machine Learning

Looking for someone who is great with Chrome extensions and also machine learning.

I do not want to go into much detail but if you are interested then I would love for you to help with this project.

Thank you,

Data Science Pilot using GANS technique

Looking for an experienced data scientist who can put together a simple GAN demonstration model using data for forecasting sales. After the pilot we will move to optimization, etc.