Find more Data Mining And Management Remote Jobs posted recently Worldwide

Required Natural Language Toolkit (NLTK),Python Scikit-Learn,Natural Language Processing,Python,Machine Learning freelancer for Large-Scale Topic Model / Sparse Matrix Construction job

Posted at - May 27, 2023

I have a larger corpus (-38K documents) that I am attempting to run topic models over. The problem right now is that creating a corpus via packaged solutions, like Textacy, puts too much strain on the memory for the doc-term-matrix. I believe there are more efficient solutions, perhaps collapsing each document into a Counter object and incrementally feeding it into a sparse matrix, but I am looking for someone with more experience working with larger, data-intensive memory sets and scientific computing to create code that can easily be transported over. I can provide the full text files as a corpus, and am available to discuss the project on an ongoing basis.

About the recuiterMember since Sep 6, 2017 Pallavi Ghosh
from Delhi, India

Skills & Expertise Required

Natural Language Toolkit (NLTK) Python Scikit-Learn Natural Language Processing Python Machine Learning 

Candidate shortlisted and hired
Hiring open till - Jul 15, 2024

Work from Anywhere
40 hrs / week
Hourly Type
Remote Job

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Apply on more work from home jobs posted in Data Mining And Management category.

Related Jobs

Latest In Natural Language Toolkit (NLTK) Jobs

Latest In Python Scikit-Learn Jobs

Latest In Natural Language Processing Jobs

Latest In Python Jobs

Latest In Machine Learning Jobs