Remote Data Mining And Management Job In Data Science And Analytics

Topic Modelling, Website classifying (200$)

Find more Data Mining And Management remote jobs posted recently Worldwide

Topic Modelling, Website classifying (200$)

Hello! Thank you for having interest in this topic.

Recently, Im struggling to classify random websites properly.

When the tool gets to be made. itd be applying on thousands / millions of websites.

After posting previous project, I happened to know that one of the general method is developing NLP model to classify certain websites.

and for that, we need to have manual classified/processed data about the target sort website.

though, in my thinking thats more like picking website category one by one.

What Ive really wanted is, something automatically-classifying from the judging/defining what kind of website it is.

So, something likewise this. automatically calculating similarity index between certain websites. (probably by NLP tactics)

and then, cut each of part between websites following the similarity index number.

If certain websites get to be judged similarily each other, then we would be able to bind them all automatically into one category.

without human judgement putting manually processed examples.

So this is what the project should be eventually like.
If you really really wonder the very original purpose of this project, is to have a fresh view of what kind of websites could be existing.

And about this website classification I really do wonder if theres some work have been done/completed before. Im sure there would be one.

You see, when you look at e-commerce products, there are always categories what kind of product it is. If its clothes or shoes, computer, USB, or furniture.

I really wonder if theres some website that have pre-judged and pre-classified such categories for websites.

so perhaps we could see rough categories of websites -
e-commerce
software website
community website

And in this project, there are some of specifics you should consider. Please read below.

Condition 1.
Should be able to operate on global scale. When you search around websites, you can expect its mostly anglosphere websites written in English, But the project purpose is to even classify websites from another market and another country. For example) website that is written in Russian, Hindi, Chinese.
This is why manual data input could be meaningless and only similarity index measure to acquire website category would be the way.

condition 2.
please show me how it does work by picking 10 times of examples.

condition 3.
after you showing me condition 2, I can cross check bringing images from my backgrounds.

condition 4.
when cross checking in condition 3 is done, I will release the milestone.

condition 5.
when the main script gets to be finished, it would be needing to implement multi-threaded scripting environment to compensate its speed. (the tool should be applying into thousands and millions of websites, so speed itself is important matter)

condition 6.
tool should have similarity index variable inside of the script. so i can adjust how narrow/wide the similarity degree will be.


Essential Note 1.
If you know some service/website that is able to satisfy project purpose, and a service can provide their API and let clients use their service in script/command line, Im also opened to use such service. You would need to help to use the script. (But when it gets to be 3rd party software/service API using case, Since it is not property made by you, and since itll cost regularily paying to that 3rd party service, and the offer price would be much lower than 200$. I would release 60$ for setting up the script using API. Please remind that.)

Before offering bid : Please explain briefly how the work would be done. Or perhaps, please explain what other procedures need to be done before going deep in the main work to get this job done together.
Thank you for reading! Have a good day and bid me if you think you can complete one. Anytime !
And I am willing to respond very quickly to explain and describe more what I exactly need.
also, if you can really make it out well this time in this project, I will promise to carry out further projects with you. Thanks alot.
About the recuiter
Member since Mar 14, 2020
Chatinder Banga
from Quezaltenango, Guatemala

Skills & Expertise Required

Data Science & Analytics Data Mining & Management 

Candidate shortlisted and hiredHiring open till - May 14, 2024

Work from Anywhere

40 hrs / week

Fixed Type

Remote Job

$191.62

Cost

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

Excel Spreadsheet Expert

Need Excel Spreadsheet Expert to re-create a Real Estate Deal Analyzer sheet.

We have a fully functional one for you to copy, we would just like to layout looking different.

nutrition facts for juices.

I have A small juice shop.
I need someone too get all nutrition facts for all of my juices in the menu
there is between 50 too 100 recipes.
Some are very simple with 1 ingredient like carrot juice & some have a lot like 8 ingredients.
read more

Microsoft Visio Project

Build a Microsoft Visio workflow for a 6 phased onboarding process for a customer success team