Problem description: I want to use this multi-label classifier for Google BERT:
https://medium.com/huggingface/multi-label-text-classification-using-bert-the-mighty-transformer-69714fa3fb3d

However, by default, when Google BERT converts a document to features, it enforces a maximum sequence length of 512 WordPiece tokens, and it truncates any article longer than that.
The SQuAD classifier for BERT actually implements a sliding-window solution for longer articles. I tried to splice it into the multi-label classifier but didn't get it right.
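For reference, the sliding-window idea can be sketched without any BERT code at all. This is a minimal, framework-free illustration (the helper name `sliding_windows` is hypothetical, not from the SQuAD code; 512 and 128 mirror BERT's usual `max_seq_length` and `doc_stride` defaults):

```python
def sliding_windows(tokens, max_seq_len=512, doc_stride=128):
    """Split a long token list into overlapping windows of at most
    max_seq_len tokens, advancing the start by doc_stride each time,
    in the spirit of the SQuAD doc_stride approach."""
    if len(tokens) <= max_seq_len:
        return [tokens]
    windows = []
    start = 0
    while start < len(tokens):
        windows.append(tokens[start:start + max_seq_len])
        # Stop once a window reaches the end of the document.
        if start + max_seq_len >= len(tokens):
            break
        start += doc_stride
    return windows

# Example: a 1024-token "article" becomes several overlapping windows.
article = list(range(1024))
windows = sliding_windows(article)
```

Each window would then be converted to features and classified independently; the overlap (controlled by `doc_stride`) means no token span straddles a hard cut-off.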
Deliverable: I want a solution to this problem of ingesting long articles (>512 WordPiece tokens) into Google BERT, with code in a Jupyter notebook. For example, a 1024-token article would, using the doc_stride solution, be ingested as two overlapping 512-token sequences; classification would then be run on both sequences and the arg_max of the predictions returned.
Comments and documentation of how you created the solution would also be appreciated.
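To make the aggregation step concrete, here is one hedged sketch of combining per-window outputs. For a multi-label classifier, a common choice is to max-pool each label's probability across windows (a label fires if any window detects it); the arg_max mentioned above then picks the single strongest label. The function names are hypothetical, and the probabilities below are made-up placeholders standing in for per-window sigmoid outputs:

```python
def pool_predictions(window_probs):
    """Max-pool per-label probabilities across windows.

    window_probs: list of per-window probability vectors, one vector
    per sliding window, all of the same length (one entry per label).
    Returns a single pooled probability vector for the whole article.
    """
    n_labels = len(window_probs[0])
    return [max(probs[i] for probs in window_probs) for i in range(n_labels)]

# Placeholder per-window outputs for a 3-label problem, two windows:
window_probs = [
    [0.10, 0.90, 0.30],  # window 1
    [0.80, 0.20, 0.30],  # window 2
]
pooled = pool_predictions(window_probs)   # [0.80, 0.90, 0.30]
top_label = pooled.index(max(pooled))     # arg_max over pooled scores
```

Mean-pooling the window probabilities is an equally reasonable alternative; which works better would need to be checked empirically on the target data.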
About the recruiter: Adam Kalicak, from Pest, Hungary. Member since Mar 14, 2020.