
Quantitative Analysis, R, Python freelancer required: filter a price from order book level 2 summary data

Posted at - Mar 23, 2024


Data:
- The raw data is level 2 order book snapshots and incremental updates at tick frequency (each update = new observation) for a single asset.
- To simplify the task and limit the scope, you need to work with a summarized dataset in the form of CSV files with the following columns: timestamp, bidPrice_x1, bidPrice_x2, ..., askPrice_x1, askPrice_x2, ..., where bidPrice_x1 = the average price at which a market sell order of size x1 would be executed if it arrived at this instant, and askPrice_x1 is the corresponding average price for a market buy order of size x1.
- The scope of this task is limited to the summarized dataset. If you believe that you could do much better if you could only calculate different features from the raw orderbook data, we could discuss it as a separate job.
- Expect to work with 10M-100M rows, 10-20 columns with possible subsampling.
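
To illustrate the bidPrice_x definition above, here is a minimal sketch of how such a column could be derived from one side of the raw book. The function name `sweep_price` and the (price, quantity) list layout are assumptions made for illustration only; the summarized CSV files already contain these values, so this is not part of the task itself.

```python
def sweep_price(levels, size):
    """Average execution price of a market order of `size` walking `levels`.

    levels: list of (price, quantity) tuples, best price first
            (bids for a market sell, asks for a market buy).
    Returns None when the visible book is not deep enough to fill `size`.
    """
    remaining = size
    cost = 0.0
    for price, qty in levels:
        take = min(qty, remaining)   # fill as much as this level allows
        cost += take * price
        remaining -= take
        if remaining <= 0:
            break
    if remaining > 0:
        return None                  # order larger than visible depth
    return cost / size


# Hypothetical bid side: 2 units at 100.0, 3 at 99.5, 10 at 99.0
bids = [(100.0, 2.0), (99.5, 3.0), (99.0, 10.0)]
bid_price_x = sweep_price(bids, 4.0)  # 2 units at 100.0 plus 2 units at 99.5
```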

Goal:
For each row, output a summary price P_t such that P_t = E[ (best bid price at t + dt + best ask price at t + dt) / 2 | data available at t ], where dt is on the scale of 1-10 minutes (to be decided).
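
As a rough sketch of how the regression target implied by this goal could be built at tick frequency: for each row, look up the mid price at the last observation at or before t + dt. The function name `forward_mid` is hypothetical. It assumes sorted numeric timestamps, and that best bid and best ask series are available (or approximated by the smallest-size bidPrice/askPrice columns).

```python
import numpy as np

def forward_mid(timestamps, best_bid, best_ask, dt):
    """Mid price in force at t + dt for every row (the quantity P_t should predict).

    timestamps must be sorted ascending and dt non-negative, in the same units.
    Rows whose horizon t + dt lies beyond the last timestamp get NaN.
    """
    timestamps = np.asarray(timestamps, dtype=float)
    mid = (np.asarray(best_bid, dtype=float) + np.asarray(best_ask, dtype=float)) / 2.0
    horizon = timestamps + dt
    # Index of the last observation with timestamp <= t + dt
    idx = np.searchsorted(timestamps, horizon, side="right") - 1
    target = mid[idx]
    target[horizon > timestamps[-1]] = np.nan  # no data yet at t + dt
    return target
```

The target is only used when fitting the model; the filter that produces P_t never sees it at prediction time.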

For example, the simplest summary price of the order book would just be the mid price between best bid and best ask, but it misses the information content of the order book imbalance (if there is more volume on the bid than on the ask, the price will on average go up) and of the momentum/mean-reversion time series dynamics. You need to take the shape of the order book and the time series into account in some basic fashion. The goal is not to outperform the market with such a prediction, but just to reasonably summarize 80% of the information content in the level 2 order book dynamics that is essentially common knowledge to market participants. Obviously, you can only use past data for prediction.
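
One possible linear baseline for this, sketched under assumptions: the features are the midpoints of the bidPrice_x/askPrice_x pairs minus the mid price (one per order size, which tends to be positive when the bid side is deeper), and the forward change in mid price is regressed on them by OLS. The function names and the feature choice are illustrative, not a prescribed method; momentum terms such as lagged mid price changes could be added as extra columns in the same way.

```python
import numpy as np

def depth_mid_features(mid, bid_px, ask_px):
    """One feature per order size: midpoint of the two sweep prices minus the mid.

    bid_px, ask_px: arrays of shape (n_rows, n_sizes) holding bidPrice_x / askPrice_x.
    """
    mid = np.asarray(mid, dtype=float)
    sweep_mid = (np.asarray(bid_px, dtype=float) + np.asarray(ask_px, dtype=float)) / 2.0
    return sweep_mid - mid[:, None]

def fit_ols(X, y):
    """Least-squares fit of y on [1, X], skipping rows with NaN or inf values."""
    y = np.asarray(y, dtype=float)
    A = np.column_stack([np.ones(len(X)), X])
    ok = np.isfinite(y) & np.isfinite(A).all(axis=1)
    beta, *_ = np.linalg.lstsq(A[ok], y[ok], rcond=None)
    return beta

def predict_price(mid, X, beta):
    """Summary price = current mid + predicted forward change in mid."""
    A = np.column_stack([np.ones(len(X)), X])
    return np.asarray(mid, dtype=float) + A @ beta
```

To respect the past-data-only constraint, the coefficients would be fitted on an earlier segment of the data (with y = forward mid minus current mid) and then applied to later rows.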

Deliverable:
You should deliver a script that reads the data and outputs the summarized price for each input row, and explain to me how it works. You can use R (preferred) or Python on a single server, no cluster solutions. Please stick to the simplest and fastest algorithms, essentially linear models only, and discuss with me if you go for anything more complicated than OLS or a Kalman filter.
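
For reference, the simplest Kalman filter in this spirit is a scalar local-level (random walk plus noise) filter. The sketch below is illustrative only: the function name, the choice of a random-walk state, and the variances q and r are assumptions that would have to be estimated or tuned on the actual data.

```python
def local_level_filter(observations, q, r, x0=None, p0=1.0):
    """Scalar Kalman filter for the model  x_t = x_{t-1} + w_t,  z_t = x_t + v_t.

    q: variance of the state noise w_t, r: variance of the observation noise v_t.
    Returns the filtered state after each observation (uses past data only).
    """
    x = observations[0] if x0 is None else x0
    p = p0
    filtered = []
    for z in observations:
        p = p + q              # predict: state uncertainty grows by q
        k = p / (p + r)        # Kalman gain
        x = x + k * (z - x)    # update the state with the innovation
        p = (1.0 - k) * p      # posterior variance
        filtered.append(x)
    return filtered
```

Each step is O(1), so a single pass over 10M-100M rows is feasible on one server, although a vectorized or compiled loop would be preferable to plain Python at that scale.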

I'll provide access to an RStudio Server for R, tbd for Python.

About you:
You have experience working with order book level 2 and time series data or at least have a solid understanding of relevant methods. You value simplicity and don't throw all the fancy machine learning stuff at the solution just because this is cool and it makes you look more sophisticated.

I would like to hire several people for this job for different assets and exchanges. Feel free to ask questions and discuss the task and conditions.

About the recruiter:
Sophia Noreen Syed, member since Nov 11, 2022
from Uttar Pradesh, India

Skills & Expertise Required

Quantitative Analysis, R, Python

Open for hiring. Apply before Jun 21, 2024

Work from Anywhere
40 hrs / week
Fixed Type
Remote Job
Cost: $479.67
