Remote Network And System Administration Job In IT And Networking

Solutions Architect to help me architect my API/Infrastructure

Find more Network And System Administration remote jobs posted recently Worldwide

I am running a large-scale computer vision/machine learning platform that works by processing the following:

1) Live streams (RTSP) [https://en.wikipedia.org/wiki/Real_Time_Streaming_Protocol]
2) Videos stored in Amazon S3
3) Images extracted from videos stored in Amazon S3


I want to re-architect my solution for scalability and flexibility as well as fault tolerance.

Currently,

For Videos/Images

1) A video will be populated in Amazon S3, this will then trigger an Amazon Lambda function that will publish a task into Amazon SQS.

2) I have machine learning code (Python 2.7/Tensorflow/OpenCV) that is pulling tasks from Amazon SQS. Then downloading the videos from Amazon S3, doing processing and publishing JSON to another Amazon SQS.

3) For RTSP streams (live), I manually deploy to servers to continuously process each RTSP steam using similar code (Python 2.7/Tensorflow/OpenCV)


Challenges:

-- I have no way to autoscale my worker nodes (processing videos) based off incoming load (messages in Amazon SQS).
-- Each new use-case requires at least more Amazon SQS (for development, staging and production). This becomes very difficult to maintain.
-- Each task will require different Neural Networks depending on the type of task and the image quality etc.... This means sometimes I have to use deep neural networks which require GPU support. But, sometimes I can run the task on a light-weight CPU. So each Task needs to have a weight associated with it and be distributed to a corresponding instance (for example, Task A requires GPU so it should be run on an instance that is GPU enabled, but Task B can run on CPU or GPU because it is not as compute intensive)

For live feeds:
-- Live RTSP is very difficult because frames are stored in memory and if I dont process each frame immediately it will crash the program or it will skip to the next live frame so I lose frames.
-- each RTSP task also has a weight so I need to be able to run some RTSP tasks on GPU and some on CPU.

General issues:
-- I need to maintain a status page so that I can see if all processes are running correctly for better DevOps/Debugging.
-- I have no centralised logging for debugging
-- If the source RTSP is not working (network issues or live stream is not functional), it should free up the instance to perform another task until that RTSP is back.


All my use-cases are structured as modules, each module will be a different package in my python project, with its own instructions for running. Each module will run on RTSP or Video, never both. But, I can run Module A on live stream (RTSP) and Module B on Videos independently.
About the recuiter
Member since Mar 14, 2020
Inder Singh Cha
from Maharashtra, India

Skills & Expertise Required

Amazon S3 OpenCV TensorFlow 

Open for hiringApply before - Sep 19, 2024

Work from Anywhere

40 hrs / week

Hourly Type

Remote Job

$26.81

Cost

Offer to work on this project closes in 129 days!
Are you interested in this Opportunity?

Looking for help? Checkout our video tutorial
How to search and apply for jobs

How to apply? Do you have more questions about the Job?
See frequently asked questions

Similar Projects

AWS Server Tuning Specialist Developer

We need a very experienced AWS specialist to assess our setup and help figure out bottlenecks and a recommended setup.

We have 5 of sites that often have up to 3000 concurrent users and are struggling to get an optimal setup that doesnt ra...read more

Data Scientist Needed for recommendation system for recruiting company

Looking for an experienced data scientist who can help create a complex algorithm and work with a development team in doing so. Also, in need of someone with a strong background in API Integration

This project is very very unique and we are...read more

Machine Learning Algorithm to train it on a custom video dataset and test it on other videos

Looking for a machine learning expert who will help with the following:

- Develop an algorithm that detects objects within a video (whether you use YOLO v3 or prefer a better option)
- We will give the video that will be used as a datase...read more

Developer needed classification of BCI 3 3a dataset using cnn and accuracy over 94% is needed

i need subject wise accuracy above 94% . Its cued motor imagery (multi-class) with 4 classes (left hand, right hand, foot, tongue) three subjects (ranging from quite good to fair performance)
EEG, 60 channels, 60 trials per class .The goal is imp...read more

Solving a fractional differential equation using a neural network.

I want to solve a fractional differential equation. I already make neural network in TensorFlow but it has some bugs I couldnt fix it. May be you can do it.