OVERVIEW
I have a collection of sensor data from 40 sites in CSV format. There is one folder per site, and each folder contains many CSVs.
The project is to combine each of the CSVs for a site into a single CSV with all sensor data for that site.
This should be relatively trivial to do in Python, though it will require building a single, comprehensive list of all timestamps for a site, sorting it, and then populating each sensor's value in the correct row based on those timestamps. If you can provide Python source code that does this, that would be appreciated.
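As a rough illustration of the timestamp step, here is a minimal sketch using only plain Python data structures. The function name pivot_readings is hypothetical, and it assumes the timestamps are strings that sort chronologically as text (for example ISO 8601); if they are not, they would need to be parsed before sorting. When the same sensor reports twice for one timestamp, this sketch simply keeps the last value it sees.

```python
def pivot_readings(readings):
    """Pivot (point, ts, val) readings into one row per timestamp.

    Assumption: ts values are strings that sort chronologically as text.
    """
    by_ts = {}      # ts -> {point: val}
    points = set()  # every sensor UUID seen for this site
    for point, ts, val in readings:
        by_ts.setdefault(ts, {})[point] = val
        points.add(point)

    columns = sorted(points)
    header = ["ts"] + columns
    # One row per timestamp, in order; sensors with no reading at that
    # timestamp get an empty cell.
    rows = [
        [ts] + [by_ts[ts].get(point, "") for point in columns]
        for ts in sorted(by_ts)
    ]
    return header, rows
```

Sensors that have no reading at a given timestamp are left blank rather than filled forward, since the brief does not ask for interpolation.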
SOURCE CSVs
Each CSV has three columns: point, ts, val
point: The UUID for that sensor
ts: The timestamp when the value was recorded by that sensor
val: The value recorded by the sensor
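Reading one site folder could look something like the sketch below. The function name read_site_readings is hypothetical, and it assumes each file has a header row naming the columns exactly point, ts and val, is comma-delimited, and has a .csv extension; none of that has been checked against the real data.

```python
import csv
from pathlib import Path


def read_site_readings(site_dir):
    """Yield (point, ts, val) tuples from every CSV in one site folder.

    Assumption: each file starts with the header row point,ts,val.
    """
    for csv_path in sorted(Path(site_dir).glob("*.csv")):
        with open(csv_path, newline="") as f:
            for row in csv.DictReader(f):
                # Values are kept as strings exactly as they appear in the file.
                yield row["point"], row["ts"], row["val"]
```

Writing it as a generator means the rows can be streamed into whatever does the merging without first building a separate list per file.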
OUTPUT CSVs
The CSV filename format should be buildingname_first-reading-date_last-reading-date_sensor-count.
I will share one CSV file with shortlisted candidates.
The CSV header format should be ts followed by one column per sensor, e.g. ts,point1,point2,point3.
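A possible way to name and write one output file is sketched here. The names output_name and write_site_csv are hypothetical. The sketch assumes the header is a list starting with ts, the rows are already sorted by timestamp, and each timestamp is an ISO 8601 string whose first 10 characters are the date (YYYY-MM-DD). The .csv extension is also an assumption, since the stated format does not include one.

```python
import csv
from pathlib import Path


def output_name(building, header, rows):
    """Build buildingname_first-reading-date_last-reading-date_sensor-count."""
    first_ts, last_ts = rows[0][0], rows[-1][0]
    sensor_count = len(header) - 1  # every column except ts is a sensor
    # Assumption: timestamps start with YYYY-MM-DD, so [:10] is the date.
    return f"{building}_{first_ts[:10]}_{last_ts[:10]}_{sensor_count}.csv"


def write_site_csv(out_dir, building, header, rows):
    """Write the header and rows for one site and return the output path."""
    out_path = Path(out_dir) / output_name(building, header, rows)
    with open(out_path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(header)
        writer.writerows(rows)
    return out_path
```

If the real timestamps turn out to be in another format (epoch seconds, for instance), only the date slicing in output_name would need to change.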
DELIVERABLES
A successful output of the project will be:
- A zip file with 40 CSVs
- Each CSV named as buildingname_first-ts_last-ts_sensor-count
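For the zip deliverable, a minimal sketch with the standard zipfile module might look like this. The function name zip_outputs is hypothetical, and it assumes all finished CSVs sit together in one output folder.

```python
import zipfile
from pathlib import Path


def zip_outputs(csv_dir, zip_path):
    """Bundle every CSV in csv_dir into one zip and return the file count."""
    csv_files = sorted(Path(csv_dir).glob("*.csv"))
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for csv_file in csv_files:
            # arcname keeps only the filename, so the zip has no folder paths.
            zf.write(csv_file, arcname=csv_file.name)
    return len(csv_files)
```

The returned count gives a quick check that all 40 site files made it into the archive.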
Complete data will be provided to the winning candidate.
About the recruiter: Vinod Kumar, from Saratov, Russia. Member since Mar 14, 2020.