2. Ingesting the extracted data into hadoop cluster (csv, xml, json, fixed width etc)
3. Using mapreduce, pig to transform the data and load it into mpp.
4. Stitch all above into one process using schedulers.
","employmentType":["FULL_TIME","PART_TIME","CONTRACTOR","TEMPORARY","PER_DIEM"],"jobLocationType":"TELECOMMUTE","hiringOrganization":{"@type":"Organization","name":"Toogit","sameAs":"https://www.toogit.com/","logo":"https://www.toogit.com/images/toogit_logo_initial.png"},"identifier":{"@type":"PropertyValue","name":"Toogit","value":300323},"skills":["Apache Flume","Apache Hive","Apache Spark","Hadoop","Hbase"],"applicantLocationRequirements":[{"@type":"Country","name":"IN"},{"@type":"Country","name":"Canada"},{"@type":"Country","name":"USA"},{"@type":"Country","name":"Germany"},{"@type":"Country","name":"Pakistan"},{"@type":"Country","name":"Philippines"},{"@type":"Country","name":"Indonesia"},{"@type":"Country","name":"Sri Lanka"},{"@type":"Country","name":"Nigeria"},{"@type":"Country","name":"China"},{"@type":"Country","name":"Russia"},{"@type":"Country","name":"Bangladesh"}],"validThrough":"2024-08-20T17:45:02+05:30","url":"https://www.toogit.com/freelance-jobs/MzAwMzIz"}
Remote Network And System Administration Job In IT And Networking
Find more Network And System Administration remote jobs posted recently Worldwide
Work from Anywhere
40 hrs / weekHourly Type
Remote Job$19.16
Cost Looking for help? Checkout our video tutorial
How to search and apply for jobs
How to apply? Do you have more questions about the Job?
See frequently asked questions
We are looking for a freelancer who has proven experience in Data Engineering projects.
The requirements we are looking for:
- Experience with Python
- Experience with Big Data tools (eg: Hadoop, Cassandra, Kafka)
- Experience wi...read more
I need to convert JSON, Avro or other row-based format files in S3 into Parquet columnar store formats using an AWS service like EMR or Glue.
I already have code that converts JSON to parquet using Python but the process is very manual, acco...read more
We are looking for consultants to review our Multiple Choice Questions based assessment on the Apache Hadoop and its related technologies. We will share the questions with the expert and he/she will have to critically review the questions.
We are looking for a freelancer who has proven experience in Data Engineering projects.
The requirements we are looking for:
- Experience with Python
- Experience creating ETL workflows
- Experience with Big Data tools (eg: Hadoo...read more