Data Processing Design a pipeline to ingest data into an operational data store.

$10-100 USD

Completed
Posted over 4 years ago

Paid on delivery
Design a pipeline to ingest data into an operational data store. The pipeline should account for:

- monitoring and logging
- auditing for completeness (source records should not be dropped)
- the ability to configure and replicate the pipeline for different sources with minimal changes

Implement a specific part of the pipeline using Apache Airflow/Spark. Document the critical choices and decisions, preferably using Git.

Data for this procedure: TMDB. The Movie Database (TMDb) is a community-built movie and TV database. It can be used to demonstrate the design and implementation.

APIs introduction: [login to view URL]
File dumps: [login to view URL]

Design and standards:

1. I am looking for a pipeline implementation that works on either or high availability. Given this requirement, I would like you to design a solution that ingests data with integrity when run on a tuned production setup.
2. You must use Python 3 (latest) for the programming language, frameworks, and libraries.
3. You may use Docker (Dockerfile, docker-compose) to set up the environment; if you do, add those files to the repository.
4. You are free to choose any flavor of Git workflow, ideally one that a team could also extend.

Submitting the solution: Please email me your solution, which should contain:

- a summary of the design
- the implementation code (i.e., DDLs, Dockerfiles, pipeline implementation)

Ideally, share a link to a public Git repository with all docs and code described by a README.
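To make the completeness and configurability requirements concrete, here is a minimal sketch in plain Python 3 of one way they could fit together. It is not an Airflow or Spark implementation and is not the solution that was delivered for this project. All names in it (SourceConfig, AuditReport, ingest, the "tmdb_movies" source name) are hypothetical, the use of "id" as the record key is an assumption about the source data, and a plain dict stands in for the operational data store.

```python
import logging
from dataclasses import dataclass, field

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("ingest")


@dataclass(frozen=True)
class SourceConfig:
    """Per-source settings, so a new source needs a new config, not new code."""
    name: str
    key_field: str  # field assumed to uniquely identify a record


@dataclass
class AuditReport:
    """Counts used to reconcile what was read against what was stored."""
    source: str
    read: int = 0
    loaded: int = 0
    rejected: list = field(default_factory=list)

    @property
    def complete(self) -> bool:
        # Every record read must be either loaded or kept in `rejected`.
        return self.read == self.loaded + len(self.rejected)


def ingest(config, records, store):
    """Load records into `store` (a dict standing in for the data store)."""
    report = AuditReport(source=config.name)
    for record in records:
        report.read += 1
        key = record.get(config.key_field)
        if key is None:
            # Invalid records are quarantined for review, never dropped.
            report.rejected.append(record)
            log.warning("%s: record without %r quarantined",
                        config.name, config.key_field)
            continue
        store[(config.name, key)] = record
        report.loaded += 1
    log.info("%s: read=%d loaded=%d rejected=%d complete=%s",
             config.name, report.read, report.loaded,
             len(report.rejected), report.complete)
    return report


# Example run with made-up records; the second one lacks the key field.
records = [{"id": 1, "title": "A"}, {"title": "no id"}, {"id": 2, "title": "B"}]
store = {}
report = ingest(SourceConfig(name="tmdb_movies", key_field="id"), records, store)
```

In an orchestrated setup, a function like ingest could be wrapped in a task, with the returned report checked by a downstream audit step that fails the run when complete is false.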
Project ID: 22790009

About the project

1 proposal
Remote project
Active 4 yrs ago

Awarded to:
Hey, I'm Arnav. I'm an experienced Python full-stack developer with a skill set comprising web development (frontend and backend), automation, machine learning, deep learning, data mining, data analysis, API development, database architecture, and database management. I have previously worked with various Python libraries and am comfortable using libraries and frameworks such as Django, Flask, Selenium WebDriver, NumPy, pandas, TensorFlow, PyTorch, PyQt, Matplotlib, etc. I'm also fairly skilled in jQuery, AJAX, and Node.js. Having said that, I believe I can deliver the project on time and as per your requirements. Hit me up so that we can discuss the details further.
$50 USD in 5 days
4.4 (21 reviews)

About the client

Dubai, United Arab Emirates
5.0
1
Payment method verified
Member since Oct 17, 2010

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)