This is a remote Data Analyst/Developer consulting opportunity with occasional, fully paid travel to Pittsburgh, PA.
*This opportunity is not open to consultants who require sponsorship*
Scope of Responsibilities
- Define and drive the data lifecycle strategy across data acquisition, data ingestion, data cleansing, normalization and linkage
- Ensure key entities within datasets are identified, resolved and linked to existing entities within the current master data repository
- Define rules for data quality and develop test scripts to clean and validate data
- Establish best practices for data chain of custody so that data can be easily traced back to its source for accuracy and consistency
- Perform exploratory data analyses, generate and test working hypotheses, and prepare and analyze data for import
- Work directly with users and subject-matter experts (SMEs) to establish, create and populate optimal data architectures and structures, and explain techniques and results in non-technical language
Qualifications
- 3-5 years of experience programmatically transforming data using Python or R
- Advanced SQL programming skills (PostgreSQL preferred) to create stored procedures, advanced views, etc.
- Experience creating Python scripts to import data from CSV, XML, and JSON files
- Strong analytical ability and attention to detail
- Bachelor's degree in Computer Science, Mathematics or related technical field
- Ability to work independently with little supervision
- A burning desire to tackle hard problems and create sustainable solutions
- Experience using Amazon Web Services
- Experience in or exposure to the nuances of a startup or other entrepreneurial environment
- Experience working with large (multi-terabyte) datasets
location: Pittsburgh, Pennsylvania
job type: Contract
salary: $65 - $75 per hour
work hours: 8am to 5pm
education: Bachelor's
qualifications:
- Experience level: Experienced
- Minimum 5 years of experience
- Education: Bachelor's
skills:
Equal Opportunity Employer: Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status.
For certain assignments, Covid-19 vaccination and/or testing may be required by Randstad's client or applicable federal mandate, subject to approved medical or religious accommodations. Carefully review the job posting for details on vaccine/testing requirements or ask your Randstad representative for more information.