SQL and NoSQL database systems
Data warehousing solutions
ETL tools
Machine learning
Data APIs
Python, Java, and Scala programming languages
Understanding the basics of distributed systems
Knowledge of algorithms and data structures
Create and maintain optimal data pipeline architecture
Assemble large, complex data sets that meet business requirements
Identify, design, and implement internal process improvements
Optimize data delivery and re-design infrastructure for greater scalability
Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS technologies
Build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency, and other key business performance metrics
Work with internal and external stakeholders to assist with data-related technical issues and support data infrastructure needs
Create data tools for analytics and data science team members
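
To make the extraction, transformation, and loading responsibility above more concrete, here is a minimal sketch of an ETL step in Python. It is an illustration under stated assumptions, not a prescribed implementation: the sample data, the `customers` table, and the function names (`extract`, `transform`, `load`, `run_pipeline`) are all hypothetical, and an in-memory SQLite database stands in for whatever SQL or AWS storage a real pipeline would target.

```python
import csv
import io
import sqlite3

# Hypothetical sample input; a real pipeline would read from files,
# APIs, or object storage rather than an inline string.
RAW_CSV = """customer_id,signup_date,spend
1,2024-01-05,120.50
2,2024-01-06,
3,2024-01-06,80.00
"""


def extract(text):
    """Extract: parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: drop rows with a missing spend value and cast field types."""
    cleaned = []
    for row in rows:
        if not row["spend"]:
            continue  # skip incomplete records
        cleaned.append(
            (int(row["customer_id"]), row["signup_date"], float(row["spend"]))
        )
    return cleaned


def load(conn, records):
    """Load: write the cleaned records into a SQL table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS customers ("
        "customer_id INTEGER PRIMARY KEY, signup_date TEXT, spend REAL)"
    )
    conn.executemany("INSERT INTO customers VALUES (?, ?, ?)", records)
    conn.commit()


def run_pipeline(text):
    """Run extract, transform, and load against an in-memory SQLite database."""
    conn = sqlite3.connect(":memory:")  # assumption: stand-in for a real warehouse
    load(conn, transform(extract(text)))
    return conn


if __name__ == "__main__":
    conn = run_pipeline(RAW_CSV)
    (total,) = conn.execute("SELECT SUM(spend) FROM customers").fetchone()
    print(f"Total spend across loaded customers: {total}")
```

Keeping the three stages as separate functions makes each one testable on its own and lets the storage target be swapped without touching the parsing or cleaning logic.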