
Manager, Data Cloud Platform
Chainalysis · Permanent Full-time
Oct 2024 - Present
• 9 mos
Coursera
Jun 2020 - Oct 2024
Manager, Data Engineering
Mar 2023 - Oct 2024
• 1 yr 8 mos- Leading and managing a global Data Engineering group (10+ direct reports), consisting of 3 work-streams critical to the organization's strategy: Marketing, Coursera Degrees, and Coursera Learning Management System (LMS).
- Constructed processes and collaborative systems to formulate highly engaged and high performance teams.
- Building strong relationships with leadership, business stakeholders, engineering, data science, and other teams to collaboratively define data roadmaps.
- Delivering batch and streaming / near-real-time data products that drive 10+ M revenue per year
- Architected data solutions enabling self-serve decision making frameworks, driving efficiency
- Spearheading enterprise data governance and compliance frameworks on the unified data lake.
- Advocated and contributor to LinkedIn’s open source Datahub project
Senior Data Engineer
Jun 2020 - Mar 2023
• 2 yrs 10 mos
Senior Data Engineer
Drop Technologies
Jul 2018 - Apr 2020
• 1 yr 10 mos• Responsible for sprint planning, management, and accountable for overall team velocity and completion rate. Communicate frequently with stakeholders to prioritize efforts and manage expectations
• Led a strategic initiative by investigating new technologies (Snowflake, Databricks) that align with
our data strategy, performing POCs, negotiating contracts with vendors, and spearheading the implementation of production frameworks and data pipelines to create a new, scalable, high margin revenue stream
• Led efforts to integrate critical data sources into the data warehouse by designing and building ETL workflows using Python/Airflow and SQL which enabled advanced BI and analytics across multiple subject areas and drove key business decisions
• Knowledgeable on a data lake implementation using the lambda architecture which involved streaming CDC data from source databases into AWS S3 and creating reconciled views of the data using PySpark and EMR
• Led revamping efforts of the Looker BI and analytics environment by making the interface more user friendly, implementing data structures and pipelines that are more performant and flexible, and simplifying processes to enable the Analytics team to add, maintain, and govern event data

Big Data Engineer
Precima, Inc.
Sep 2015 - Jul 2018
• 2 yrs 11 mosAnalytics and Architecture
• Standardized an end-to-end, Machine Learning retail pricing solution from modeling to optimization and to everyday price management
• Designed the data structures required to enable the entire pricing workflow and all the front-end reporting/visualization requirements, as well as finalized the calculations for business metrics
• Optimized the run-time performance of analytics algorithms relevant to the retail industry to improve assortment effectiveness and enable personalized cross sell/up sell offers
• Standardized a client-agnostic promotion solution which measures promotion effectiveness based on predicted and actual metrics
Technology
• Implemented the standard retail pricing solution with various cloud based technologies such as AWS Redshift, EC2, S3, AWS RDS, AWS Aurora, Postgres
• Championed the implementation and adoption of various database/data warehouse best practices across technical teams, such as those for Netezza, AWS Redshift, AWS RDS
Business Intelligence
• Collaborated with client teams to understand their business and their data, identified data elements and finalized data transformation logic required for BI reports
• Drafted detailed technical requirements documentation which includes important project decisions as well as end-to-end data mapping/transformation to provide a holistic view for developers and testers

Professional Services Consultant
Teradata
Jul 2012 - Sep 2015
• 3 yrs 3 mosClient Engagements:
Airline (2015)
• Gathered requirements and successfully bridged the gaps between source system SMEs, the reporting team, and the business
• Designed and created core table structures based on client requirements and also built and automated the corresponding ETL processes from source systems
• Designed and created semantic layer views and materialized fact tables based on reporting requirements and reporting tool capabilities and also built and automated the corresponding workflows
Retail (2014)
• Delivered the disaster recovery plan which detailed the client’s DR strategy and steps to recover from a disaster
• Managed the CR process and coordinated efforts between the internal project team and external client-side IT teams to ensure deployment requirements are met
• Delivered implementation plans that orchestrated efforts between various IT teams and successfully deployed all components of the DR solution into production
Retail (2014)
• Gathered business and IT requirements that drove the governance and technical design of the client's Data Lab environment
• Designed the client's Data Lab environment and completed the subsequent implementation
• Delivered high-level Data Lab presentations to VPs and detailed Data Lab and database mentoring to business end users
Retail (2013)
• Developed key fact tables and their respective ETL processes using SQL that are the basis for multiple BI reports
• Performed data validation with the client through QA and UAT and resolved issues when necessary
• Delegated code fixes and coordinated code promotions to production
Telecommunication (2012)
• Optimized performance on the system: reduced long-running ETL workflows by an average of 50% and overall ETL performance by 19%
• Developed dimensional views in SQL and ETL workflows for physical fact tables in the BI semantic layer and fixed related defects
• Assumed DBA responsibilities: performed database maintenance and resolved various database issues

DB2 QA Project Lieutenant
IBM
May 2010 - Aug 2011
• 1 yr 4 mos• Cooperated with a global team, maintained build stability in projects, and performed DB2 testing using Linux/Unix/Windows
• Designed and developed 9 Shell scripts with 4 colleagues which reduced total operating time by 27 – 39%
• Established a new Standard Operating Procedure which reduced total operating time by 1 – 1.5 hours per day
• Performed statistical analysis and initiated a machine virtualization project which resulted in 1 – 3 hours saved per day
• Achieved excellent ratings in quick learning, initiative, organization, and teamwork

IT Deployment Specialist
SGS Life Science Services
May 2008 - Aug 2008
• 4 mos• Analyzed how to integrate a software product costing multi-million dollars into the daily operations of the firm
• Gathered business requirements from business users
• Created the Standard Operating Procedure, validated the software, and trained 50 personnel to use the software
• Performed verification and validation testing on the software and discussed potential auditing problems with global QA
• Combined 6 Excel worksheets into 1 master worksheet and implemented 17 new functions and features
• Achieved excellent ratings in personality, diligence, and competence