Sangalo Mwenyinyo | Private Bag, Nairobi Kenya
EDUCATION
MIT
MicroMasters Program in
Statistics and Data Science |
Expected Aug 2025 | Cambridge
Massachusetts
MOUNT KENYA UNIVERSITY
Clinical Medicine and
Community Health | 2022 | Thika
Kenya
CERTIFICATIONS
GOOGLE CLOUD
Google Cloud Certified
Professional Cloud Architect |
Jan 2024
Link to Certificate
Google Cloud Certified
Professional Data Engineer |
Sept 2023
Link to Certificate
DBT
dbt Fundamentals | April 2023
Link to Certificate
SKILLS
PROGRAMMING LANGUAGES
Confident
• Python • SQL • DAX • M
Familiar
• JavaScript
EXPERIENCE
VILLAGE HEALTH WORKS | Cloud Architect & Data Engineer •
full-time (remote)
March 2023 - present | New York USA
• I implemented a data engineering framework leveraging Infrastructure as Code
(IaC) principles, automating multi-cloud infrastructure (GCP, AWS), optimizing
data transformation workflows using dbt, and ensuring version control with
Git. This setup facilitated efficient data processing pipelines and streamlined
development and deployment.
• I designed and implemented a modern data warehousing solution in BigQuery,
integrating diverse data sources and establishing efficient data pipelines,
ensuring data integrity and accessibility for analytics.
PRESCOTT DATA | CTO & Data Engineer • part-time (remote)
Feb 2021 - Present | Nairobi Kenya
• Oversee all technology operations and strategies.
• Drive innovation, manage IT infrastructure, and lead the organization’s
technical initiatives to support business growth and deliver exceptional client
experiences.
MEDICAL MISSIONS | Data Engineer • contract (remote)
Sept 2021 - Aug 2022 | Juba South Sudan
• Infrastructure design and deployment in AWS.
• Designed and configured a MySQL database, leveraging it to generate
actionable insights with MS Power BI that drove value and informed
decision-making.
PROJECTS
BUCHER MUNICIPAL | Data Engineer • contract (remote)
Dec 2022 - Apr 2023 | Zurich Switzerland
• Engineered dashboards in Looker Studio, translating complex requirements
into intuitive designs for data visualization.
• Integrated STAT site SOV connector and historical keywords connector.
• Analyzed share of voice and keyword data from various STAT sites.
EDISON HOUSE | BI Engineer
Dec 2022 - Mar 2023 | Utah USA
• Designed and executed AWS infrastructure deployments, streamlined MySQL
database performance, and crafted compelling data visualizations using Power
BI.
AMILI | Data Analyst
Aug 2022 - Sept 2022 | Singapore
• Data collection and standardization on a study on Akkermansia Muciniphila.
TOOLS
• AWS • GCP • Azure • Docker •
Kubernetes • Git • Kafka • Spark •
Hive • Terraform • Airflow • PySpark
• PyTorch • Scikit-Learn • Power BI •
Looker Studio • Metabase • Numpy •
Pandas • dbt • BigQuery • Jupyter
notebooks • Grafana
LINKS
Github:// Sangalo20
LinkedIn:// Sangalo-mwenyinyo
Stackoverflow:// Innocent-sangalo
OTHER FREELANCE PROJECTS
• I spearheaded the development of a comprehensive architectural blueprint for the migration of on-premise
systems to Google Cloud for a Canadian parastatal. Additionally, I formulated a robust data strategy and
governance framework to optimize data management within the organization.
• I contributed to a healthcare initiative for an English hospital, entailing the analysis of patient data derived from
an internal drug study. Furthermore, I played a key role in conceiving and constructing a cutting-edge Gen AI
medical application, leveraging GPT-4 technology for the tokenization of medical data.
• I automated reporting processes by integrating Power BI and Python, creating an AI-driven solution with
sophisticated automated text analytics. Implemented a dedicated SQL pool on Azure Synapse for seamless
integration with Power BI, facilitating the development of comprehensive business intelligence dashboards.