Skip to content
View Dalbee's full-sized avatar
:shipit:
Focusing
:shipit:
Focusing

Block or report Dalbee

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Dalbee/README.md

👋 Hi there, I'm Dalbert (Dalbee)

Software Engineer - Data Analytics / Data Engineering | Industrial Data Architect | GxP & DevOps Specialist

I build high-integrity, distributed systems that bridge the gap between complex data and real-world operations. My expertise lies in designing scalable data pipelines and mission-critical HMIs for high-stakes industries like Finance, Biotech, Energy, and Aerospace.


🚀 Core Focus & Expertise

  • Industrial Data Engineering: Architecting Medallion-style Lakehouses for multi-million row telemetry.
  • Financial Analytics Engineering: Building high-fidelity ROI models and prescriptive funnel analytics for banking and digital sales.
  • Mission-Critical Systems: Expertise in GxP-compliant software, 21 CFR Part 11 integrity, and high-availability DevOps.
  • Distributed Architectures: Specialized in "Triad" microservice patterns (.NET, Python, React).
  • Domain Interest: Exploring the intersection of Data Engineering and GNSS/CubeSat ground segments.

🛠️ Strategic Tech Stack

Category Technologies
Languages C# (.NET 10), Python (FastAPI/PySpark), TypeScript, SQL, SAS
Data Engineering Snowflake, Databricks, dbt, Airflow (Cosmos), Microsoft Fabric, Medallion Architecture
Cloud & DevOps Azure, Docker, GitHub Actions, CI/CD, Infrastructure as Code
Visualization & BI Power BI (DAX/DirectLake), Strategic Narratives, React, GxP-compliant UI/UX

🧪 Featured Production-Scale Projects

End-to-End Orchestrated Pipeline (Airflow + dbt + Snowflake + Power BI)

  • Orchestration & DevOps: Engineered a fully containerized (Docker) stack using Apache Airflow and Astronomer Cosmos to dynamically render dbt models as an integrated DAG with granular observability.
  • Medallion Transformation: Implemented a three-tier architecture in Snowflake, utilizing dbt for schema enforcement and a "Left Anti-Join" strategy in the Silver layer to isolate high-intent lost opportunities for re-targeting.
  • Data Contracts: Hardened the pipeline with automated dbt data tests (Unique, Not_Null) and asset lineage tracking to ensure "Zero-Defect" reporting for financial stakeholders.
  • Prescriptive BI: Developed an executive dashboard featuring a Strategic Channel Efficiency Matrix and a dynamic DAX-driven narrative engine that provides real-time investment recommendations based on portfolio variance.

Industrial Digital Twin & GxP-Compliant HMI

  • Architecture: Engineered a Decoupled Triad Architecture (React HMI ↔ Python SCADA Engine ↔ .NET 10 Compliance Service) to bridge historical data with live operational compliance.
  • Innovation: Integrated a physics-based Digital Twin linking Impeller RPM to Oxygen Transfer and predictive temperature modeling via moving-window linear regression.
  • Compliance: Built a dedicated .NET microservice for immutable audit trails (21 CFR Part 11) and a React-based HMI featuring deterministic pulsing alarms.

Enterprise Data Engineering & Power BI Analytics (Microsoft Fabric)

  • Business Impact: Identified €4.9M in potential cost recovery (81,744 MWh efficiency risk) through advanced Power BI (DAX) and Star-Schema modeling.
  • Scale: Implemented a full Medallion Architecture (Bronze/Silver/Gold) on OneLake using PySpark to process and transform millions of rows of operational telemetry.
  • Visualization: Developed executive-level Power BI dashboards utilizing calculation groups and DirectLake mode to track production vs. demand with sub-second interactivity.
  • Governance: Established a "Single Source of Truth" by computing KPIs upstream in Spark, secured via RLS/OLS, and automated through Fabric CI/CD pipelines.

End-to-End Medallion Pipeline (Databricks + dbt + Airflow + Power BI)

  • Architecture & Governance: Engineered a Medallion Pipeline (Bronze → Silver → Gold) in Databricks Unity Catalog, refactoring 180k+ rows of nested telemetry into an optimized Star Schema to migrate business logic from DAX to the Warehouse.
  • Analytics Engineering: Leveraged dbt-databricks to implement MD5 surrogate keys and "Inferred Dimensions," ensuring 100% referential integrity and a 40% increase in downstream report performance.
  • Orchestration & DevOps: Developed a Python-based Apache Airflow DAG for lifecycle management and implemented GitHub Actions CI/CD with encrypted Secret Management to automate code validation and secure cloud authentication.
  • Strategic BI Layer: Developed an integrated Power BI Executive Dashboard featuring prescriptive financial modeling (Waterfall Profit Bridge) and sub-second "Drill Down" capabilities into granular shipment-item details.
  • Quality Engineering: Hardened the platform with dbt 2.0 relationship tests and automated schema validation to eliminate data "leakage" and ensure "Zero-Defect" reporting for stakeholders.

🤝 Let's Connect & Collaborate

I’m always looking to bridge the gap between complex industrial data and actionable intelligence. Whether you want to talk IIoT architecture or piano sonatas, let’s chat!

  • Architecture Deep-Dives: Ask me about Digital Twins, Medallion Lakehouses (Fabric/Databricks), or maintaining GxP integrity in automated pipelines.
  • Aerospace & GNSS: I am actively seeking collaborations on GNSS signal processing and CubeSat ground segment telemetry.
  • Professional Hubs: * LinkedIn — For industry networking and architecture discussions.
    • Twitter/X — For tech updates and real-time insights.

⚡ Fun Fact

I balance the logic of data with the rhythm of music—I love cycling, urban hiking, singing, leading choral groups and playing the piano. Whether in a data pipeline or a piano sonata, "timing" is everything!

Pinned Loading

  1. conversion-stream-pipeline conversion-stream-pipeline Public

    Banking Digital Sales Analytics Pipeline End-to-End Orchestrated Pipeline (Airflow + dbt + Snowflake + Power BI)

    Python

  2. bioprocess-insights-platform bioprocess-insights-platform Public

    A Real-time Bioreactor Monitoring Dashboard for Industrial Fermentation

    TypeScript

  3. energy-analytics-fabric-bi energy-analytics-fabric-bi Public

    This repository contains a set of three projects that showcase modern data engineering, reporting, and governance capabilities using Microsoft Fabric. The work is structured to represent the respon…

    Python

  4. nexus-supply-chain-ops nexus-supply-chain-ops Public

    An end-to-end ELT pipeline refactoring raw supply chain data into a high-performance Star Schema using dbt-fusion and Databricks. Engineered to drive granular shipping performance analytics and tim…

    SQL

  5. Stock-Data-Analytics Stock-Data-Analytics Public

    IBM-Python-Project-for-Data-Science

    Jupyter Notebook 1

  6. Decision-trees-for-Scandinavian-Cuisines-Prediction Decision-trees-for-Scandinavian-Cuisines-Prediction Public

    Machine Learning on IBM Skills Network Labs. A place for you to practice the data science, machine learning, and AI skills you’re learning in your online courses. You have access to JupyterLab, Zep…

    Jupyter Notebook 1