Blog Article
March 25, 2025
Shift Left Data Conference
March 30, 2025
Data Quality
March 5, 2024
August 14, 2024
Videos
Watch a short video demonstrating how gable prevents PII from being shared with a large language model such as those from OpenAI
May 30, 2025
Learn how to assess and improve your data platform with a data platform maturity model to drive business growth and innovation.
May 23, 2025
Shift‑left governance that keeps downstream stakeholders safe and productive.
May 19, 2025
Due to the growing industry needs for high quality data, data platform modernization is going from "would be nice" to "need to do now." Learn exactly why.
April 28, 2025
The rapid rise of AI has dramatically elevated the value and strategic importance of data, transforming how upstream software engineers perceive and interact with data workflows. In this expert-led panel, industry leaders will share their experiences and insights into effectively bridging the gap between data teams and software engineers. They will discuss practical strategies for proactively managing data infrastructure, enhancing collaboration, and ensuring high-quality data to support advanced AI-driven development initiatives.
April 1, 2025
Good Data and not Big Data is becoming more important in today's ecosystem. Machine Learning models rely on good quality data to make their model training more efficient and effective. We have traditionally applied Data Quality checks and balances in manual, centralized way, putting a lot of onus on our customers. Shifting Left Data Quality will bring the data quality checks closer to where data is being created, while preventing bad data from flowing downstream. Also auto-detecting, recommending and auto-enforcing data quality rules will make our customers job easier, while creating a more mature and robust data ecosystem.
In healthcare technology, protecting patient privacy while scaling data operations requires reimagining where quality and governance live. This presentation explores Helix's journey of shifting critical processes left in its precision medicine business—from implementing automated data classification and privacy workflows to enlisting cross-functional expertise in refining operational workflows. For clinical data management, we've partnered with healthcare systems to implement OMOP standards and data contracts at the source, creating a robust foundation for research and commercial opportunities. Through practical examples, we'll demonstrate how this upstream approach has transformed our data operations, encouraged internal alignment, and strengthened partner relationships.
Data teams increasingly embrace software engineering practices to address quality and integration challenges, yet friction remains between software and data teams. This talk explores why standard practices alone aren’t enough and introduces the concept of the “Data-Conscious Software Engineer,” an emerging role critical to bridging these organizational divides. Attendees will learn how identifying and empowering engineers who deeply understand both software development and data workflows can foster stronger collaboration, improve data quality, and drive organizational change toward treating data as a strategic asset.
High-quality, governed, and performant data from the outset is vital for agile, trustworthy enterprise AI systems. Traditional approaches delay addressing data quality and governance, causing inefficiencies and rework. Apache Iceberg, a modern table format for data lakes, empowers organizations to "Shift Left" by integrating data management best practices earlier in the pipeline to enable successful AI systems. This session covers how Iceberg's schema evolution, time travel, ACID transactions, and Git-like data branching allow teams to validate, version, and optimize data at its source. Attendees will learn to create resilient, reusable data assets, streamline engineering workflows, enforce governance efficiently, and reduce late-stage transformations—accelerating analytics, machine learning, and AI initiatives.