Town Hall History
A list of previous Town Halls, with the planned agenda and recording for each meeting.
01/26/2023
Agenda
- What’s to Come - Q1 2023 Roadmap: Data Products, Data Contracts and more
 - Community Case Study - Notion: Automating annotations and metadata propagation
 - Community Contribution - Grab: Improvements to documentation editing
 - Simplifying DataHub - Removing Schema Registry requirement and introducing DataHub Lite
 
01/05/2023
Agenda
- DataHub Community: 2022 in Review - Our Community of Data Practitioners is one of a kind. We’ll take the time to celebrate who we are, what we’ve built, and how we’ve collaborated in the past 12 months.
 - Search Improvements - Learn how we’re making the Search experience smarter and faster to connect you with the most relevant resources during data discovery.
 - Removing Schema Registry Requirement - Hear all about ongoing work to simplify the DataHub deployment process.
 - Smart Data Profiling - We’re making big improvements to data profiling! Smart data profiling will reduce processing time by only scanning datasets that have recently changed.
 - Sneak Peek: Time-based Lineage - Get a preview of how you’ll soon be able to trace lineage between datasets across different points in time to understand how interdependencies have evolved.
 - Sneak Peek: Chrome Extension - Soon, you’ll be able to quickly access rich metadata from DataHub while exploring resources in Looker via our upcoming Chrome Extension.
 
12/01/2022
Agenda
November Town Hall (in December!)
- Community Case Study - The Pinterest Team will share how they have integrated DataHub + Thrift and extended the Metadata Model with a Data Element entity to capture semantic types.
 - NEW! Ingestion Quickstart Guides - DataHub newbies, this one is for you! We’re rolling out ingestion quickstart guides to help you quickly get up and running with DataHub + Snowflake, BigQuery, and more!
 - NEW! In-App Product Tours - We’re making it easier than ever for end-users to get familiar with all that DataHub has to offer - hear all about the in-product onboarding resources we’re rolling out soon!
 - DataHub UI Navigation and Performance - Learn all about upcoming changes to our user experience to make it easier (and faster!) for end users to work within DataHub.
 - Sneak Peek! Manual Lineage via the UI - The Community asked and we’re delivering! Soon you’ll be able to manually add lineage connections between Entities in DataHub.
 - NEW! Slack + Microsoft Teams Integrations - Send automated alerts to Slack and/or Teams to keep track of critical events and changes within DataHub.
 - Hacktoberfest Winners Announced - We’ll recap this year’s Hacktoberfest and announce three winners of a $250 Amazon gift card & DataHub Swag.
 
10/27/2022
Agenda
- Conquer Data Governance with Acryl Data’s Metadata Tests - Learn how to tackle Data Governance with incremental, automation-driven governance using Metadata Tests provided in Acryl Data’s managed DataHub offering
 - Community Case Study - The Grab Team shares how they are using DataHub for data discoverability, automated classification and governance workflows, data quality observability, and beyond!
 - Upcoming Ingestion Sources - We’ll tell you the ins and outs of our upcoming dbt Cloud and Unity Catalog connectors
 - Sneak Peek! Saved Views - Learn how you can soon use Saved Views to help end-users navigate entities in DataHub with more precision and focus
 - Performance Improvements - Hear about the latest upgrades to DataHub performance
 
9/29/2022
Agenda
- Column Level Lineage is here! - Demo of column-level lineage and impact analysis in the DataHub UI
 - Community Case Study - The Stripe Team shares how they leverage DataHub to power observability within their Airflow-based ecosystem
 - Sneak Peek! Automated PII Classification - Preview upcoming functionality to automatically identify data fields that likely contain sensitive data
 - Ingestion Improvements Galore - Improved performance and functionality for dbt, Looker, Tableau, and Presto ingestion sources
 
8/25/2022
Agenda
- Community Case Study - The Etsy Team shares their journey of adopting DataHub
 - Looker & DataHub Improvements - surface the most relevant Looks and Dashboards
 - Home Page Improvements to tailor the Browse experience
 - Unified Ingestion Summaries - View live logs for UI-based ingestion and see historical ingestion reports across CLI and UI-based ingestion
 - Patch Support - Native support for PATCH in the metadata protocol to support efficient updates to add & remove owners, lineage, tags and more
 - Sneak Peek! Advanced Search
 
7/28/2022
Agenda
- Community Updates
 - Project Updates
 - Improvements to UI-Based Ingestion
 - Sneak Preview - Bulk Edits via the UI
 - Streamlined Metadata Ingestion
 - DataHub 201: Metadata Enrichment
 
6/30/2022
Agenda
- Community Updates
 - Project Updates
 - dbt Integration Updates
 - CSV Ingestion Support
 - DataHub 201 - Glossary Term Deep Dive
 
5/26/2022
Agenda
- Community Case Study: Hear how the G-Research team is using Cassandra as DataHub’s Backend
 - Creating & Editing Glossary Terms from the DataHub UI
 - DataHub User Onboarding via the UI
 - DataHub 201: Impact Analysis
 - Sneak Peek: Data Reliability with DataHub
 - Metadata Day Hackathon Winners
 
4/28/2022
Agenda
- Community Case Study: Hear from Included Health about how they are embedding external tools into the DataHub UI
 - New! Actions Framework: run custom code when changes happen within DataHub
 - UI Refresh for ML Entities
 - Improved deletion support for time-series aspects, tags, terms, & more
 - OpenAPI Improvements
 
3/31/2022
Agenda
- Community Case Study: Hear from Zendesk about how they are applying “shift left” principles by authoring metadata in their Protobuf schemas
 - RBAC Functionality: View-Based Policies
 - Schema Version History - surfacing the history of schema changes in DataHub's UI
 - Improvements to Airflow Ingestion, including Run History
 - Container/Domain-Level Property Inheritance
 - Delete API
 
2/25/2022
Agenda
- Lineage Impact Analysis - using DataHub to understand the impact of changes on downstream dependencies
 - Displaying Data Quality Checks in the UI
 - Roadmap update: Schema Version History & Column-Level Lineage
 - Community Case Study: Managing Lineage via YAML
 
1/28/2022
Agenda
- Community & Roadmap Updates by Maggie Hays (Acryl Data)
 - Project Updates by Shirshanka Das (Acryl Data)
 - Community Case Study: Adding Dataset Transformers by Eric Cooklin (Stash)
 - Demo: Data Domains & Containers by John Joyce (Acryl Data)
 - DataHub Basics — Data Profiling & Usage Stats 101 by Maggie Hays & Tamás Németh (Acryl Data)
 - Demo: Spark Lineage by Mugdha Hardikar (GS Lab) & Shirshanka Das
 
12/17/2021
Agenda
- Community & Roadmap Updates by Maggie Hays (Acryl Data)
 - Project Updates by Shirshanka Das (Acryl Data)
 - 2021 DataHub Community in Review by Maggie Hays
 - DataHub Basics -- Users, Groups, & Authentication 101 by Pedro Silva (Acryl Data)
 - Sneak Peek: UI-Based Ingestion by John Joyce (Acryl Data)
 - Case Study — DataHub at Grofers by Shubham Gupta
 - Top DataHub Contributors of 2021 - Maggie Hays
 - Final Surprise! We Interviewed a 10yo and a 70yo about DataHub
 
11/19/2021
Agenda
- Community & Roadmap Updates by Maggie Hays (Acryl Data)
 - Project Updates by Shirshanka Das (Acryl Data)
 - DataHub Basics -- Lineage 101 by John Joyce & Surya Lanka (Acryl Data)
 - Introducing No-Code UI by Gabe Lyons & Shirshanka Das (Acryl Data)
 - DataHub API Authentication by John Joyce (Acryl Data)
 - Case Study: LinkedIn pilot to extend the OSS UI by Aikepaer Abuduweili & Joshua Shinavier
 
10/29/2021
Agenda
- DataHub Community & Roadmap Update - Maggie Hays (Acryl Data)
 - October Project Updates - Shirshanka Das (Acryl Data)
 - Introducing Recommendations - John Joyce & Dexter Lee (Acryl Data)
 - Case Study: DataHub @ hipages - Chris Coulson (hipages)
 - Data Profiling Improvements - Surya Lanka & Harshal Sheth (Acryl Data)
 - Lineage Improvements & BigQuery Dataset Lineage by Gabe Lyons & Varun Bharill (Acryl Data)
 
9/24/2021
Agenda
- Project Updates and Callouts by Shirshanka
- GraphQL Public API Announcement
 
 - Demo: Faceted Search by Gabe Lyons (Acryl Data)
 - Stateful Ingestion by Shirshanka Das & Surya Lanka (Acryl Data)
 - Case-Study: DataHub @ Adevinta by Martinez de Apellaniz
 - Recent Improvements to the Looker Connector by Shirshanka Das & Maggie Hays (Acryl Data)
 - Offline
- Foreign Key and Related Term Mapping by Gabe Lyons (Acryl Data) (video)
 
 
8/27/2021
Agenda
- Project Updates and Callouts by Shirshanka
- Business Glossary Demo
 - 0.8.12 Upcoming Release Highlights
 - Users and Groups Management (Okta, Azure AD)
 
 - Demo: Fine Grained Access Control by John Joyce (Acryl Data)
 - Community Case-Study: DataHub @ Warung Pintar and Redash integration by Taufiq Ibrahim (Bizzy Group)
 - New User Experience by John Joyce (Acryl Data)
 - Offline
- Performance Monitoring by Dexter Lee (Acryl Data) (video)
 
 
7/23/2021
Agenda
- Project Updates by Shirshanka
- Release highlights
 
 - Deep Dive: Data Observability: Phase 1 by Harshal Sheth, Dexter Lee (Acryl Data)
 - Case Study: Building User Feedback into DataHub by Melinda Cardenas (NY Times)
 - Demo: AWS SageMaker integration for Models and Features by Kevin Hu (Acryl Data)
 
6/25/2021
Agenda
- Project Updates by Shirshanka
- Release notes
 - RBAC update
 - Roadmap for H2 2021
 
 - Demo: Table Popularity powered by Query Activity by Harshal Sheth (Acryl Data)
 - Case Study: Business Glossary in production at Saxo Bank by Sheetal Pratik (Saxo Bank), Madhu Podila (ThoughtWorks)
 - Developer Session: Simplified Deployment for DataHub by John Joyce, Gabe Lyons (Acryl Data)
 
5/27/2021
Agenda
- Project Updates by Shirshanka - 10 mins
- 0.8.0 Release
 - AWS Recipe by Dexter Lee (Acryl Data)
 
 - Demo: Product Analytics design sprint (Maggie Hays (SpotHero), Dexter Lee (Acryl Data)) - 10 mins
 - Use-Case: DataHub on GCP by Sharath Chandra (Confluent) - 10 mins
 - Deep Dive: No Code Metadata Engine by John Joyce (Acryl Data) - 20 mins
 - General Q&A and closing remarks
 
4/23/2021
Agenda
- Welcome - 5 mins
 - Project Updates by Shirshanka - 10 mins
- 0.7.1 Release and callouts (dbt by Gary Lucas)
 - Product Analytics design sprint announcement (Maggie Hays)
 
 - Use-Case: DataHub at DefinedCrowd (video) by Pedro Silva - 15 mins
 - Deep Dive + Demo: Lineage! Airflow, Superset integration (video) by Harshal Sheth and Gabe Lyons - 10 mins
 - Use-Case: DataHub Hackathon at Depop (video) by John Cragg - 10 mins
 - Observability Feedback share out - 5 mins
 - General Q&A and closing remarks - 5 mins
 
3/19/2021
Agenda
- Welcome - 5 mins
 - Project Updates (slides) by Shirshanka - 10 mins
- 0.7.0 Release
 - Project Roadmap
 
 - Demo Time: Themes and Tags in the React App! by Gabe Lyons - 10 mins
 - Use-Case: DataHub at Wolt (slides) by Fredrik and Matti - 15 mins
 - Poll Time: Observability Mocks! (slides) - 5 mins
 - General Q&A from sign up sheet, slack, and participants - 10 mins
 - Closing remarks - 5 mins
 
2/19/2021
Agenda
- Welcome - 5 mins
 - Latest React App Demo! (video) by John Joyce and Gabe Lyons - 5 mins
 - Use-Case: DataHub at Geotab (slides,video) by John Yoon - 15 mins
 - Tech Deep Dive: Tour of new pull-based Python Ingestion scripts (slides,video) by Harshal Sheth - 15 mins
 - General Q&A from sign up sheet, slack, and participants - 15 mins
 - Closing remarks - 5 mins
 
1/15/2021
Agenda
- Announcements - 2 mins
 - Community Updates (video) - 10 mins
 - Use-Case: DataHub at Viasat (slides,video) by Anna Kepler - 15 mins
 - Tech Deep Dive: GraphQL + React RFCs readout and discussion (slides,video) by John Joyce and Arun Vasudevan - 15 mins
 - General Q&A from sign up sheet, slack, and participants - 15 mins
 - Closing remarks - 3 mins
 
12/04/2020
Agenda
- Quick intro - 5 mins
 - Why did Grofers choose DataHub for their data catalog? by Shubham Gupta - 15 minutes
 - DataHub UI development - Part 2 by Charlie Tran (LinkedIn) - 20 minutes
 - General Q&A from sign up sheet, slack, and participants - 15 mins
 - Closing remarks - 5 minutes
 
11/06/2020
Agenda
- Quick intro - 5 mins
 - Lightning talk on Metadata use-cases at LinkedIn by Shirshanka Das (LinkedIn) - 5 mins
 - Strongly Consistent Secondary Index (SCSI) in GMA, an upcoming feature by Jyoti Wadhwani (LinkedIn) - 15 minutes
 - DataHub UI overview by Ignacio Bona (LinkedIn) - 20 minutes
 - General Q&A from sign up sheet, slack, and participants - 10 mins
 - Closing remarks - 5 minutes
 
09/25/2020
Agenda
- Quick intro - 5 mins
 - Data Discoverability at SpotHero by Maggie Hays (SpotHero) - 20 mins
 - Designing the next generation of metadata events for scale by Chris Lee (LinkedIn) - 15 mins
 - General Q&A from sign up sheet, slack, and participants - 15 mins
 - Closing remarks - 5 mins
 
08/28/2020
Agenda
- Quick intro - 5 mins
 - Data Governance look for a Digital Bank by Sheetal Pratik (Saxo Bank) - 20 mins
 - Column level lineage for datasets demo by Nagarjuna Kanamarlapudi (LinkedIn) - 15 mins
 - General Q&A from sign up sheet and participants - 15 mins
 - Closing remarks - 5 mins
 
07/31/2020
Agenda
- Quick intro - 5 mins
 - Showcasing new entities onboarded to internal LinkedIn DataHub (Data Concepts, Schemas) by Nagarjuna Kanamarlapudi (LinkedIn) - 15 mins
 - Showcasing new Lineage UI in internal LinkedIn DataHub by Ignacio Bona (LinkedIn) - 10 mins
 - New RFC Process by John Plaisted (LinkedIn) - 2 mins
 - Answering questions from the signup sheet - 13 mins
 - Questions from the participants - 10 mins
 - Closing remarks - 5 mins
 
06/26/2020
Agenda
- Quick intro - 5 mins
 - Onboarding Data Process entity by Liangjun Jiang (Expedia) - 15 mins
 - How to onboard a new relationship to metadata graph by Kerem Sahin (LinkedIn) - 15 mins
 - Answering questions from the signup sheet - 15 mins
 - Questions from the participants - 10 mins
 - Closing remarks - 5 mins
 
05/29/2020
Agenda
- Quick intro - 5 mins
 - How to add a new aspect/feature for an existing entity in UI by Charlie Tran (LinkedIn) - 10 mins
 - How to search over a new field by Jyoti Wadhwani (LinkedIn) - 10 mins
 - Answering questions from the signup sheet - 15 mins
 - Questions from the participants - 10 mins
 - Closing remarks - 5 mins
 
04/17/2020
Agenda
- Quick intro - 5 mins
 - DataHub Journey with Expedia Group by Arun Vasudevan (Expedia) - 10 mins
 - Deploying DataHub using Nix by Larry Luo (Shanghai HuaRui Bank) - 10 mins
 - Answering questions from the signup sheet - 15 mins
 - Questions from the participants - 10 mins
 - Closing remarks - 5 mins
 
04/03/2020
Agenda
- Quick intro - 5 mins
 - Creating Helm charts for deploying DataHub on Kubernetes by Bharat Akkinepalli (ThoughtWorks) - 10 mins
 - How to onboard a new metadata aspect by Mars Lan (LinkedIn) - 10 mins
 - Answering questions from the signup sheet - 15 mins
 - Questions from the participants - 10 mins
 - Closing remarks - 5 mins
 
 
03/20/2020
Agenda
- Quick intro - 5 mins
 - Internal DataHub demo - 10 mins
 - What's coming up next for DataHub (what roadmap items we are working on) - 10 mins
 - Answering questions from the signup sheet - 15 mins
 - Questions from the participants - 10 mins
 - Closing remarks - 5 mins