Latest Jocelyn Goldfein News
May 17, 2023
Mona & Max We have some exciting news to share - we’ve closed our Seed round with .406 Ventures, Zetta Venture Partners , Y Combinator , and some exceptional angel investors like, Manish Sood (Founder and CTO, Reltio), Sam Ramji (Chief Strategy Officer, DataStax) and Venkat Varadachary (CEO, Zenon | Former EVP/CDAO @ Amex). We feel fortunate to partner with Graham Brooks and Jocelyn Goldfein on our journey. They are the best partners we could have asked for as a data company with an ML-first approach. This is a big milestone for our team. We wanted to take this moment to share our thoughts on Telmai’s opportunity, and our plans for the future. Telmai is a no-code real-time data quality analysis and monitoring platform. Our product automatically and proactively detects data quality issues as data is being ingested. Our human-in-the-loop machine learning engine enables Telmai users to define correct versus incorrect data -definitions that in turn enable proactive and autonomous monitoring and alerting of data quality issues. Company momentum Since inception in November 2020, we’ve been working hard to build visionary products and spread the word about our unique approach to data reliability. We have been fortunate to work with early design partners like Dun & Bradstreet and Myers-Holum, who share our vision for the future of data quality. Thanks to partners like them, and the thousands of prospective users we’ve been fortunate to interview, Telmai launched its MVP this month (October 2021). More important than our category-leading product is our world-class team. We have been fortunate to recruit a lean but exceptional technical group who share a deep belief in Telmai’s vision of the future. The next chapter This seed funding round has enabled us to grow our team and to continue building Telmai’s leading data-quality platform for the modern data stack. We are excited to continue this journey with an incredible group of institutional and angel investors who believe in our mission. Investor Quotes "As an investor focused on data and analytics for the past 15 years, our portfolio and our network of Chief Data Officers have continually highlighted data quality and trust as key challenges to realizing the promise of big data analytics and machine learning. The biggest challenge around data quality is the historic gap between engineering-focused data quality tools and business users, who understand the data, how it is being used and the expectations around data reliability. Telmai is not only eliminating that gap but also addressing both IT and business' most pressing needs in an infinitely-scalable, integrated data observability tool that establishes trust in data.” “The AI and machine learning platform shift is well under way, and ensuring data quality - efficiently, in production, and at scale - will be a prerequisite for every product and every business decision in the decades to come. We're delighted to partner with Telm.ai's founders, who acutely understand the challenge of maintaining high quality data and have built an industry leading product, itself powered by data and machine learning, to help their customers achieve it.” - Jocelyn Goldfein , Managing Director , Zetta Venture Partners We want to thank our customers and all the people who made it possible for us to reach this point – we are beyond excited to continue Telmai’s journey. Onwards 🚀 Data profiling helps organizations understand their data, identify issues and discrepancies, and improve data quality. It is an essential part of any data-related project and without it data quality could impact critical business decisions, customer trust, sales and financial opportunities. To get started, there are four main steps in building a complete and ongoing data profiling process: What are the different kinds of data profiling? Data profiling falls into three major categories: structure discovery, content discovery, and relationship discovery. While they all help in gaining more understanding of the data, the type of insights they provide are different: Structure discovery analyzes that data is consistent, formatted correctly, and well structured. For example, if you have a ‘Date’ field, structure discovery helps you see the various patterns of dates (e.g., YYYY-MM-DD or YYYY/DD/MM) so you can standardize your data into one format. Structure discovery also examines simple and basic statistics in the data, for example, minimum and maximum values, means, medians, and standard deviations. Content discovery looks more closely into the individual attributes and data values to check for data quality issues. This can help you find null values, empty fields, duplicates, incomplete values, outliers, and anomalies. For example, if you are profiling address information, content discovery helps you see whether your ‘State’ field contains the two-letter abbreviation or the fully spelled out city names, both, or potentially some typos. Content discovery can also be a way to validate databases with predefined rules. This process helps find ways to improve data quality by identifying instances where the data does not conform to predefined rules. For example, a transaction amount should never be less than $0. Relationship discovery discovers how different datasets are related to each other. For example, key relationships between database tables, or lookup cells in a spreadsheet. Understanding relationships is most critical in designing a new database schema, a data warehouse, or an ETL flow that requires joining tables and data sets based on those key relationships. 1. Data Collection Start with data collection. Gather data from various sources and extract it into a single location for analysis. If you have multiple sources, choose a centralized data profiling tool (see our recommendation in the conclusion) that can easily connect and analyze all your data without having you do any prep work. 2. Discovery & Analysis Now that you have collected your data for analysis, it's time to investigate it. Depending on your use case, you may need structure discovery, content discovery, relationship discovery, or all three. If data content or structure discovery is important for your use case, make sure that you collect and profile your data in its entirety and do not use samples as it will skew your results. Use visualizations to make your discovery and analysis more understandable. It is much easier to see outliers and anomalies in your data using graphs than in a table format. 3. Documenting the Findings Create a report or documentation outlining the results of the data profiling process, including any issues or discrepancies found. Use this step to establish data quality rules that you may not have been aware of. For example, a United States ZIP code of 94061 could have accidentally been typed in as 94 061 with a space in the middle. Documenting this issue could help you establish new rules for the next time you profile the data. 4. Data Quality Monitoring Now that you know what you have, the next step is to make sure you correct these issues. This may be something that you can correct or something that you need to flag for upstream data owners to fix. After your data profiling is done and the system goes live, your data quality assurance work is not done – in fact, it's just getting started. Data constantly changes. If unchecked, data quality defects will continue to occur, both as a result of system and user behavior changes. Automated tools can help you save time and resources and ensure accuracy in the process. Unfortunately, traditional data profiling tools offered by legacy ETL and database vendors are complex and require data engineering and technical skills. They also only handle data that is structured and ready for analysis. Semi-structured data sets, nested data formats, blob storage types, or streaming data do not have a place in those solutions. Today organizations that deal with complex data types or large amounts of data are looking for a newer, more scalable solution. That’s where a data observability tool like Telmai comes in. Telmai is built to handle the complexity that data profiling projects are faced with today. Some advantages include centralized profiling for all data types, a low-code no-code interface, ML insights, easy integration, and scale and performance.
Jocelyn Goldfein Investments
Jocelyn Goldfein has made 7 investments. Their latest investment was in StackShare as part of their Seed VC on December 31, 2016.
Jocelyn Goldfein Investments Activity
Seed VC - II