{"id":2673,"date":"2022-01-27T18:25:26","date_gmt":"2022-01-28T02:25:26","guid":{"rendered":"https:\/\/live-cometml.pantheonsite.io\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/"},"modified":"2022-01-27T18:25:26","modified_gmt":"2022-01-28T02:25:26","slug":"comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22","status":"publish","type":"post","link":"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/","title":{"rendered":"7 Simple Steps to Standardizing the ML Experiment (Jan. 26)"},"content":{"rendered":"\n<p>Welcome to another recap of the Comet ML Office Hours, powered by <a href=\"https:\/\/theartistsofdatascience.fireside.fm\/\">The Artists of Data Science<\/a>! This week we&#8217;re covering Session 4 of our new series. This session took place Jan. 26th, 2022 and we were joined by Jimmy Whitaker of Pachyderm, Dr. Abe Gong of Great Expectations, and Data Governance Analyst and Heartbeat contributor Matt Blasa.<\/p>\n\n\n<hr class=\"wp-block-separator\" \/>\n\n\n<p>As a reminder, we\u2019d love to see any and all of you at these fifty minute sessions\u2014so feel free to <a href=\"https:\/\/comet-ml.zoom.us\/meeting\/register\/tZItf-urrzMoE9N-HWSIK68qfI1eDB_jlUeN\">register for upcoming Office Hours sessions here<\/a>! As always, there&#8217;s a lot more in the full session (which you can <a href=\"https:\/\/youtu.be\/AAScd3pZBs4\">find on our YouTube channel<\/a>), so be sure to check it out, alongside clips from roundtables, webinars, and previous Office Hours.<\/p>\n\n\n<hr class=\"wp-block-separator\" \/>\n\n\n<h2 class=\"wp-block-heading\">All About Data<\/h2>\n\n\n\n<p>This session was all about data &#8211; understanding, validating, versioning, and engineering it. Because without good data, your model won&#8217;t perform optimally and could disappoint down the road.<\/p>\n\n\n\n<p>Matt Blasa of Brinks Home Security discussed his role as a Data Governance Analyst and how that plays into the larger ML experiment pipeline.<\/p>\n<p><iframe loading=\"lazy\" title=\"Comet Office Hours: Matt Blasa on Data Governance\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/U_FFZTZ6rEQ?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n\n\n<hr class=\"wp-block-separator\" \/>\n\n\n<h2 class=\"wp-block-heading\">Data Curation<\/h2>\n\n\n\n<p>One thing that did come up in the course of the data discussion was the idea of &#8220;Data Curators.&#8221; This was a new term for Harpreet and inspired Dr. Abe Gong to bring up Emilie Schario&#8217;s suggestion of taking the term &#8220;Data Scientist&#8221; and exploding it into multiple job titles.<\/p>\n\n\n\n<p>Hear more from Dr. Gong in the clip below.<\/p>\n<p><iframe loading=\"lazy\" title=\"Comet Office Hours: Dr. Abe Gong on Data Curators\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/zenKGD633nk?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n\n\n<hr class=\"wp-block-separator\" \/>\n\n\n<h2 class=\"wp-block-heading\">Cautionary Tales &#8211; And Plenty of Them<\/h2>\n\n\n\n<p>Throughout the discussion a number of stories came up about &#8220;upside down models&#8221; and what happens if you don&#8217;t properly version your data.<\/p>\n\n\n\n<p>One story that came up from Jimmy Whitaker was about digits in NLP models. Specifically, a transcription model that was outputting different things for dates &#8211; sometimes written as digits and sometimes written out as words.<\/p>\n\n\n\n<p>Check out the clip below for the full story:<\/p>\n<p><iframe loading=\"lazy\" title=\"Comet Office Hours  Jimmy Whitaker on Data Versioning\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/gopjQRexlss?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n\n\n<hr class=\"wp-block-separator\" \/>\n\n\n<h2 class=\"wp-block-heading\">Resources<\/h2>\n\n\n\n<p>Make sure to read Matt&#8217;s articles on <a href=\"https:\/\/heartbeat.comet.ml\/integrating-comet-and-azure-databricks-4ec97703a2fe\">Heartbeat<\/a>.<\/p>\n\n\n\n<p>As always, there&#8217;s more to be discussed and discovered. Check out Comet&#8217;s contributor-led publication <a href=\"https:\/\/heartbeat.comet.ml\/\">Heartbeat<\/a> as well as our YouTube, Twitter, and LinkedIn for more great information.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/heartbeat.comet.ml\/comet-office-hours-bookshelf-a463abdf4a5e\">Comet Office Hours Bookshelf<\/a><\/li>\n<li>Office Hours <a href=\"https:\/\/www.youtube.com\/playlist?list=PLX9GmL8cVn_wkVp7g942xiHrKrT_zVpLU\">YouTube Playlist<\/a><\/li>\n<li>Follow Comet ML on <a href=\"http:\/\/twitter.com\/Cometml\">Twitter<\/a><\/li>\n<li>Connect with Comet on <a href=\"https:\/\/www.linkedin.com\/company\/comet-ml\">LinkedIn<\/a><\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\">Curious to Learn more? Join Us!<\/h2>\n\n\n\n<p>We run these virtual Office Hours every Wednesday at 11am EST (New York, NY). Completely free to attend and participate, and we&#8217;d love to see any and all of you there! We&#8217;ve got a great series planned and welcome questions for Harpreet or any of our guests via email to emilie@comet.ml.<\/p>\n\n\n\n<p>But most importantly:<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><a href=\"http:\/\/bit.ly\/comet-ml-oh\">Register for Comet Office Hours<\/a><\/h3>\n\n\n\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Notes from the third session of a brand new Office Hours series: Seven Simple Steps to Standardizing the Experiment discussing data with guests Jimmy Whitaker, Dr. Abe Gong, and Matt Blasa.<\/p>\n","protected":false},"author":112,"featured_media":2671,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"customer_name":"","customer_description":"","customer_industry":"","customer_technologies":"","customer_logo":"","footnotes":""},"categories":[13],"tags":[],"coauthors":[131],"class_list":["post-2673","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-office-hours"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v25.9 (Yoast SEO v25.9) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>7 Simple Steps to Standardizing the ML Experiment (Jan. 26) - Comet<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"7 Simple Steps to Standardizing the ML Experiment (Jan. 26)\" \/>\n<meta property=\"og:description\" content=\"Notes from the third session of a brand new Office Hours series: Seven Simple Steps to Standardizing the Experiment discussing data with guests Jimmy Whitaker, Dr. Abe Gong, and Matt Blasa.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/\" \/>\n<meta property=\"og:site_name\" content=\"Comet\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/cometdotml\" \/>\n<meta property=\"article:published_time\" content=\"2022-01-28T02:25:26+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.comet.com\/site\/wp-content\/uploads\/2022\/06\/NoNames_Comet-Office-Hours-LI_TW.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"628\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Claire Pena\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@Cometml\" \/>\n<meta name=\"twitter:site\" content=\"@Cometml\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Claire Pena\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"7 Simple Steps to Standardizing the ML Experiment (Jan. 26) - Comet","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/","og_locale":"en_US","og_type":"article","og_title":"7 Simple Steps to Standardizing the ML Experiment (Jan. 26)","og_description":"Notes from the third session of a brand new Office Hours series: Seven Simple Steps to Standardizing the Experiment discussing data with guests Jimmy Whitaker, Dr. Abe Gong, and Matt Blasa.","og_url":"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/","og_site_name":"Comet","article_publisher":"https:\/\/www.facebook.com\/cometdotml","article_published_time":"2022-01-28T02:25:26+00:00","og_image":[{"width":1200,"height":628,"url":"https:\/\/www.comet.com\/site\/wp-content\/uploads\/2022\/06\/NoNames_Comet-Office-Hours-LI_TW.jpg","type":"image\/jpeg"}],"author":"Claire Pena","twitter_card":"summary_large_image","twitter_creator":"@Cometml","twitter_site":"@Cometml","twitter_misc":{"Written by":"Claire Pena","Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/#article","isPartOf":{"@id":"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/"},"author":{"name":"Claire Pena","@id":"https:\/\/www.comet.com\/site\/#\/schema\/person\/b73b3ffc304cf8bec8866340329c5e89"},"headline":"7 Simple Steps to Standardizing the ML Experiment (Jan. 26)","datePublished":"2022-01-28T02:25:26+00:00","mainEntityOfPage":{"@id":"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/"},"wordCount":465,"publisher":{"@id":"https:\/\/www.comet.com\/site\/#organization"},"image":{"@id":"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/#primaryimage"},"thumbnailUrl":"https:\/\/www.comet.com\/site\/wp-content\/uploads\/2022\/06\/NoNames_Comet-Office-Hours-LI_TW.jpg","articleSection":["Office Hours"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/","url":"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/","name":"7 Simple Steps to Standardizing the ML Experiment (Jan. 26) - Comet","isPartOf":{"@id":"https:\/\/www.comet.com\/site\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/#primaryimage"},"image":{"@id":"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/#primaryimage"},"thumbnailUrl":"https:\/\/www.comet.com\/site\/wp-content\/uploads\/2022\/06\/NoNames_Comet-Office-Hours-LI_TW.jpg","datePublished":"2022-01-28T02:25:26+00:00","breadcrumb":{"@id":"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/#primaryimage","url":"https:\/\/www.comet.com\/site\/wp-content\/uploads\/2022\/06\/NoNames_Comet-Office-Hours-LI_TW.jpg","contentUrl":"https:\/\/www.comet.com\/site\/wp-content\/uploads\/2022\/06\/NoNames_Comet-Office-Hours-LI_TW.jpg","width":1200,"height":628,"caption":"Machine Learing Process Hosted by Harpreet Sahota"},{"@type":"BreadcrumbList","@id":"https:\/\/www.comet.com\/site\/blog\/comet-office-hoursseven-simple-steps-to-standardizing-the-ml-experiment1-26-22\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.comet.com\/site\/"},{"@type":"ListItem","position":2,"name":"7 Simple Steps to Standardizing the ML Experiment (Jan. 26)"}]},{"@type":"WebSite","@id":"https:\/\/www.comet.com\/site\/#website","url":"https:\/\/www.comet.com\/site\/","name":"Comet","description":"Build Better Models Faster","publisher":{"@id":"https:\/\/www.comet.com\/site\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.comet.com\/site\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.comet.com\/site\/#organization","name":"Comet ML, Inc.","alternateName":"Comet","url":"https:\/\/www.comet.com\/site\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.comet.com\/site\/#\/schema\/logo\/image\/","url":"https:\/\/www.comet.com\/site\/wp-content\/uploads\/2025\/01\/logo_comet_square.png","contentUrl":"https:\/\/www.comet.com\/site\/wp-content\/uploads\/2025\/01\/logo_comet_square.png","width":310,"height":310,"caption":"Comet ML, Inc."},"image":{"@id":"https:\/\/www.comet.com\/site\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/cometdotml","https:\/\/x.com\/Cometml","https:\/\/www.youtube.com\/channel\/UCmN63HKvfXSCS-UwVwmK8Hw"]},{"@type":"Person","@id":"https:\/\/www.comet.com\/site\/#\/schema\/person\/b73b3ffc304cf8bec8866340329c5e89","name":"Claire Pena","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.comet.com\/site\/#\/schema\/person\/image\/6c42de20d82274b5bcc55f12d2480401","url":"https:\/\/secure.gravatar.com\/avatar\/0158b496f72fba29753917da405441fa923b21dec99134ee8818143fc4113fe4?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/0158b496f72fba29753917da405441fa923b21dec99134ee8818143fc4113fe4?s=96&d=mm&r=g","caption":"Claire Pena"},"url":"https:\/\/www.comet.com\/site\/blog\/author\/clairep\/"}]}},"_links":{"self":[{"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/posts\/2673","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/users\/112"}],"replies":[{"embeddable":true,"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/comments?post=2673"}],"version-history":[{"count":0,"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/posts\/2673\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/media\/2671"}],"wp:attachment":[{"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/media?parent=2673"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/categories?post=2673"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/tags?post=2673"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/coauthors?post=2673"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}