{"id":7458,"date":"2023-09-12T16:00:28","date_gmt":"2023-09-13T00:00:28","guid":{"rendered":"https:\/\/live-cometml.pantheonsite.io\/?p=7458"},"modified":"2025-04-24T17:14:10","modified_gmt":"2025-04-24T17:14:10","slug":"human-in-the-loop-machine-learning","status":"publish","type":"post","link":"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/","title":{"rendered":"Human-in-the-Loop Machine Learning"},"content":{"rendered":"\n<link rel=\"canonical\" href=\"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\">\n\n\n\n<div class=\"fh fi fj fk fl\">\n<div class=\"ab ca\">\n<div class=\"ch bg et eu ev ew\">\n<figure class=\"mj mk ml mm mn mo mg mh paragraph-image\">\n<figure><img loading=\"lazy\" decoding=\"async\" class=\"bg mp mq c\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/v2\/resize:fit:625\/0*49Eg8Z6-NJjLT8qA.jpg\" alt=\"\" width=\"625\" height=\"380\"><\/figure><div class=\"mg mh mi\"><picture><\/picture><\/div><figcaption class=\"mr ms mt mg mh mu mv be b bf z dv\" data-selectable-paragraph=\"\">Inca Knot Writing Quipu-<a class=\"af mw\" href=\"https:\/\/en.wikipedia.org\/wiki\/Quipu\" target=\"_blank\" rel=\"noopener ugc nofollow\">Image Source<\/a><\/figcaption><\/figure>\n<h2 id=\"5225\" class=\"mx my fo be mz na nb nc nd ne nf ng nh ni nj nk nl nm nn no np nq nr ns nt nu bj\" data-selectable-paragraph=\"\">Is it necessary for humans to take part in the machine learning cycle?<\/h2>\n<p id=\"4716\" class=\"pw-post-body-paragraph nv nw fo be b gm nx ny nz gp oa ob oc ni od oe of nm og oh oi nq oj ok ol om fh bj\" data-selectable-paragraph=\"\">Contrary to what the movies we watch show us, today\u2019s artificial intelligence (AI) cannot do everything and learn everything on its own. 
It relies to a large extent on the feedback it receives from people.<\/p>\n<p id=\"252d\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">I would guess that almost 80% of machine learning (ML) applications today are supervised learning models, and those applications cover a wide range of uses. For example, autonomous vehicles are trained on many data points labeled <em class=\"os\">\u2018pedestrian,\u2019<\/em> <em class=\"os\">\u2018moving vehicle,\u2019<\/em> or <em class=\"os\">\u2018lane markings\u2019<\/em> so they can transport you safely. Your home device understands you when you tell it to turn the <em class=\"os\">\u2018volume up,\u2019<\/em> and a machine translation app understands you when you speak to it in different languages. ML models must be trained on perhaps thousands of hours of data, or millions of data points, to reach this kind of performance.<\/p>\n<blockquote class=\"ot ou ov\"><p id=\"ed6a\" class=\"nv nw os be b gm on ny nz gp oo ob oc ow op oe of ox oq oh oi oy or ok ol om fh bj\" data-selectable-paragraph=\"\">Annotation and active learning are the first steps and the cornerstones of the human-in-the-loop approach in AI\/ML.<\/p><\/blockquote>\n<p id=\"2082\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">Human-in-the-loop (HITL) is a cycle in which people and ML models work together to build applications that make life easier. Ideally, you know how to get training data from people and can get human feedback on all your data. 
But when you don\u2019t have the budget or time for that, you must find other ways to choose the right data.<\/p>\n<p id=\"0c2a\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">Transfer learning, one of these methods, helps us avoid a cold start by adapting existing ML models to our new task instead of starting from scratch. Transfer learning has been popular for a while, so I will come back to it toward the end of the article. First, however, we will start with labeling, the step in which humans enter the cycle.<\/p>\n<p id=\"e1b6\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">Before we get into labeling\/annotation, let\u2019s look at the principles behind having humans in the ML cycle.<\/p>\n<figure class=\"mj mk ml mm mn mo mg mh paragraph-image\">\n<div class=\"pa pb eb pc bg pd\" tabindex=\"0\" role=\"button\">\n<figure><img loading=\"lazy\" decoding=\"async\" class=\"bg mp mq c\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/v2\/resize:fit:700\/0*pe9htSMfea27Ov14.png\" alt=\"\" width=\"700\" height=\"368\"><\/figure><div class=\"mg mh oz\"><picture><source srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/format:webp\/0*pe9htSMfea27Ov14.png 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/format:webp\/0*pe9htSMfea27Ov14.png 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/format:webp\/0*pe9htSMfea27Ov14.png 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/format:webp\/0*pe9htSMfea27Ov14.png 786w, https:\/\/miro.medium.com\/v2\/resize:fit:828\/format:webp\/0*pe9htSMfea27Ov14.png 828w, https:\/\/miro.medium.com\/v2\/resize:fit:1100\/format:webp\/0*pe9htSMfea27Ov14.png 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:1400\/format:webp\/0*pe9htSMfea27Ov14.png 1400w\" 
type=\"image\/webp\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px\"><source srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/0*pe9htSMfea27Ov14.png 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/0*pe9htSMfea27Ov14.png 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/0*pe9htSMfea27Ov14.png 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/0*pe9htSMfea27Ov14.png 786w, https:\/\/miro.medium.com\/v2\/resize:fit:828\/0*pe9htSMfea27Ov14.png 828w, https:\/\/miro.medium.com\/v2\/resize:fit:1100\/0*pe9htSMfea27Ov14.png 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:1400\/0*pe9htSMfea27Ov14.png 1400w\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px\" data-testid=\"og\"><\/picture><\/div>\n<\/div>\n<figcaption class=\"mr ms mt mg mh mu mv be b bf z dv\" data-selectable-paragraph=\"\"><a class=\"af mw\" href=\"https:\/\/labelyourdata.com\/articles\/human-in-the-loop-in-machine-learning\" target=\"_blank\" rel=\"noopener ugc nofollow\">Human-in-the-loop ML benefits<\/a><\/figcaption>\n<\/figure>\n<h1 id=\"196b\" class=\"pe my fo be mz pf pg go nd ph pi gr nh pj pk pl pm pn po pp pq 
pr ps pt pu pv bj\" data-selectable-paragraph=\"\">Human-in-the-Loop Fundamentals for ML<\/h1>\n<p id=\"b2be\" class=\"pw-post-body-paragraph nv nw fo be b gm nx ny nz gp oa ob oc ni od oe of nm og oh oi nq oj ok ol om fh bj\" data-selectable-paragraph=\"\">When humans and ML interact to achieve one or more of the following goals, we have what is called HITL:<\/p>\n<ul class=\"\">\n<li id=\"8548\" class=\"nv nw fo be b gm on ny nz gp oo ob oc ni pw oe of nm px oh oi nq py ok ol om pz qa qb bj\" data-selectable-paragraph=\"\">Making ML more accurate<\/li>\n<li id=\"7a14\" class=\"nv nw fo be b gm qc ny nz gp qd ob oc ni qe oe of nm qf oh oi nq qg ok ol om pz qa qb bj\" data-selectable-paragraph=\"\">Bringing ML to the required accuracy faster<\/li>\n<li id=\"1fdf\" class=\"nv nw fo be b gm qc ny nz gp qd ob oc ni qe oe of nm qf oh oi nq qg ok ol om pz qa qb bj\" data-selectable-paragraph=\"\">Helping people make better decisions<\/li>\n<li id=\"dbcc\" class=\"nv nw fo be b gm qc ny nz gp qd ob oc ni qe oe of nm qf oh oi nq qg ok ol om pz qa qb bj\" data-selectable-paragraph=\"\">Making people more productive<\/li>\n<\/ul>\n<p id=\"2cd9\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">Essentially, data labeling is an HITL process in which humans contribute directly to ML performance. 
Regardless of image, sound, text, or sensor data, a process similar to the figure below is required.<\/p>\n<figure class=\"mj mk ml mm mn mo mg mh paragraph-image\">\n<div class=\"pa pb eb pc bg pd\" tabindex=\"0\" role=\"button\">\n<figure><img loading=\"lazy\" decoding=\"async\" class=\"bg mp mq c\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/v2\/resize:fit:700\/0*fI5gXiql-f6jGHKC.png\" alt=\"\" width=\"700\" height=\"406\"><\/figure><div class=\"mg mh qh\"><picture><source srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/format:webp\/0*fI5gXiql-f6jGHKC.png 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/format:webp\/0*fI5gXiql-f6jGHKC.png 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/format:webp\/0*fI5gXiql-f6jGHKC.png 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/format:webp\/0*fI5gXiql-f6jGHKC.png 786w, https:\/\/miro.medium.com\/v2\/resize:fit:828\/format:webp\/0*fI5gXiql-f6jGHKC.png 828w, https:\/\/miro.medium.com\/v2\/resize:fit:1100\/format:webp\/0*fI5gXiql-f6jGHKC.png 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:1400\/format:webp\/0*fI5gXiql-f6jGHKC.png 1400w\" type=\"image\/webp\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px\"><source srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/0*fI5gXiql-f6jGHKC.png 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/0*fI5gXiql-f6jGHKC.png 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/0*fI5gXiql-f6jGHKC.png 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/0*fI5gXiql-f6jGHKC.png 786w, 
https:\/\/miro.medium.com\/v2\/resize:fit:828\/0*fI5gXiql-f6jGHKC.png 828w, https:\/\/miro.medium.com\/v2\/resize:fit:1100\/0*fI5gXiql-f6jGHKC.png 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:1400\/0*fI5gXiql-f6jGHKC.png 1400w\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px\" data-testid=\"og\"><\/picture><\/div>\n<\/div>\n<figcaption class=\"mr ms mt mg mh mu mv be b bf z dv\" data-selectable-paragraph=\"\"><a class=\"af mw\" href=\"https:\/\/www.manning.com\/books\/human-in-the-loop-machine-learning\" target=\"_blank\" rel=\"noopener ugc nofollow\">The logic of the HITL process to predict labels in data<\/a><\/figcaption>\n<\/figure>\n<h1 id=\"75e0\" class=\"pe my fo be mz pf pg go nd ph pi gr nh pj pk pl pm pn po pp pq pr ps pt pu pv bj\" data-selectable-paragraph=\"\">1. Annotation \/ Labeling<\/h1>\n<p id=\"6b36\" class=\"pw-post-body-paragraph nv nw fo be b gm nx ny nz gp oa ob oc ni od oe of nm og oh oi nq oj ok ol om fh bj\" data-selectable-paragraph=\"\">Annotation is the most essential step in training ML models. If you ask data scientists how much time they spend improving the dataset and adapting the ML model to it, many will answer that it takes more than 50% of the entire ML development process. 
In other words, it is a challenging process, and it can be approached with different annotation strategies.<\/p>\n<figure class=\"mj mk ml mm mn mo\">\n<div class=\"qi is l eb\">\n<div class=\"qj qk l\"><iframe loading=\"lazy\" class=\"ek n fc dx bg\" title=\"Confused Season 3 GIF by The Simpsons - Find &amp; Share on GIPHY\" src=\"https:\/\/cdn.embedly.com\/widgets\/media.html?src=https%3A%2F%2Fgiphy.com%2Fembed%2F3o6Mb4xnceEKVDZ4Fq%2Ftwitter%2Fiframe&amp;display_name=Giphy&amp;url=https%3A%2F%2Fgiphy.com%2Fgifs%2Fseason-3-the-simpsons-3x13-3o6Mb4xnceEKVDZ4Fq&amp;image=https%3A%2F%2Fmedia4.giphy.com%2Fmedia%2F3o6Mb4xnceEKVDZ4Fq%2Fgiphy.gif%3Fcid%3D790b7611f8846bdb40a7dd392dad7bfe1cea0c5968568dea%26rid%3Dgiphy.gif%26ct%3Dg&amp;key=a19fcc184b9711e1b4764040d3dc5c07&amp;type=text%2Fhtml&amp;schema=giphy\" width=\"435\" height=\"331\" frameborder=\"0\" scrolling=\"no\" allowfullscreen=\"allowfullscreen\" data-mce-fragment=\"1\"><\/iframe><\/div>\n<\/div>\n<figcaption class=\"mr ms mt mg mh mu mv be b bf z dv\">Giphy: <a class=\"af mw\" href=\"https:\/\/giphy.com\/gifs\/season-3-the-simpsons-3x13-3o6Mb4xnceEKVDZ4Fq\" target=\"_blank\" rel=\"noopener ugc nofollow\">GIF Source<\/a><\/figcaption>\n<\/figure>\n<h2 id=\"0a8f\" class=\"mx my fo be mz na nb nc nd ne nf ng nh ni nj nk nl nm nn no np nq nr ns nt nu bj\" data-selectable-paragraph=\"\"><strong class=\"al\">Simple and complex annotation strategies<\/strong><\/h2>\n<p id=\"a700\" class=\"pw-post-body-paragraph nv nw fo be b gm nx ny nz gp oa ob oc ni od oe of nm og oh oi nq oj ok ol om fh bj\" data-selectable-paragraph=\"\">The annotation process can be straightforward. For example, you can label each social media post about a product as <em class=\"os\">\u201cpositive,\u201d<\/em> <em class=\"os\">\u201cnegative,\u201d<\/em> or <em class=\"os\">\u201cneutral\u201d<\/em> to analyze sentiment trends about the product. For this, you can create and distribute an HTML form within a few hours. 
A simple HTML form lets someone rate each social media post by choosing a sentiment option. Each rating becomes the label for that post in your training data.<\/p>\n<p id=\"6118\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">On the other hand, the annotation process can be very <strong class=\"be ql\">complicated<\/strong>. If you want to label every object in a video with a bounding box, a simple HTML form isn\u2019t enough. You need a graphical interface, such as the one Supervisely provides, and building one with a good user experience can take months of engineering.<\/p>\n<div class=\"qm qn qo qp qq qr\">\n<div class=\"qs ab ik\">\n<div class=\"qt ab cn ca qu qv\">\n<p class=\"be fp ia z is qw iu iv qx ix iz fn bj\"><a href=\"https:\/\/supervise.ly\/?source=post_page-----10c975ff3b1b--------------------------------\">Supervisely: unified OS for computer vision<\/a><\/p>\n<\/div>\n<\/div>\n<\/div>\n<h2 id=\"b2ac\" class=\"mx my fo be mz na nb nc nd ne nf ng nh ni nj nk nl nm nn no np nq nr ns nt nu bj\" data-selectable-paragraph=\"\"><strong class=\"al\">Filling the gap in data science knowledge<\/strong><\/h2>\n<p id=\"d0b1\" class=\"pw-post-body-paragraph nv nw fo be b gm nx ny nz gp oa ob oc ni od oe of nm og oh oi nq oj ok ol om fh bj\" data-selectable-paragraph=\"\">Algorithms and annotations are equally crucial for robust and successful ML applications. The two components are intertwined. You will usually get better accuracy from your models if you work on both together. 
It also helps to optimize your ML algorithms and your data strategy at the same time.<\/p>\n<p id=\"dc20\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">There are ML courses in almost every computer science department curriculum, but few offer enough information on how to plan training data for ML. You might come across a course or two on training data strategy among the hundreds of ML courses. Fortunately, this situation is slowly changing.<\/p>\n<p id=\"fb93\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">Academia and the private sector approach the same difficult problem differently. In the private sector, as opposed to academic ML research, it is more common to improve model performance by adding more training data. Especially when data changes over time (which is very common), adding some new labeled data can be much more effective than adapting an existing ML model to a new domain. However, most academic papers focus on adapting algorithms to a new domain without adding new training data, rather than on how to efficiently label accurate and up-to-date training datasets. The reason, of course, is that quality datasets are difficult to access.<\/p>\n<p id=\"7247\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">Increasingly, people know how to work with state-of-the-art ML algorithms but have no experience choosing the right interfaces for creating high-quality labeled training datasets. In other words, we are faced with many experts who know algorithms but do not understand data. 
<a class=\"af mw\" href=\"https:\/\/meticulousblog.org\/top-10-companies-in-automotive-artificial-intelligence-market\/\" target=\"_blank\" rel=\"noopener ugc nofollow\">A recent example of this can be seen at one of the biggest automobile manufacturers in the world.<\/a><\/p>\n<div class=\"qm qn qo qp qq qr\">\n<div class=\"qs ab ik\">\n<div class=\"qt ab cn ca qu qv\">\n<p class=\"be fp ia z is qw iu iv qx ix iz fn bj\"><a href=\"https:\/\/streetfins.com\/inside-teslas-crazy-ai-manufacturing-revolution\/?source=post_page-----10c975ff3b1b--------------------------------\">Inside Tesla&#8217;s Crazy AI Manufacturing Revolution | StreetFins\u00ae<\/a><\/p>\n<\/div>\n<\/div>\n<\/div>\n<p id=\"5d06\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">They employ many new ML\/AI engineers but struggle to make their autonomous vehicle technology functional because they cannot scale their data annotation and labeling strategies. To help them scale, they should consider rebuilding their process around two components. 
These two components are equally crucial for a well-performing ML implementation\/application:<\/p>\n<ul class=\"\">\n<li id=\"fd1e\" class=\"nv nw fo be b gm on ny nz gp oo ob oc ni pw oe of nm px oh oi nq py ok ol om pz qa qb bj\" data-selectable-paragraph=\"\">Algorithms<\/li>\n<li id=\"4009\" class=\"nv nw fo be b gm qc ny nz gp qd ob oc ni qe oe of nm qf oh oi nq qg ok ol om pz qa qb bj\" data-selectable-paragraph=\"\">Correctly created (quality\/fair) training data<\/li>\n<\/ul>\n<h2 id=\"02e7\" class=\"mx my fo be mz na nb nc nd ne nf ng nh ni nj nk nl nm nn no np nq nr ns nt nu bj\" data-selectable-paragraph=\"\"><strong class=\"al\">Why is it difficult for humans to produce high-quality labels?<\/strong><\/h2>\n<p id=\"f4fa\" class=\"pw-post-body-paragraph nv nw fo be b gm nx ny nz gp oa ob oc ni od oe of nm og oh oi nq oj ok ol om fh bj\" data-selectable-paragraph=\"\">For researchers, labeling is a field of study closely tied to data science and ML, and it is an essential part of data science. The most obvious difficulty is that the people providing the labels can make mistakes. Tackling these errors requires surprisingly complex statistics.<\/p>\n<p id=\"4648\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">Human errors in training data can be more or less critical depending on the use case. If an ML model is only used to identify broad trends in consumer sentiment, it probably doesn\u2019t matter if errors propagate from 1% of bad training data. But it could be disastrous if an ML algorithm powering an autonomous vehicle failed to see 1% of pedestrians because of errors propagated from inaccurate training data. 
Some algorithms can tolerate a little noise in the training data, and random noise can even help some algorithms produce more accurate, more generalizable results by preventing overfitting.<\/p>\n<blockquote class=\"ot ou ov\"><p id=\"8d82\" class=\"nv nw os be b gm on ny nz gp oo ob oc ow op oe of ox oq oh oi oy or ok ol om fh bj\" data-selectable-paragraph=\"\">But human errors tend not to be random noise, so they tend to introduce irrecoverable bias into the training data.<\/p><\/blockquote>\n<p id=\"b612\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">Discussing data annotation and labeling as a science may not excite everyone, but labeling is humankind\u2019s first step toward cooperating with machines. Labeling puts humans in the loop of ML from the very beginning.<\/p>\n<\/div>\n<\/div>\n<\/div>\n\n\n\n<div class=\"fh fi fj fk fl\">\n<div class=\"ab ca\">\n<div class=\"ch bg et eu ev ew\">\n<blockquote class=\"rp\"><p id=\"0e79\" class=\"rq rr fo be rs rt ru rv rw rx ry om dv\" data-selectable-paragraph=\"\">Innovation and academia go hand-in-hand. Listen to our own <a class=\"af mw\" href=\"https:\/\/www.youtube.com\/watch?v=7XCsi64HLQ8.\" target=\"_blank\" rel=\"noopener ugc nofollow\">CEO Gideon Mendels chat with the Stanford MLSys Seminar Series team<\/a> about the future of MLOps and <a class=\"af mw\" href=\"\/signup\/\" target=\"_blank\" rel=\"noopener ugc nofollow\">give the Comet platform a try for free<\/a>!<\/p><\/blockquote>\n<\/div>\n<\/div>\n<\/div>\n\n\n\n<div class=\"fh fi fj fk fl\">\n<div class=\"ab ca\">\n<div class=\"ch bg et eu ev ew\">\n<h1 id=\"c1b3\" class=\"pe my fo be mz pf rz go nd ph sa gr nh pj sb pl pm pn sc pp pq pr sd pt pu pv bj\" data-selectable-paragraph=\"\">2. 
Active Learning<\/h1>\n<h2 id=\"97f0\" class=\"mx my fo be mz na nb nc nd ne nf ng nh ni nj nk nl nm nn no np nq nr ns nt nu bj\" data-selectable-paragraph=\"\">Increasing speed and decreasing cost<\/h2>\n<p id=\"24a3\" class=\"pw-post-body-paragraph nv nw fo be b gm nx ny nz gp oa ob oc ni od oe of nm og oh oi nq oj ok ol om fh bj\" data-selectable-paragraph=\"\">Supervised learning models need labeled data, and they generally perform better the more of it they have. Active learning is the process of choosing which data should be labeled. Most research articles on active learning have focused on the number of training items. But speed can be an even more critical factor in many cases.<\/p>\n<p id=\"6081\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">For example, in disaster response, ML models are often used to filter and extract information about emerging disasters. Any delay in disaster response is potentially critical. 
That\u2019s why getting a usable model becomes more important than the number of labels that need to go into that model.<\/p>\n<figure class=\"mj mk ml mm mn mo mg mh paragraph-image\">\n<div class=\"pa pb eb pc bg pd\" tabindex=\"0\" role=\"button\">\n<figure><img loading=\"lazy\" decoding=\"async\" class=\"bg mp mq c\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/v2\/resize:fit:700\/0*i91lWBVIbz7K6T-z.png\" alt=\"\" width=\"700\" height=\"368\"><\/figure><div class=\"mg mh oz\"><picture><source srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/format:webp\/0*i91lWBVIbz7K6T-z.png 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/format:webp\/0*i91lWBVIbz7K6T-z.png 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/format:webp\/0*i91lWBVIbz7K6T-z.png 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/format:webp\/0*i91lWBVIbz7K6T-z.png 786w, https:\/\/miro.medium.com\/v2\/resize:fit:828\/format:webp\/0*i91lWBVIbz7K6T-z.png 828w, https:\/\/miro.medium.com\/v2\/resize:fit:1100\/format:webp\/0*i91lWBVIbz7K6T-z.png 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:1400\/format:webp\/0*i91lWBVIbz7K6T-z.png 1400w\" type=\"image\/webp\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px\"><source srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/0*i91lWBVIbz7K6T-z.png 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/0*i91lWBVIbz7K6T-z.png 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/0*i91lWBVIbz7K6T-z.png 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/0*i91lWBVIbz7K6T-z.png 786w, 
https:\/\/miro.medium.com\/v2\/resize:fit:828\/0*i91lWBVIbz7K6T-z.png 828w, https:\/\/miro.medium.com\/v2\/resize:fit:1100\/0*i91lWBVIbz7K6T-z.png 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:1400\/0*i91lWBVIbz7K6T-z.png 1400w\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px\" data-testid=\"og\"><\/picture><\/div>\n<\/div>\n<figcaption class=\"mr ms mt mg mh mu mv be b bf z dv\" data-selectable-paragraph=\"\"><a class=\"af mw\" href=\"https:\/\/labelyourdata.com\/articles\/human-in-the-loop-in-machine-learning\" target=\"_blank\" rel=\"noopener ugc nofollow\">How HITL in ML works<\/a><\/figcaption>\n<\/figure>\n<p id=\"45c0\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">Just as no single algorithm, architecture, or parameter set will make an ML model more accurate in every situation, there is no single active learning strategy that will be optimal for all use cases and datasets. However, as with ML models, there are some approaches you should try first because they are more likely to work. 
Let\u2019s talk about these strategies now.<\/p>\n<h2 id=\"fbc7\" class=\"mx my fo be mz na nb nc nd ne nf ng nh ni nj nk nl nm nn no np nq nr ns nt nu bj\" data-selectable-paragraph=\"\"><strong class=\"al\">The three most commonly used active learning sampling strategies: Uncertainty, Diversity, and Randomness<\/strong><\/h2>\n<p id=\"c4cc\" class=\"pw-post-body-paragraph nv nw fo be b gm nx ny nz gp oa ob oc ni od oe of nm og oh oi nq oj ok ol om fh bj\" data-selectable-paragraph=\"\">There are different active learning strategies and many algorithms to implement them. Three basic approaches that work well in most cases, and should almost always be your starting point, are:<\/p>\n<ul class=\"\">\n<li id=\"d885\" class=\"nv nw fo be b gm on ny nz gp oo ob oc ni pw oe of nm px oh oi nq py ok ol om pz qa qb bj\" data-selectable-paragraph=\"\">Uncertainty Sampling<\/li>\n<li id=\"52e4\" class=\"nv nw fo be b gm qc ny nz gp qd ob oc ni qe oe of nm qf oh oi nq qg ok ol om pz qa qb bj\" data-selectable-paragraph=\"\">Diversity Sampling<\/li>\n<li id=\"cfeb\" class=\"nv nw fo be b gm qc ny nz gp qd ob oc ni qe oe of nm qf oh oi nq qg ok ol om pz qa qb bj\" data-selectable-paragraph=\"\">Random Sampling<\/li>\n<\/ul>\n<p id=\"edf1\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\"><strong class=\"be ql\">Random Sampling<\/strong> is the simplest in principle but can be the trickiest in practice. What if your data has been pre-filtered, changes over time, or, for some other reason, cannot be sampled randomly in a way that represents the problem you are addressing? 
Regardless of the strategy, some amount of randomly sampled data always needs to be set aside and labeled, both to measure the accuracy of your model and to compare your active learning strategies against a baseline of randomly selected items.<\/p>\n<p id=\"2a96\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\"><strong class=\"be ql\">Uncertainty Sampling (Exploitation)<\/strong> is a strategy for identifying unlabeled items that lie near the decision boundary of your current ML model. In a binary classification task, these are the items predicted to have close to a 50% probability of belonging to either label, so the model is \u201cuncertain\u201d or \u201cconfused\u201d about them. These items are the most likely to be misclassified, so they are the most likely to receive a label that differs from the predicted one. Once they are added to the training data and the model is retrained, the decision boundary moves.<\/p>\n<p id=\"6177\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\"><strong class=\"be ql\">Diversity Sampling (Exploration)<\/strong> is a strategy for identifying unlabeled items that are currently unknown to the ML model. This usually means items that contain combinations of feature values that are rare or unseen in the training data. Diversity sampling aims to give the ML algorithm labels for items that are outliers or anomalies in the problem domain.<\/p>\n<p id=\"706d\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">Both Uncertainty Sampling and Diversity Sampling have their own shortcomings. Uncertainty Sampling may focus on only one part of the decision boundary. Diversity Sampling may focus only on outliers that are far from the boundary. 
For this reason, the two strategies are often combined to find a selection of unlabeled items that maximizes both uncertainty and diversity.<\/p>\n<p id=\"b3ce\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">The following illustrations show the strengths and weaknesses of different types of active learning.<\/p>\n<\/div>\n<\/div>\n<div class=\"mo\">\n<div class=\"ab ca\">\n<div class=\"se sf sg sh si sj ce sk cf sl ch bg\">\n<div class=\"mj mk ml mm mn ab ki\">\n<figure class=\"le mo sm sn so sp sq paragraph-image\">\n<figure><img loading=\"lazy\" decoding=\"async\" class=\"bg mp mq c\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/v2\/resize:fit:589\/0*yRy-IwQK7MZSe_2A.png\" alt=\"\" width=\"502\" height=\"443\"><\/figure><div class=\"pa pb eb pc bg pd\" tabindex=\"0\" role=\"button\"><picture><source srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/format:webp\/0*yRy-IwQK7MZSe_2A.png 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/format:webp\/0*yRy-IwQK7MZSe_2A.png 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/format:webp\/0*yRy-IwQK7MZSe_2A.png 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/format:webp\/0*yRy-IwQK7MZSe_2A.png 786w, https:\/\/miro.medium.com\/v2\/resize:fit:828\/format:webp\/0*yRy-IwQK7MZSe_2A.png 828w, https:\/\/miro.medium.com\/v2\/resize:fit:1100\/format:webp\/0*yRy-IwQK7MZSe_2A.png 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:1004\/format:webp\/0*yRy-IwQK7MZSe_2A.png 1004w\" type=\"image\/webp\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, 
(min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 502px\"><source srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/0*yRy-IwQK7MZSe_2A.png 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/0*yRy-IwQK7MZSe_2A.png 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/0*yRy-IwQK7MZSe_2A.png 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/0*yRy-IwQK7MZSe_2A.png 786w, https:\/\/miro.medium.com\/v2\/resize:fit:828\/0*yRy-IwQK7MZSe_2A.png 828w, https:\/\/miro.medium.com\/v2\/resize:fit:1100\/0*yRy-IwQK7MZSe_2A.png 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:1004\/0*yRy-IwQK7MZSe_2A.png 1004w\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 502px\" data-testid=\"og\"><\/picture><\/div>\n<\/figure>\n<figure class=\"le mo sr sn so sp sq paragraph-image\">\n<figure><img loading=\"lazy\" decoding=\"async\" class=\"bg mp mq c\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/v2\/resize:fit:586\/0*7KQ6Ij8nxBoteoBd.png\" alt=\"\" width=\"499\" height=\"443\"><\/figure><div class=\"pa pb eb pc bg pd\" tabindex=\"0\" role=\"button\"><picture><source srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/format:webp\/0*7KQ6Ij8nxBoteoBd.png 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/format:webp\/0*7KQ6Ij8nxBoteoBd.png 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/format:webp\/0*7KQ6Ij8nxBoteoBd.png 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/format:webp\/0*7KQ6Ij8nxBoteoBd.png 786w, 
https:\/\/miro.medium.com\/v2\/resize:fit:828\/format:webp\/0*7KQ6Ij8nxBoteoBd.png 828w, https:\/\/miro.medium.com\/v2\/resize:fit:1100\/format:webp\/0*7KQ6Ij8nxBoteoBd.png 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:998\/format:webp\/0*7KQ6Ij8nxBoteoBd.png 998w\" type=\"image\/webp\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 499px\"><source srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/0*7KQ6Ij8nxBoteoBd.png 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/0*7KQ6Ij8nxBoteoBd.png 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/0*7KQ6Ij8nxBoteoBd.png 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/0*7KQ6Ij8nxBoteoBd.png 786w, https:\/\/miro.medium.com\/v2\/resize:fit:828\/0*7KQ6Ij8nxBoteoBd.png 828w, https:\/\/miro.medium.com\/v2\/resize:fit:1100\/0*7KQ6Ij8nxBoteoBd.png 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:998\/0*7KQ6Ij8nxBoteoBd.png 998w\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 499px\" data-testid=\"og\"><\/picture><\/div>\n<figcaption class=\"mr ms mt mg mh mu mv be b bf z dv ss eb st su\" data-selectable-paragraph=\"\"><a class=\"af mw\" 
href=\"https:\/\/www.manning.com\/books\/human-in-the-loop-machine-learning\" target=\"_blank\" rel=\"noopener ugc nofollow\">Left: A selection of uncertain items that are all from the same region of the feature space, and therefore lack diversity. Right: Diversity sampling, showing items selected to be labeled that are maximally different from the existing training items and from one another.<\/a><\/figcaption>\n<\/figure>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"ab ca\">\n<div class=\"ch bg et eu ev ew\">\n<p id=\"f504\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">It is important to note that the active learning process is iterative. In each iteration, a set of items is selected and given new human-generated labels, then the model is retrained with the new data and the process is repeated.<\/p>\n<h2 id=\"0004\" class=\"mx my fo be mz na nb nc nd ne nf ng nh ni nj nk nl nm nn no np nq nr ns nt nu bj\" data-selectable-paragraph=\"\"><strong class=\"al\">Random selection of evaluation data<\/strong><\/h2>\n<p id=\"a30a\" class=\"pw-post-body-paragraph nv nw fo be b gm nx ny nz gp oa ob oc ni od oe of nm og oh oi nq oj ok ol om fh bj\" data-selectable-paragraph=\"\">It is easy to say that you should always evaluate on a randomly selected, held-out set of data. 
However, in practice it may not be that easy.<\/p>\n<p id=\"f987\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">In the recent past, when researchers applied ML models to <a class=\"af mw\" href=\"https:\/\/image-net.org\/download.php\" target=\"_blank\" rel=\"noopener ugc nofollow\">the well-known and extensive ImageNet dataset, they used 1,000 labels to identify the category of the image<\/a>, such as <em class=\"os\">\u201ctaper,\u201d<\/em> <em class=\"os\">\u201ctaxicab,\u201d<\/em> <em class=\"os\">\u201cswimming,\u201d<\/em> and other primary classes.<strong class=\"be ql\"> ImageNet competitions are judged on data held out for testing from this dataset, and models achieved near human-level accuracy on that randomly held-out data. However, if you take the same models and apply them to a random selection of images posted on a social media platform like Facebook or Instagram, the accuracy immediately drops by about 10% or more.<\/strong><\/p>\n<p id=\"65e4\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">The data used by almost all ML applications changes over time. If you work on natural language processing, the topics people talk about will change over time, and the languages themselves will innovate and evolve over fairly short periods. 
If you work on computer vision, the types of objects you encounter change over time and, sometimes just as importantly, the images themselves change as camera technology advances.<\/p>\n<p id=\"93a5\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">If you cannot define a random evaluation dataset, you should try to define a representative evaluation dataset. By defining a representative dataset, you accept that a truly random sample is impossible or not meaningful for your data. It\u2019s up to you to define what is representative of your use case, because that depends on how you apply the model.<\/p>\n<p id=\"a8a8\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">For most real-world applications, it is recommended to keep an evaluation dataset that differs from your training data, which gives you the best estimate of how well your model generalizes. This can be difficult with Active Learning because as soon as you start labeling this data, it is no longer a <em class=\"os\">\u201cdifferent dataset\u201d<\/em>; it becomes a set you know.<\/p>\n<h2 id=\"9bd1\" class=\"mx my fo be mz na nb nc nd ne nf ng nh ni nj nk nl nm nn no np nq nr ns nt nu bj\" data-selectable-paragraph=\"\"><strong class=\"al\">When should we use Active Learning?<\/strong><\/h2>\n<p id=\"f0b7\" class=\"pw-post-body-paragraph nv nw fo be b gm nx ny nz gp oa ob oc ni od oe of nm og oh oi nq oj ok ol om fh bj\" data-selectable-paragraph=\"\">You should use Active Learning when you can label only a tiny fraction of your data and random sampling alone would not cover the diversity of that data. 
It covers most real-world scenarios, as the scale of this data becomes an essential factor in many use cases.<\/p>\n<p id=\"35ba\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">An excellent example of this is the amount of data contained in videos. If you want to put a bounding box around every object in every video frame, that would be very time-consuming. Imagine this is for an autonomous vehicle, and it\u2019s a street video with only about 20 objects you care about:<\/p>\n<p id=\"9728\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">Let\u2019s say 10 other cars, 5 pedestrians, and 5 traffic signs. At 30 frames per second, that\u2019s (30 frames \u00d7 60 seconds \u00d7 20 objects). So, you need to create 36,000 boxes for just one minute of data!<\/p>\n<\/div>\n<\/div>\n<div class=\"mo\">\n<div class=\"ab ca\">\n<div class=\"se sf sg sh si sj ce sk cf sl ch bg\">\n<div class=\"mj mk ml mm mn ab ki\">\n<figure class=\"le mo sv sn so sp sq paragraph-image\"><picture><source srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/format:webp\/0*czOVXi7HNM01nlR9.png 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/format:webp\/0*czOVXi7HNM01nlR9.png 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/format:webp\/0*czOVXi7HNM01nlR9.png 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/format:webp\/0*czOVXi7HNM01nlR9.png 786w, https:\/\/miro.medium.com\/v2\/resize:fit:828\/format:webp\/0*czOVXi7HNM01nlR9.png 828w, https:\/\/miro.medium.com\/v2\/resize:fit:1100\/format:webp\/0*czOVXi7HNM01nlR9.png 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:650\/format:webp\/0*czOVXi7HNM01nlR9.png 650w\" type=\"image\/webp\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) 
and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 325px\"><source srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/0*czOVXi7HNM01nlR9.png 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/0*czOVXi7HNM01nlR9.png 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/0*czOVXi7HNM01nlR9.png 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/0*czOVXi7HNM01nlR9.png 786w, https:\/\/miro.medium.com\/v2\/resize:fit:828\/0*czOVXi7HNM01nlR9.png 828w, https:\/\/miro.medium.com\/v2\/resize:fit:1100\/0*czOVXi7HNM01nlR9.png 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:650\/0*czOVXi7HNM01nlR9.png 650w\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 325px\" data-testid=\"og\"><img loading=\"lazy\" decoding=\"async\" class=\"bg mp mq c\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/v2\/resize:fit:325\/0*czOVXi7HNM01nlR9.png\" alt=\"\" width=\"325\" height=\"276\"><\/picture><\/figure>\n<figure class=\"le mo sw sn so sp sq paragraph-image\">\n<figure><img loading=\"lazy\" decoding=\"async\" class=\"bg mp mq c\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/v2\/resize:fit:637\/0*2O_vO8NUiaQ08W4H.png\" alt=\"\" width=\"619\" height=\"334\"><\/figure><div class=\"pa pb eb pc bg pd\" tabindex=\"0\" role=\"button\"><picture><source 
srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/format:webp\/0*2O_vO8NUiaQ08W4H.png 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/format:webp\/0*2O_vO8NUiaQ08W4H.png 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/format:webp\/0*2O_vO8NUiaQ08W4H.png 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/format:webp\/0*2O_vO8NUiaQ08W4H.png 786w, https:\/\/miro.medium.com\/v2\/resize:fit:828\/format:webp\/0*2O_vO8NUiaQ08W4H.png 828w, https:\/\/miro.medium.com\/v2\/resize:fit:1100\/format:webp\/0*2O_vO8NUiaQ08W4H.png 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:1238\/format:webp\/0*2O_vO8NUiaQ08W4H.png 1238w\" type=\"image\/webp\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 619px\"><source srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/0*2O_vO8NUiaQ08W4H.png 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/0*2O_vO8NUiaQ08W4H.png 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/0*2O_vO8NUiaQ08W4H.png 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/0*2O_vO8NUiaQ08W4H.png 786w, https:\/\/miro.medium.com\/v2\/resize:fit:828\/0*2O_vO8NUiaQ08W4H.png 828w, https:\/\/miro.medium.com\/v2\/resize:fit:1100\/0*2O_vO8NUiaQ08W4H.png 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:1238\/0*2O_vO8NUiaQ08W4H.png 1238w\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 
80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 619px\" data-testid=\"og\"><\/picture><\/div>\n<figcaption class=\"mr ms mt mg mh mu mv be b bf z dv sx eb sy su\" data-selectable-paragraph=\"\"><a class=\"af mw\" href=\"https:\/\/www.manning.com\/books\/human-in-the-loop-machine-learning\" target=\"_blank\" rel=\"noopener ugc nofollow\">An example of multiple bounding boxes from multiple annotators. Overall agreement is calculated as the average pairwise IoU (Intersection over Union) of all boxes.<\/a><\/figcaption>\n<\/figure>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"ab ca\">\n<div class=\"ch bg et eu ev ew\">\n<p id=\"8468\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">Completing the labels required for just one minute of video can take at least 12 hours, even for the fastest annotator. In the US alone, people drive an average of 1 hour per day, which means that people in the US drive 95,104,400,000 hours per year.<\/p>\n<p id=\"9ff1\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">We expect that soon every car will have a front-facing video camera to drive the car or assist the driver. So in the US alone, it would take approximately 60,000,000,000,000 (60 trillion) hours to label one year of driving! 
Even if the rest of the world did nothing but label data all day to make US drivers safer, there would not be enough people to label the videos of US drivers.<\/p>\n<p id=\"dbe5\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">Whatever an autonomous vehicle company\u2019s labeling budget is, it will cover far less than the amount of data available to label. So, data scientists at the autonomous vehicle company have to make decisions about the labeling process: Is every frame in a video necessary? Can we sample from the videos, so we don\u2019t need to label them all? Are there ways to design a labeling interface to speed up the process?<\/p>\n<p id=\"db57\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">This problem applies in most cases: there will always be more data than the budget and time allocated for labeling can cover.<\/p>\n<figure class=\"mj mk ml mm mn mo mg mh paragraph-image\">\n<div class=\"pa pb eb pc bg pd\" tabindex=\"0\" role=\"button\">\n<figure><img loading=\"lazy\" decoding=\"async\" class=\"bg mp mq c\" role=\"presentation\" src=\"https:\/\/miro.medium.com\/v2\/resize:fit:700\/0*0CTz6BDOSA2xIkBe.jpeg\" alt=\"\" width=\"700\" height=\"250\"><\/figure><div class=\"mg mh sz\"><picture><source srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/format:webp\/0*0CTz6BDOSA2xIkBe.jpeg 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/format:webp\/0*0CTz6BDOSA2xIkBe.jpeg 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/format:webp\/0*0CTz6BDOSA2xIkBe.jpeg 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/format:webp\/0*0CTz6BDOSA2xIkBe.jpeg 786w, https:\/\/miro.medium.com\/v2\/resize:fit:828\/format:webp\/0*0CTz6BDOSA2xIkBe.jpeg 828w, 
https:\/\/miro.medium.com\/v2\/resize:fit:1100\/format:webp\/0*0CTz6BDOSA2xIkBe.jpeg 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:1400\/format:webp\/0*0CTz6BDOSA2xIkBe.jpeg 1400w\" type=\"image\/webp\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px\"><source srcset=\"https:\/\/miro.medium.com\/v2\/resize:fit:640\/0*0CTz6BDOSA2xIkBe.jpeg 640w, https:\/\/miro.medium.com\/v2\/resize:fit:720\/0*0CTz6BDOSA2xIkBe.jpeg 720w, https:\/\/miro.medium.com\/v2\/resize:fit:750\/0*0CTz6BDOSA2xIkBe.jpeg 750w, https:\/\/miro.medium.com\/v2\/resize:fit:786\/0*0CTz6BDOSA2xIkBe.jpeg 786w, https:\/\/miro.medium.com\/v2\/resize:fit:828\/0*0CTz6BDOSA2xIkBe.jpeg 828w, https:\/\/miro.medium.com\/v2\/resize:fit:1100\/0*0CTz6BDOSA2xIkBe.jpeg 1100w, https:\/\/miro.medium.com\/v2\/resize:fit:1400\/0*0CTz6BDOSA2xIkBe.jpeg 1400w\" sizes=\"(min-resolution: 4dppx) and (max-width: 700px) 50vw, (-webkit-min-device-pixel-ratio: 4) and (max-width: 700px) 50vw, (min-resolution: 3dppx) and (max-width: 700px) 67vw, (-webkit-min-device-pixel-ratio: 3) and (max-width: 700px) 65vw, (min-resolution: 2.5dppx) and (max-width: 700px) 80vw, (-webkit-min-device-pixel-ratio: 2.5) and (max-width: 700px) 80vw, (min-resolution: 2dppx) and (max-width: 700px) 100vw, (-webkit-min-device-pixel-ratio: 2) and (max-width: 700px) 100vw, 700px\" data-testid=\"og\"><\/picture><\/div>\n<\/div>\n<figcaption class=\"mr ms mt mg mh mu mv be b bf z dv\" data-selectable-paragraph=\"\"><a class=\"af mw\" href=\"https:\/\/hai.stanford.edu\/news\/humans-loop-design-interactive-ai-systems\" 
target=\"_blank\" rel=\"noopener ugc nofollow\">General blueprint for an HITL interactive AI system.<\/a><\/figcaption>\n<\/figure>\n<h1 id=\"b3b8\" class=\"pe my fo be mz pf pg go nd ph pi gr nh pj pk pl pm pn po pp pq pr ps pt pu pv bj\" data-selectable-paragraph=\"\">Closing<\/h1>\n<p id=\"ffa7\" class=\"pw-post-body-paragraph nv nw fo be b gm nx ny nz gp oa ob oc ni od oe of nm og oh oi nq oj ok ol om fh bj\" data-selectable-paragraph=\"\">Intelligent systems that learn interactively from their end users are rapidly becoming widespread. Until recently, this progress was mainly fueled by advances in ML; however, more and more researchers now recognize the importance of studying the users of these systems. You\u2019ve seen how this approach can result in better user experiences and more effective learning systems. There is every reason to argue that interactive ML systems should involve users at every stage of the design process.<\/p>\n<p id=\"5b86\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">Human-computer interaction is a well-established field in computer science that has recently become particularly important for ML. It\u2019s a field where cognitive science, social sciences, psychology, user experience design, and many other fields intersect as we build interfaces for people to create training data.<\/p>\n<p id=\"1c29\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">HITL ML is an iterative process that combines human and machine components. Active learning through labeling is only the first step. 
In another blog post, we can examine the transfer learning dimension of the subject.<\/p>\n<\/div>\n<\/div>\n<\/div>\n\n\n\n<div class=\"fh fi fj fk fl\">\n<div class=\"ab ca\">\n<div class=\"ch bg et eu ev ew\">\n<blockquote class=\"ot ou ov\"><p id=\"04af\" class=\"nv nw os be b gm on ny nz gp oo ob oc ow op oe of ox oq oh oi oy or ok ol om fh bj\" data-selectable-paragraph=\"\"><strong class=\"be ql\">Feel free to follow me on <\/strong><a class=\"af mw\" href=\"https:\/\/github.com\/ayyucekizrak\" target=\"_blank\" rel=\"noopener ugc nofollow\"><strong class=\"be ql\">GitHub<\/strong><\/a><strong class=\"be ql\"> and <\/strong><a class=\"af mw\" href=\"https:\/\/twitter.com\/ayyucekizrak\" target=\"_blank\" rel=\"noopener ugc nofollow\"><strong class=\"be ql\">Twitter<\/strong><\/a><strong class=\"be ql\"> accounts for more content!<\/strong><\/p><\/blockquote>\n<figure class=\"mj mk ml mm mn mo\">\n<div class=\"qi is l eb\">\n<div class=\"ta qk l\"><iframe loading=\"lazy\" class=\"ek n fc dx bg\" title=\"\" src=\"https:\/\/drive.google.com\/viewerng\/viewer?url=https%3A\/\/mneguidelines.oecd.org\/RBC-and-artificial-intelligence.pdf&amp;embedded=true\" width=\"600\" height=\"780\" frameborder=\"0\" scrolling=\"no\" allowfullscreen=\"allowfullscreen\" data-mce-fragment=\"1\"><\/iframe><\/div>\n<\/div>\n<\/figure>\n<p id=\"0dfd\" class=\"pw-post-body-paragraph nv nw fo be b gm on ny nz gp oo ob oc ni op oe of nm oq oh oi nq or ok ol om fh bj\" data-selectable-paragraph=\"\">I would like to thank <a class=\"af mw\" href=\"https:\/\/medium.com\/u\/cff0ed378ad5\" rel=\"noopener\"><em class=\"os\">Ba\u015fak Buluz K\u00f6me\u00e7o\u011flu<\/em><\/a> for her feedback on this blog post.<\/p>\n<h2 id=\"b073\" class=\"mx my fo be mz na nb nc nd ne nf ng nh ni nj nk nl nm nn no np nq nr ns nt nu bj\" data-selectable-paragraph=\"\">Check out some other blog posts published on Heartbeat:<\/h2>\n<div class=\"qm qn qo qp qq qr\">\n<div class=\"qs ab ik\">\n<div class=\"qt ab 
cn ca qu qv\">\n<ul>\n<li class=\"be fp ia z is qw iu iv qx ix iz fn bj\"><a href=\"https:\/\/heartbeat.comet.ml\/towards-data-centric-ai-7a291ef2d508?source=post_page-----10c975ff3b1b--------------------------------\">Towards Data-Centric AI<\/a><\/li>\n<li class=\"be fp ia z is qw iu iv qx ix iz fn bj\"><a href=\"https:\/\/heartbeat.comet.ml\/what-is-algorithmic-bias-a01dd1bbe076?source=post_page-----10c975ff3b1b--------------------------------\">What is Algorithmic Bias?<\/a><\/li>\n<li class=\"be fp ia z is qw iu iv qx ix iz fn bj\"><a href=\"https:\/\/heartbeat.comet.ml\/reviewing-efficientnet-increasing-the-accuracy-and-robustness-of-cnns-6aaf411fc81d?source=post_page-----10c975ff3b1b--------------------------------\">Reviewing EfficientNet: Increasing the Accuracy and Robustness of CNNs<\/a><\/li>\n<li class=\"be fp ia z is qw iu iv qx ix iz fn bj\"><a href=\"https:\/\/heartbeat.comet.ml\/explainable-responsible-and-trustworthy-artificial-intelligence-47c140ebeb05?source=post_page-----10c975ff3b1b--------------------------------\">Explainable, Responsible, and Trustworthy Artificial Intelligence<\/a><\/li>\n<\/ul>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n\n\n\n<div class=\"fh fi fj fk fl\">\n<div class=\"ab ca\">\n<div class=\"ch bg et eu ev ew\">\n<p id=\"8324\" class=\"mx my fo be mz na nb nc nd ne nf ng nh ni nj nk nl nm nn no np nq nr ns nt nu bj\">References:<\/p>\n<ul class=\"\">\n<li data-selectable-paragraph=\"\"><a href=\"https:\/\/www.manning.com\/books\/human-in-the-loop-machine-learning?source=post_page-----10c975ff3b1b--------------------------------\">Human-in-the-Loop Machine Learning<\/a><\/li>\n<li id=\"96db\" class=\"nv nw fo be b gm on ny nz gp oo ob oc ni pw oe of nm px oh oi nq py ok ol om pz qa qb bj\" data-selectable-paragraph=\"\"><a class=\"af mw\" href=\"https:\/\/www.aaai.org\/ojs\/index.php\/aimagazine\/article\/view\/2513\" target=\"_blank\" rel=\"noopener ugc nofollow\">Power to the People: The Role of Humans in 
Interactive Machine Learning<\/a><\/li>\n<li id=\"e9f1\" class=\"nv nw fo be b gm qc ny nz gp qd ob oc ni qe oe of nm qf oh oi nq qg ok ol om pz qa qb bj\" data-selectable-paragraph=\"\"><a class=\"af mw\" href=\"https:\/\/hai.stanford.edu\/blog\/humans-loop-design-interactive-ai-systems\" target=\"_blank\" rel=\"noopener ugc nofollow\">Humans in the Loop: The Design of Interactive AI Systems<\/a><\/li>\n<li id=\"f091\" class=\"nv nw fo be b gm qc ny nz gp qd ob oc ni qe oe of nm qf oh oi nq qg ok ol om pz qa qb bj\" data-selectable-paragraph=\"\"><a class=\"af mw\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/01\/Guidelines-for-Human-AI-Interaction-camera-ready.pdf\" target=\"_blank\" rel=\"noopener ugc nofollow\">Guidelines for Human-AI Interaction<\/a><\/li>\n<li id=\"892b\" class=\"nv nw fo be b gm qc ny nz gp qd ob oc ni qe oe of nm qf oh oi nq qg ok ol om pz qa qb bj\" data-selectable-paragraph=\"\"><a class=\"af mw\" href=\"https:\/\/cpb-us-e1.wpmucdn.com\/sites.northwestern.edu\/dist\/3\/3481\/files\/2012\/11\/Gerber_PrimingforBetterPerformanceInMicrotaskCrowdsourcing.pdf\" target=\"_blank\" rel=\"noopener ugc nofollow\">Priming for Better Performance in Microtask Crowdsourcing Environments<\/a><\/li>\n<\/ul>\n<\/div>\n<\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Inca Knot Writing Quipu-Image Source Is it necessary for humans to take part in the machine learning cycle? Contrary to what the movies we watch show us, today\u2019s artificial intelligence (AI) cannot do everything and learn everything on its own. It primarily, and to a large extent, needs the feedback it receives from people. 
I [&hellip;]<\/p>\n","protected":false},"author":38,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"customer_name":"","customer_description":"","customer_industry":"","customer_technologies":"","customer_logo":"","_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[6],"tags":[],"coauthors":[115],"class_list":["post-7458","post","type-post","status-publish","format-standard","hentry","category-machine-learning"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v25.9 (Yoast SEO v25.9) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Human-in-the-Loop Machine Learning - Comet<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Human-in-the-Loop Machine Learning\" \/>\n<meta property=\"og:description\" content=\"Inca Knot Writing Quipu-Image Source Is it necessary for humans to take part in the machine learning cycle? Contrary to what the movies we watch show us, today\u2019s artificial intelligence (AI) cannot do everything and learn everything on its own. It primarily, and to a large extent, needs the feedback it receives from people. 
I [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/\" \/>\n<meta property=\"og:site_name\" content=\"Comet\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/cometdotml\" \/>\n<meta property=\"article:published_time\" content=\"2023-09-13T00:00:28+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-04-24T17:14:10+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/miro.medium.com\/v2\/resize:fit:625\/0*49Eg8Z6-NJjLT8qA.jpg\" \/>\n<meta name=\"author\" content=\"Ayyuce Kizrak\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@Cometml\" \/>\n<meta name=\"twitter:site\" content=\"@Cometml\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Ayyuce Kizrak\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"15 minutes\" \/>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Human-in-the-Loop Machine Learning - Comet","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/","og_locale":"en_US","og_type":"article","og_title":"Human-in-the-Loop Machine Learning","og_description":"Inca Knot Writing Quipu-Image Source Is it necessary for humans to take part in the machine learning cycle? Contrary to what the movies we watch show us, today\u2019s artificial intelligence (AI) cannot do everything and learn everything on its own. It primarily, and to a large extent, needs the feedback it receives from people. 
I [&hellip;]","og_url":"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/","og_site_name":"Comet","article_publisher":"https:\/\/www.facebook.com\/cometdotml","article_published_time":"2023-09-13T00:00:28+00:00","article_modified_time":"2025-04-24T17:14:10+00:00","og_image":[{"url":"https:\/\/miro.medium.com\/v2\/resize:fit:625\/0*49Eg8Z6-NJjLT8qA.jpg","type":"","width":"","height":""}],"author":"Ayyuce Kizrak","twitter_card":"summary_large_image","twitter_creator":"@Cometml","twitter_site":"@Cometml","twitter_misc":{"Written by":"Ayyuce Kizrak","Est. reading time":"15 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/#article","isPartOf":{"@id":"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/"},"author":{"name":"Ayyuce Kizrak","@id":"https:\/\/www.comet.com\/site\/#\/schema\/person\/06ea8c9cc060b86368361ec497fae86d"},"headline":"Human-in-the-Loop Machine Learning","datePublished":"2023-09-13T00:00:28+00:00","dateModified":"2025-04-24T17:14:10+00:00","mainEntityOfPage":{"@id":"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/"},"wordCount":2802,"publisher":{"@id":"https:\/\/www.comet.com\/site\/#organization"},"image":{"@id":"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/#primaryimage"},"thumbnailUrl":"https:\/\/miro.medium.com\/v2\/resize:fit:625\/0*49Eg8Z6-NJjLT8qA.jpg","articleSection":["Machine Learning"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/","url":"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/","name":"Human-in-the-Loop Machine Learning - 
Comet","isPartOf":{"@id":"https:\/\/www.comet.com\/site\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/#primaryimage"},"image":{"@id":"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/#primaryimage"},"thumbnailUrl":"https:\/\/miro.medium.com\/v2\/resize:fit:625\/0*49Eg8Z6-NJjLT8qA.jpg","datePublished":"2023-09-13T00:00:28+00:00","dateModified":"2025-04-24T17:14:10+00:00","breadcrumb":{"@id":"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/#primaryimage","url":"https:\/\/miro.medium.com\/v2\/resize:fit:625\/0*49Eg8Z6-NJjLT8qA.jpg","contentUrl":"https:\/\/miro.medium.com\/v2\/resize:fit:625\/0*49Eg8Z6-NJjLT8qA.jpg"},{"@type":"BreadcrumbList","@id":"https:\/\/www.comet.com\/site\/blog\/human-in-the-loop-machine-learning\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.comet.com\/site\/"},{"@type":"ListItem","position":2,"name":"Human-in-the-Loop Machine Learning"}]},{"@type":"WebSite","@id":"https:\/\/www.comet.com\/site\/#website","url":"https:\/\/www.comet.com\/site\/","name":"Comet","description":"Build Better Models Faster","publisher":{"@id":"https:\/\/www.comet.com\/site\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.comet.com\/site\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.comet.com\/site\/#organization","name":"Comet ML, 
Inc.","alternateName":"Comet","url":"https:\/\/www.comet.com\/site\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.comet.com\/site\/#\/schema\/logo\/image\/","url":"https:\/\/www.comet.com\/site\/wp-content\/uploads\/2025\/01\/logo_comet_square.png","contentUrl":"https:\/\/www.comet.com\/site\/wp-content\/uploads\/2025\/01\/logo_comet_square.png","width":310,"height":310,"caption":"Comet ML, Inc."},"image":{"@id":"https:\/\/www.comet.com\/site\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/cometdotml","https:\/\/x.com\/Cometml","https:\/\/www.youtube.com\/channel\/UCmN63HKvfXSCS-UwVwmK8Hw"]},{"@type":"Person","@id":"https:\/\/www.comet.com\/site\/#\/schema\/person\/06ea8c9cc060b86368361ec497fae86d","name":"Ayyuce Kizrak","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.comet.com\/site\/#\/schema\/person\/image\/d060b010f203ff9b479a8070f8b5aab3","url":"https:\/\/secure.gravatar.com\/avatar\/1ae1128bb0d30e171c0c279852fcfa94667474deebf74411ab2e06d0ac5bbda3?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/1ae1128bb0d30e171c0c279852fcfa94667474deebf74411ab2e06d0ac5bbda3?s=96&d=mm&r=g","caption":"Ayyuce 
Kizrak"},"url":"https:\/\/www.comet.com\/site\/blog\/author\/ayyucekizra\/"}]}},"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/posts\/7458","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/users\/38"}],"replies":[{"embeddable":true,"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/comments?post=7458"}],"version-history":[{"count":1,"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/posts\/7458\/revisions"}],"predecessor-version":[{"id":15547,"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/posts\/7458\/revisions\/15547"}],"wp:attachment":[{"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/media?parent=7458"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/categories?post=7458"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/tags?post=7458"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.comet.com\/site\/wp-json\/wp\/v2\/coauthors?post=7458"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}