{"id":3876,"date":"2020-09-11T12:35:13","date_gmt":"2020-09-11T10:35:13","guid":{"rendered":"https:\/\/immune.institute\/?p=3876"},"modified":"2020-09-11T12:35:13","modified_gmt":"2020-09-11T10:35:13","slug":"a-little-bit-of-strange-interesting-datasets-for-machine-learning","status":"publish","type":"post","link":"https:\/\/immune.institute\/en\/blog\/a-little-bit-of-strange-interesting-datasets-for-machine-learning\/","title":{"rendered":"A little bit of strange\/interesting Datasets for Machine Learning"},"content":{"rendered":"<h3><span style=\"color: #ffffff;\">A review outside the common datasets for Machine Learning<\/span><\/h3>\n<p>When you begin in the Machine Learning field, you usually use the common datasets such as MNIST, Iris, the 20 newsgroups, ... But there are hundreds of rare and interesting datasets that can be found online. At Immune Technology Institute we have asked our teachers to create a list of the most strange datasets they have found. Here we go!<\/p>\n<h3><span style=\"color: #ffffff;\">Price of Weed<\/span><\/h3>\n<p>This is a repository which contains a registry of the historical marijuana prices, which shows significant differentiation at the state level in prices. The question here is how the data has been collected?<\/p>\n<p style=\"text-align: center;\"><img decoding=\"async\" class=\"size-full wp-image-8218 aligncenter\" src=\"https:\/\/principal.immune.institute\/wp-content\/uploads\/2020\/09\/frysuspicious-1.gif\" alt=\"\" width=\"213\" height=\"160\" srcset=\"https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/frysuspicious-1.gif 213w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/frysuspicious-1-16x12.gif 16w\" sizes=\"(max-width: 213px) 100vw, 213px\" \/><\/p>\n<p>Although it may seem a useless dataset, it may be very relevant in the times we live in, as many countries are considering legalising marijuana.<\/p>\n<h3><span style=\"color: #ffffff;\">Length of chopsticks<\/span><\/h3>\n<p>If you have never asked yourself, as is normal, what is the optimal length of chopsticks, no worries, someone has asked this question before. A researcher team tried to evaluate the effects of the length of the chopsticks on the food-serving performance of adults and children. For this reason, they created this dataset for finding the optimal length of chopsticks.<\/p>\n<p><img decoding=\"async\" class=\"size-full wp-image-8210 aligncenter\" src=\"https:\/\/principal.immune.institute\/wp-content\/uploads\/2020\/09\/pexels-foodie-factor-539430-1024x683-1.jpeg\" alt=\"\" width=\"1024\" height=\"683\" srcset=\"https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/pexels-foodie-factor-539430-1024x683-1.jpeg 1024w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/pexels-foodie-factor-539430-1024x683-1-256x171.jpeg 256w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/pexels-foodie-factor-539430-1024x683-1-512x342.jpeg 512w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/pexels-foodie-factor-539430-1024x683-1-768x512.jpeg 768w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/pexels-foodie-factor-539430-1024x683-1-18x12.jpeg 18w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<p>They concluded that the food-pinching performance was considerably affected by the length of the chopsticks. The researchers suggested that families with children should provide both 240 and 180 mm long chopsticks. In addition, restaurants could provide 210 mm long chopsticks, considering the trade-offs between ergonomics and cost.<\/p>\n<h3><span style=\"color: #ffffff;\">Rice Images<\/span><\/h3>\n<p>A dataset which contains more than of 3500 rice grain's images of 2 different species. Different properties were extracted from each grain of rice, such as:<\/p>\n<ul>\n<li>The longest line that can be drawn on the rice grain<\/li>\n<li>The shortest line that can be drawn on the rice grain<\/li>\n<li>Or the perimeter of each grain.<\/li>\n<\/ul>\n<h3><span style=\"color: #ffffff;\">Popular dog names in Sweden<\/span><\/h3>\n<p>Did you know that the most popular dog name in Sweden is Molly?<\/p>\n<p><img decoding=\"async\" class=\"size-full wp-image-8220 aligncenter\" src=\"https:\/\/principal.immune.institute\/wp-content\/uploads\/2020\/09\/image6-1024x247-1-1.png\" alt=\"\" width=\"1024\" height=\"247\" srcset=\"https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image6-1024x247-1-1.png 1024w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image6-1024x247-1-1-256x62.png 256w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image6-1024x247-1-1-512x124.png 512w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image6-1024x247-1-1-768x185.png 768w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image6-1024x247-1-1-18x4.png 18w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<p>This dataset collects the most popular dog names in Sweden in 2018 by number of animals. Bella ranked the second most popular name, with almost six thousand animals, followed by the name Charlie, reaching a number of approximately 4600.<\/p>\n<h2><\/h2>\n<h3><span style=\"color: #ffffff;\">Flags Data Set<\/span><\/h3>\n<p>I am pretty sure that Sheldon will love this one... This dataset contains details of various nations and their flags, such as:<\/p>\n<ul>\n<li>The religion of each country.<\/li>\n<li>The predominant colour in the flag.<\/li>\n<li>If the flag contains a crescent moon or sunstars.<\/li>\n<li>If it contains an eagle, a tree, ...<\/li>\n<li style=\"list-style-type: none;\"><\/li>\n<\/ul>\n<p>Maybe it is interesting for predicting the religion of a country from its size and the colours in its flag.Sometimes it is also interesting to see how people find relationships in data where they are not visible to the naked eye. This website is an expert in finding correlations where no one else can find them, for example:<\/p>\n<h4><span style=\"color: #ffffff;\">Cheese consumption vs Number of people who died by becoming tangled in their sheets<\/span><\/h4>\n<p><img decoding=\"async\" class=\"alignnone size-full wp-image-8212\" src=\"https:\/\/principal.immune.institute\/wp-content\/uploads\/2020\/09\/image7-1024x403-1.png\" alt=\"\" width=\"1024\" height=\"403\" srcset=\"https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image7-1024x403-1.png 1024w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image7-1024x403-1-256x101.png 256w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image7-1024x403-1-512x202.png 512w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image7-1024x403-1-768x302.png 768w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image7-1024x403-1-18x7.png 18w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<h4><span style=\"color: #ffffff;\">Math doctorates awarded vs Uranium stored at US nuclear power plants<\/span><\/h4>\n<p><img decoding=\"async\" class=\"alignnone size-full wp-image-8214\" src=\"https:\/\/principal.immune.institute\/wp-content\/uploads\/2020\/09\/image2-1-1024x403-1.png\" alt=\"\" width=\"1024\" height=\"403\" srcset=\"https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image2-1-1024x403-1.png 1024w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image2-1-1024x403-1-256x101.png 256w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image2-1-1024x403-1-512x202.png 512w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image2-1-1024x403-1-768x302.png 768w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image2-1-1024x403-1-18x7.png 18w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<h4><span style=\"color: #ffffff;\">Total revenues generated by arcades vs Computer science doctorates awarded in the US<\/span><\/h4>\n<p><img decoding=\"async\" class=\"alignnone size-full wp-image-8215\" src=\"https:\/\/principal.immune.institute\/wp-content\/uploads\/2020\/09\/image5-1-1024x403-1.png\" alt=\"\" width=\"1024\" height=\"403\" srcset=\"https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image5-1-1024x403-1.png 1024w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image5-1-1024x403-1-256x101.png 256w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image5-1-1024x403-1-512x202.png 512w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image5-1-1024x403-1-768x302.png 768w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image5-1-1024x403-1-18x7.png 18w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<p>Discover new correlations using this website and share your results with us! ?<\/p>\n<p><img decoding=\"async\" class=\"size-full wp-image-8221 aligncenter\" src=\"https:\/\/principal.immune.institute\/wp-content\/uploads\/2020\/09\/image3-1-1024x234-1.png\" alt=\"\" width=\"1024\" height=\"234\" srcset=\"https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image3-1-1024x234-1.png 1024w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image3-1-1024x234-1-256x59.png 256w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image3-1-1024x234-1-512x117.png 512w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image3-1-1024x234-1-768x176.png 768w, https:\/\/immune.institute\/wp-content\/uploads\/2020\/09\/image3-1-1024x234-1-18x4.png 18w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<h3><span style=\"color: #ffffff;\">Who we are?<\/span><\/h3>\n<p>At <a href=\"https:\/\/bit.ly\/3hmbnxO\" target=\"_blank\" rel=\"noopener\">Immune Technology Institute<\/a> we try to apply and teach the most advanced technology at the computational field. Furthermore, we love sharing knowledge since we consider that it is when it becomes powerful.<\/p>\n<p>If you want to learn how to develop real-world applications or how to handle large amounts of data, you could be interested in our <a href=\"https:\/\/bit.ly\/33ix7Wn\" target=\"_blank\" rel=\"noopener\">Master in Data Science<\/a>. It is a program aimed at professionals that seek to specialize in Data Science, know the main Artificial Intelligence techniques and how to apply them into different industries.<\/p>\n<p>We will host an <a href=\"https:\/\/bit.ly\/32hJlPK\" target=\"_blank\" rel=\"noopener\">online information session<\/a> on September 24, with the director of the master, M\u00f3nica Villas. IMMUNE can help you boost your career through its partner companies and contacts with recruiters and professionals in the sector. You can sign up now.<\/p>\n<h3><span style=\"color: #ffffff;\"> Wait one more thing - Datathon <\/span><\/h3>\n<p>Do you want to be a data scientist? Sign up for the virtual Datathon organised by IMMUNE Technology Institute in collaboration with Spanish Startups on September 19th. Online training from the best data experts and a great challenge to test your knowledge. Don't miss out on the prize! You can sign up to the <a href=\"https:\/\/bit.ly\/3bLpDiu\" target=\"_blank\" rel=\"noopener\">Datathon<\/a> now<\/p>\n<p style=\"text-align: right;\">This article has been written by: <a href=\"https:\/\/medium.com\/u\/3b43171da13b\" target=\"_blank\" rel=\"noopener\">Alejandro Diaz Santos<\/a>- (<a href=\"https:\/\/www.linkedin.com\/in\/alejandro-diaz-santos-8aab812a\/\" target=\"_blank\" rel=\"noopener\">LinkedIn<\/a>, GitHub) for IMMUNE Technology Institute.<\/p>","protected":false},"excerpt":{"rendered":"<p>A review outside the common datasets for Machine Learning When you begin in the Machine Learning field, you usually use the common datasets such as MNIST, Iris, the 20 newsgroups, \u2026 But there are hundreds of rare and interesting datasets that can be found online. At Immune Technology Institute we have asked our teachers to [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":8210,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_crdt_document":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-3876","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-blog"],"acf":[],"_links":{"self":[{"href":"https:\/\/immune.institute\/en\/wp-json\/wp\/v2\/posts\/3876","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/immune.institute\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/immune.institute\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/immune.institute\/en\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/immune.institute\/en\/wp-json\/wp\/v2\/comments?post=3876"}],"version-history":[{"count":0,"href":"https:\/\/immune.institute\/en\/wp-json\/wp\/v2\/posts\/3876\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/immune.institute\/en\/wp-json\/wp\/v2\/media\/8210"}],"wp:attachment":[{"href":"https:\/\/immune.institute\/en\/wp-json\/wp\/v2\/media?parent=3876"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/immune.institute\/en\/wp-json\/wp\/v2\/categories?post=3876"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/immune.institute\/en\/wp-json\/wp\/v2\/tags?post=3876"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}