<?xml version="1.0" encoding="utf-8" standalone="yes" ?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>open-data | Eviota</title>
    <link>https://eviota.eu/tag/open-data/</link>
      <atom:link href="https://eviota.eu/tag/open-data/index.xml" rel="self" type="application/rss+xml" />
    <description>open-data</description>
    <generator>Wowchemy (https://wowchemy.com)</generator><language>en-us</language><lastBuildDate>Thu, 03 Nov 2022 17:30:00 +0000</lastBuildDate>
    <image>
      <url>https://eviota.eu/media/icon_hucb5b60c3ce6d78a39dc1b8454aefe125_21353_512x512_fill_lanczos_center_3.png</url>
      <title>open-data</title>
      <link>https://eviota.eu/tag/open-data/</link>
    </image>
    
    <item>
      <title>Big Data for All: Building Collaborative Data Observatories</title>
      <link>https://eviota.eu/talk/big-data-for-all-building-collaborative-data-observatories/</link>
      <pubDate>Thu, 03 Nov 2022 17:30:00 +0000</pubDate>
      <guid>https://eviota.eu/talk/big-data-for-all-building-collaborative-data-observatories/</guid>
      <description>&lt;p&gt;Reprex&amp;rsquo;s co-founder, &lt;a href=&#34;https://eviota.eu/authors/daniel_antal&#34;&gt;Daniel Antal&lt;/a&gt; talked in the &lt;a href=&#34;https://www.ehvinnovationcafe.org/past-events/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Eindhoven Innovation Café&lt;/a&gt; about these issues. You can watch the recorded version of the the livestream that starts at 5 minutes and 22 seconds:&lt;/p&gt;

&lt;div style=&#34;position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;&#34;&gt;
  &lt;iframe src=&#34;https://www.youtube.com/embed/kM54gAAbHY0&#34; style=&#34;position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;&#34; allowfullscreen title=&#34;YouTube Video&#34;&gt;&lt;/iframe&gt;
&lt;/div&gt;

&lt;p&gt;&lt;em&gt;This is a past event&lt;/em&gt;. Check out our forthcoming &lt;a href=&#34;https://eviota.eu/#talks&#34;&gt;events&lt;/a&gt; or write to &lt;a href=&#34;https://www.linkedin.com/in/antaldaniel/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;
  &lt;i class=&#34;fab fa-linkedin  pr-1 fa-fw&#34;&gt;&lt;/i&gt; Daniel Antal&lt;/a&gt;  or to &lt;a href=&#34;https://keybase.io/antaldaniel&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;
  &lt;i class=&#34;fab fa-keybase  pr-1 fa-fw&#34;&gt;&lt;/i&gt; antaldaniel&lt;/a&gt;. Or send an &lt;a href=&#34;https://eviota.eu/contact/&#34;&gt;
  &lt;i class=&#34;fas fa-envelope  pr-1 fa-fw&#34;&gt;&lt;/i&gt; email&lt;/a&gt;.&lt;/p&gt;
&lt;h2 id=&#34;the-event-invitation-text-and-links&#34;&gt;The event invitation text and links&lt;/h2&gt;
&lt;p&gt;&lt;code&gt;Big data and AI creates inequalities&lt;/code&gt;. It puts historically marginalized people, like ethnic minorities, and womxn, at a disadvantage. Because AI and checking on AI require plenty of data, usually only giant corporations, the wealthiest governments, and university entities can make it work for them. Reprex is a Hague-based, international startup that wants to impact various sustainable development goals by enabling smaller organizations to join their smaller datasets, use open data, create linked available data, and collaboratively make a change.&lt;/p&gt;
&lt;p&gt;Reprex is a finalist for the &lt;code&gt;Hague Innovation Award&lt;/code&gt; for impact startup (please 🙏, &lt;a href=&#34;https://reprex.nl/post/2022-10-29_reprex-talk-to-all/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;vote for us&lt;/a&gt;!). Daniel Antal, one of the co-founders, will talk about their approach to building an international coalition of music organizations to pool data and challenge data monopolies using organizational techniques, a collaboration ethos, and data from the open-source developer world.&lt;/p&gt;
&lt;p&gt;Using the example of independent music creators, who often find themselves in a position where it is more expensive to claim their money from global platforms, he will talk about how to reduce inequalities in the world of big data and AI with collaboration on web 3.0. In the Q&amp;amp;A he will take questions on how to apply their know-how, and generally linked open data to other art+tech or creative segments or problems for which everybody is too small, like meeting the Paris Accord greenhouse gas targets bit by bit, small company by small company.&lt;/p&gt;
&lt;h2 id=&#34;in-the-qa-we-can-discuss-many-things&#34;&gt;In the Q&amp;amp;A, we can discuss many things&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;input checked=&#34;&#34; disabled=&#34;&#34; type=&#34;checkbox&#34;&gt; How can Reprex help an individual creator in music, or in fashion and design, or any other area?&lt;/li&gt;
&lt;li&gt;&lt;input checked=&#34;&#34; disabled=&#34;&#34; type=&#34;checkbox&#34;&gt; What sort of help it can give to researchers, research institutes, specialist consultancies, law firms, and other knowledge-based actors?&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;What sort of partners is &lt;a href=&#34;https://reprex.nl/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Reprex&lt;/a&gt; looking for in &lt;code&gt;Eindhoven&lt;/code&gt;?&lt;/p&gt;
&lt;h2 id=&#34;check-out-our-projects&#34;&gt;Check out our projects&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;input checked=&#34;&#34; disabled=&#34;&#34; type=&#34;checkbox&#34;&gt; &lt;a href=&#34;https://music.dataobservatory.eu/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Digital Music Observatory&lt;/a&gt; and &lt;a href=&#34;https://music.dataobservatory.eu/project/listen-local/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Listen Local&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;input checked=&#34;&#34; disabled=&#34;&#34; type=&#34;checkbox&#34;&gt; &lt;a href=&#34;https://ccsi.dataobservatory.eu/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Cultural &amp;amp; Creative Sectors and Industries Observatory&lt;/a&gt; and short call for potential partners.&lt;/li&gt;
&lt;li&gt;&lt;input checked=&#34;&#34; disabled=&#34;&#34; type=&#34;checkbox&#34;&gt; G&lt;a href=&#34;https://greendeal.dataobservatory.eu/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;reen Deal Data Observatory&lt;/a&gt; and simple, connected, financial and sustainability reporting for creative enterprises and others&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;reprex-the-impact-startup&#34;&gt;Reprex: the impact startup&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;&lt;input checked=&#34;&#34; disabled=&#34;&#34; type=&#34;checkbox&#34;&gt; Check out our accomplishments since the foundation in 2020&lt;/li&gt;
&lt;/ul&gt;
</description>
    </item>
    
    <item>
      <title>Crunchconf: Open Data, New Gold Without the Rush</title>
      <link>https://eviota.eu/talk/crunchconf-open-data-new-gold-without-the-rush/</link>
      <pubDate>Fri, 08 Oct 2021 10:10:00 +0000</pubDate>
      <guid>https://eviota.eu/talk/crunchconf-open-data-new-gold-without-the-rush/</guid>
      <description>&lt;p&gt;Every year, the EU announces that billions and billions of data are now “open” again, but this is not gold. At least not in the form of nicely minted gold coins, but in gold dust and nuggets found in the muddy banks of chilly rivers. There is no rush for it, because panning out its value requires a lot of hours of hard work. Our goal is to automate this work to make open data usable at scale, even in trustworthy AI solutions.&lt;/p&gt;
&lt;h2 id=&#34;summary&#34;&gt;Summary&lt;/h2&gt;
&lt;p&gt;In his presentation, Daniel compared the current state of open data (including governmental open data and scientific open data) to a thrift store.  You can often find bargains, or historical data that would be impossible to source from data vendors, but on a strictly as-is basis, without a catalogue, service, or guarantee. Therefore, working with open data requires a careful reprocessing, validation, and in many cases, frequent re-validation. Open data is often over-estimated: it is never a finished product, often it cannot even be downloaded, therefore it requires further investment to make it valuable. However, because most open data arrives from the governmental sector, you can tap into information sources where no market alternative exists.  Open data in some cases may be a cheaper substitute to market vendors, but often it is an exclusive source of information that do not have any market vendors.&lt;/p&gt;
&lt;td style=&#34;text-align: center;&#34;&gt;















&lt;figure  id=&#34;figure-sisyphus-was-punished-by-being-forced-to-roll-an-immense-boulder-up-a-hill-only-for-it-to-roll-down-every-time-it-neared-the-top-repeating-this-action-for-eternity--this-is-the-price-that-project-managers-and-analysts-pay-for-the-inadequate-documentation-of-their-data-assets&#34;&gt;
  &lt;div class=&#34;d-flex justify-content-center&#34;&gt;
    &lt;div class=&#34;w-100&#34; &gt;&lt;img alt=&#34;Sisyphus was punished by being forced to roll an immense boulder up a hill only for it to roll down every time it neared the top, repeating this action for eternity.  This is the price that project managers and analysts pay for the inadequate documentation of their data assets.&#34; srcset=&#34;
               /media/img/blogposts_2021/Sisyphus_Bodleian_Library_hu99f0c1d6c82963b9538437670b4d339d_1662894_cd48a6c374c9ff68a08abe79a6abf2f4.webp 400w,
               /media/img/blogposts_2021/Sisyphus_Bodleian_Library_hu99f0c1d6c82963b9538437670b4d339d_1662894_a6eb1b13ff33a5c73aba34550964ff52.webp 760w,
               /media/img/blogposts_2021/Sisyphus_Bodleian_Library_hu99f0c1d6c82963b9538437670b4d339d_1662894_1200x1200_fit_q75_h2_lanczos_3.webp 1200w&#34;
               src=&#34;https://eviota.eu/media/img/blogposts_2021/Sisyphus_Bodleian_Library_hu99f0c1d6c82963b9538437670b4d339d_1662894_cd48a6c374c9ff68a08abe79a6abf2f4.webp&#34;
               width=&#34;760&#34;
               height=&#34;507&#34;
               loading=&#34;lazy&#34; data-zoomable /&gt;&lt;/div&gt;
  &lt;/div&gt;&lt;figcaption&gt;
      Sisyphus was punished by being forced to roll an immense boulder up a hill only for it to roll down every time it neared the top, repeating this action for eternity.  This is the price that project managers and analysts pay for the inadequate documentation of their data assets.
    &lt;/figcaption&gt;&lt;/figure&gt;&lt;/td&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;p&gt;The practices related to the exploitation of open data are not only relevant in an open data context: these are good data ingestion and procurement practices for &lt;em&gt;any&lt;/em&gt; third party data, and in large organizations, for any cross-departmental data. (See the blogpost: &lt;a href=&#34;https://dataandlyrics.com/post/2021-07-08-data-sisyphus/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;The Data Sisyphus&lt;/a&gt;.)&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;Case Study:  &lt;a href=&#34;https://greendeal.dataobservatory.eu/post/2021-04-23-belgium-flood-insurance/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Belgian Drought/Flood Risk Awareness, Financial Capacity &amp;amp; Hydrology&lt;/a&gt; a complex integration of various open data sources.&lt;/p&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;In the second part of the presentation, Daniel talked about our modern data observatory concept.  We have reviewed about 80 functioning and already defunct international data collection programs.  Data observatories, like Copernicus’ Observatory, are permanent infrastructure to record various domain-specific data, such as alternative fuel information, information on homelessness, or on the European music business.  In our assessment, most of the EU, OECD, UNESCO recognized or endorsed observatories use obsolete technology and do not rely on the new achievements of data science. Reprex, our start-up offers an open source, open data based alternative solution to build largely automated data observatories.  We believe that human judgement is needed in data curation, but processing, documentation and validation is best done by computers.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Case Study: &lt;a href=&#34;https://greendeal.dataobservatory.eu/post/2021-03-06-regions-climate/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Reprocessing geographical information with administrative boundary changes&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p&gt;At last, he presented a few development directions with our open-source software, mentioning our work withing the rOpenGov community. This part of the presentation was originally meant to open the way for a half-day open data workshop, but due to the current pandemic situation, the physical part of the conference and the workshops were not held.&lt;/p&gt;
&lt;p&gt;The presentation largely included the topics of our Data &amp;amp; Lyrics blogpost: &lt;a href=&#34;https://greendeal.dataobservatory.eu/post/2021-06-18-gold-without-rush/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Open Data&amp;mdash;The New Gold Without the Rush&lt;/a&gt;&lt;/p&gt;
&lt;h2 id=&#34;presentation-slides&#34;&gt;Presentation Slides&lt;/h2&gt;
&lt;p&gt;See the presentation slides &lt;a href=&#34;https://reprex.nl/slides/crunchconf_2021/#/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;here&lt;/a&gt;.&lt;/p&gt;
</description>
    </item>
    
    <item>
      <title>Open Data</title>
      <link>https://eviota.eu/data/open-gov/</link>
      <pubDate>Sun, 16 May 2021 00:00:00 +0000</pubDate>
      <guid>https://eviota.eu/data/open-gov/</guid>
      <description>&lt;p&gt;Many countries in the world allow access to a vast array of information,
such as documents under freedom of information requests, statistics,
datasets. In the European Union, most taxpayer financed data in
government administration, transport, or meteorology, for example, can
be usually re-used. More and more scientific output is expected to be
reviewable and reproducible, which implies open access.&lt;/p&gt;
&lt;table&gt;
&lt;tbody&gt;
&lt;tr class=&#34;odd&#34;&gt;
&lt;td style=&#34;text-align: center;&#34;&gt;















&lt;figure  id=&#34;figure-whats-the-problem-with-open-datadataopen-govopen-data-problems&#34;&gt;
  &lt;div class=&#34;d-flex justify-content-center&#34;&gt;
    &lt;div class=&#34;w-100&#34; &gt;&lt;img alt=&#34;[What’s the Problem with Open Data?](/data/open-gov/#open-data-problems)&#34; srcset=&#34;
               /media/img/blogposts_2021/photo-1490004047268-5259045aa2b4_hu331aa960ddebc9d36b9a1e22e865106f_141153_ae0cb4c268f9c1c26caa19ff8480a54f.webp 400w,
               /media/img/blogposts_2021/photo-1490004047268-5259045aa2b4_hu331aa960ddebc9d36b9a1e22e865106f_141153_5f50346eb16b09053e80859ddd34afd5.webp 760w,
               /media/img/blogposts_2021/photo-1490004047268-5259045aa2b4_hu331aa960ddebc9d36b9a1e22e865106f_141153_1200x1200_fit_q75_h2_lanczos.webp 1200w&#34;
               src=&#34;https://eviota.eu/media/img/blogposts_2021/photo-1490004047268-5259045aa2b4_hu331aa960ddebc9d36b9a1e22e865106f_141153_ae0cb4c268f9c1c26caa19ff8480a54f.webp&#34;
               width=&#34;760&#34;
               height=&#34;500&#34;
               loading=&#34;lazy&#34; data-zoomable /&gt;&lt;/div&gt;
  &lt;/div&gt;&lt;figcaption&gt;
      &lt;a href=&#34;https://eviota.eu/data/open-gov/#open-data-problems&#34;&gt;What’s the Problem with Open Data?&lt;/a&gt;
    &lt;/figcaption&gt;&lt;/figure&gt;&lt;/td&gt;
&lt;td style=&#34;text-align: center;&#34;&gt;















&lt;figure  id=&#34;figure-how-we-add-valuedataopen-govopen-data-value-added&#34;&gt;
  &lt;div class=&#34;d-flex justify-content-center&#34;&gt;
    &lt;div class=&#34;w-100&#34; &gt;&lt;img alt=&#34;[How We Add Value?](/data/open-gov/#open-data-value-added)&#34; srcset=&#34;
               /media/img/blogposts_2021/photo-1590247813693-5541d1c609fd_hu3d03a01dcc18bc5be0e67db3d8d209a6_248038_5f1fd418bebab4c2ebc0d0c2ca3af8ca.webp 400w,
               /media/img/blogposts_2021/photo-1590247813693-5541d1c609fd_hu3d03a01dcc18bc5be0e67db3d8d209a6_248038_6fd01a5fb846437bf228ba62c7ebace7.webp 760w,
               /media/img/blogposts_2021/photo-1590247813693-5541d1c609fd_hu3d03a01dcc18bc5be0e67db3d8d209a6_248038_1200x1200_fit_q75_h2_lanczos.webp 1200w&#34;
               src=&#34;https://eviota.eu/media/img/blogposts_2021/photo-1590247813693-5541d1c609fd_hu3d03a01dcc18bc5be0e67db3d8d209a6_248038_5f1fd418bebab4c2ebc0d0c2ca3af8ca.webp&#34;
               width=&#34;760&#34;
               height=&#34;485&#34;
               loading=&#34;lazy&#34; data-zoomable /&gt;&lt;/div&gt;
  &lt;/div&gt;&lt;figcaption&gt;
      &lt;a href=&#34;https://eviota.eu/data/open-gov/#open-data-value-added&#34;&gt;How We Add Value?&lt;/a&gt;
    &lt;/figcaption&gt;&lt;/figure&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;table&gt;
&lt;tbody&gt;
&lt;tr class=&#34;even&#34;&gt;
&lt;td style=&#34;text-align: center;&#34;&gt;















&lt;figure  id=&#34;figure-is-there-value-in-itdataopen-govis-there-value-left-in-open-data-if-its-money-on-the-street-why-nobodys-picking-it-up&#34;&gt;
  &lt;div class=&#34;d-flex justify-content-center&#34;&gt;
    &lt;div class=&#34;w-100&#34; &gt;&lt;img alt=&#34;[Is There Value in It?](/data/open-gov/#is-there-value-left-in-open-data) If it’s money on the street, why nobody’s picking it up?&#34; srcset=&#34;
               /media/img/blogposts_2021/photo-1533580909002-a2f298d005eb_hu3d03a01dcc18bc5be0e67db3d8d209a6_216527_901e592f7c9f6e8ca557f4406f0f035c.webp 400w,
               /media/img/blogposts_2021/photo-1533580909002-a2f298d005eb_hu3d03a01dcc18bc5be0e67db3d8d209a6_216527_22bfc341b198033cdb1b86d89666cc2d.webp 760w,
               /media/img/blogposts_2021/photo-1533580909002-a2f298d005eb_hu3d03a01dcc18bc5be0e67db3d8d209a6_216527_1200x1200_fit_q75_h2_lanczos.webp 1200w&#34;
               src=&#34;https://eviota.eu/media/img/blogposts_2021/photo-1533580909002-a2f298d005eb_hu3d03a01dcc18bc5be0e67db3d8d209a6_216527_901e592f7c9f6e8ca557f4406f0f035c.webp&#34;
               width=&#34;760&#34;
               height=&#34;507&#34;
               loading=&#34;lazy&#34; data-zoomable /&gt;&lt;/div&gt;
  &lt;/div&gt;&lt;figcaption&gt;
      &lt;a href=&#34;https://eviota.eu/data/open-gov/#is-there-value-left-in-open-data&#34;&gt;Is There Value in It?&lt;/a&gt; &lt;/br&gt;If it’s money on the street, why nobody’s picking it up?
    &lt;/figcaption&gt;&lt;/figure&gt;&lt;/td&gt;
&lt;td style=&#34;text-align: center;&#34;&gt;















&lt;figure  id=&#34;figure-datasets-should-work-together-to-give-informationdataopen-govdata-integrationdata-is-only-potential-information-raw-and-unprocessed&#34;&gt;
  &lt;div class=&#34;d-flex justify-content-center&#34;&gt;
    &lt;div class=&#34;w-100&#34; &gt;&lt;img alt=&#34;[Datasets Should Work Together to Give Information](/data/open-gov/#data-integration)Data is only potential information, raw and unprocessed.&#34; srcset=&#34;
               /media/img/blogposts_2021/photo-1605143185650-77944b152643_hu06aa329509a03282a5595aa6ba78c818_94734_ae9420964ff046e7ae2a427c9ea41f0f.webp 400w,
               /media/img/blogposts_2021/photo-1605143185650-77944b152643_hu06aa329509a03282a5595aa6ba78c818_94734_4f942cac9675014ad1f5265a7d89c462.webp 760w,
               /media/img/blogposts_2021/photo-1605143185650-77944b152643_hu06aa329509a03282a5595aa6ba78c818_94734_1200x1200_fit_q75_h2_lanczos.webp 1200w&#34;
               src=&#34;https://eviota.eu/media/img/blogposts_2021/photo-1605143185650-77944b152643_hu06aa329509a03282a5595aa6ba78c818_94734_ae9420964ff046e7ae2a427c9ea41f0f.webp&#34;
               width=&#34;760&#34;
               height=&#34;507&#34;
               loading=&#34;lazy&#34; data-zoomable /&gt;&lt;/div&gt;
  &lt;/div&gt;&lt;figcaption&gt;
      &lt;a href=&#34;https://eviota.eu/data/open-gov/#data-integration&#34;&gt;Datasets Should Work Together to Give Information&lt;/a&gt;&lt;/br&gt;Data is only potential information, raw and unprocessed.
    &lt;/figcaption&gt;&lt;/figure&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;
&lt;h2 id=&#34;open-data-problems&#34;&gt;What’s the Problem with Open Data?&lt;/h2&gt;
&lt;p&gt;&lt;em&gt;“Data is stuff. It is raw, unprocessed, possibly even untouched by human
hands, unviewed by human eyes, un-thought-about by human minds.”&lt;/em&gt; [1]&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Most open data cannot be just &lt;a href=&#34;#open-data-faq&#34;&gt;&amp;ldquo;downloaded.&amp;rdquo;&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;Often, you need to put more than $100 value of &lt;a href=&#34;#is-there-value-left-in-open-data&#34;&gt;work&lt;/a&gt; into processing, validating, documenting a dataset that is worth $100. But you can share this investment with our data observatories.&lt;/li&gt;
&lt;li&gt;Open data is almost always lacking of documentation, and no clear references to validate if the data is reliable or not corrupted. This is why we always &lt;a href=&#34;#open-data-value-added&#34;&gt;start&lt;/a&gt; with reprocessing and redocumenting.&lt;/li&gt;
&lt;/ul&gt;
















&lt;figure  id=&#34;figure-our-review-of-about-80-eu-un-and-oecd-data-observatories-reveals-that-most-of-them-do-not-use-these-organizationss-open-data---instead-they-use-various-and-often-not-well-processed-proprietary-sources&#34;&gt;
  &lt;div class=&#34;d-flex justify-content-center&#34;&gt;
    &lt;div class=&#34;w-100&#34; &gt;&lt;img alt=&#34;Our review of about 80 EU, UN and OECD data observatories reveals that most of them do not use these organizations&amp;#39;s open data - instead they use various, and often not well processed proprietary sources.&#34; srcset=&#34;
               /media/img/observatory_screenshots/observatory_collage_16x9_800_hu47f74f5cdae63c7248c2367b9d148671_353025_0079ea9844f6c5e52b52fd0e627467a2.webp 400w,
               /media/img/observatory_screenshots/observatory_collage_16x9_800_hu47f74f5cdae63c7248c2367b9d148671_353025_ecd6d08ba5e9bac19c8173546f036651.webp 760w,
               /media/img/observatory_screenshots/observatory_collage_16x9_800_hu47f74f5cdae63c7248c2367b9d148671_353025_1200x1200_fit_q75_h2_lanczos_3.webp 1200w&#34;
               src=&#34;https://eviota.eu/media/img/observatory_screenshots/observatory_collage_16x9_800_hu47f74f5cdae63c7248c2367b9d148671_353025_0079ea9844f6c5e52b52fd0e627467a2.webp&#34;
               width=&#34;760&#34;
               height=&#34;428&#34;
               loading=&#34;lazy&#34; data-zoomable /&gt;&lt;/div&gt;
  &lt;/div&gt;&lt;figcaption&gt;
      Our review of about 80 EU, UN and OECD data observatories reveals that most of them do not use these organizations&amp;rsquo;s open data - instead they use various, and often not well processed proprietary sources.
    &lt;/figcaption&gt;&lt;/figure&gt;
&lt;p&gt;Read more: &lt;a href=&#34;https://dataandlyrics.com/post/2021-06-18-gold-without-rush/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Open Data - The New Gold Without the
Rush&lt;/a&gt;&lt;/p&gt;
&lt;h2 id=&#34;open-data-value-added&#34;&gt;How We Add Value?&lt;/h2&gt;
&lt;ul&gt;
&lt;li&gt;We believe that even such generally trusted data sources as Eurostat
often need to be reprocessed, because various legal and political
constraints do not allow the common European statistical services to
provide optimal quality data – for example, on the regional and city
levels.&lt;/li&gt;
&lt;li&gt;With
&lt;a href=&#34;https://greendeal.dataobservatory.eu/authors/ropengov/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;rOpenGov&lt;/a&gt;
and other partners, we are creating open-source statistical software
in R to re-process these heterogenous and low-quality data into tidy
statistical indicators to automatically validate and document it.&lt;/li&gt;
&lt;li&gt;Metadata is a potentially informative data record about a
potentially informative dataset. We are carefully documenting and
releasing administrative, processing, and descriptive metadata,
following international metadata standards, to make our data easy to
find and easy to use for data analysts.&lt;/li&gt;
&lt;li&gt;We are automatically creating depositions and authoritative copies
marked with an individual digital object identifier (DOI) to
maintain data integrity.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;is-there-value-left-in-open-data&#34;&gt;Is There Value in Open Data?&lt;/h2&gt;
&lt;p&gt;&lt;em&gt;A well-known story tells of a finance professor and a student who come across a $100 bill lying on the ground. As the student stops to pick it up, the professor says, “Don’t bother—if it were really a $100 bill, it wouldn’t be there.”&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;But this is not the case with open data.  Often, you need to put more than $100 into processing, validating, documenting a dataset that is worth $100.&lt;/p&gt;
&lt;p&gt;In the EU, open data is governed by the &lt;a href=&#34;https://eur-lex.europa.eu/legal-content/EN/TXT/?qid=1561563110433&amp;amp;uri=CELEX:32019L1024&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Directive on open data and the re-use of public sector information - in short: Open Data Directive (EU) 2019 / 1024&lt;/a&gt;. It entered into force on 16 July 2019. It replaces the &lt;a href=&#34;https://eur-lex.europa.eu/legal-content/en/ALL/?uri=CELEX:32003L0098&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Public Sector Information Directive&lt;/a&gt;, also known as the &lt;em&gt;PSI Directive&lt;/em&gt; which dated from 2003 and was subsequently amended in 2013.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Open Data&lt;/strong&gt; is &lt;em&gt;potentially&lt;/em&gt; useful data that can &lt;em&gt;potentially&lt;/em&gt; replace costlier or hard to get data sources to build information. It is analogous to potential energy: work is required to release it. We build automated systems that reduce this work and increase the likelihood that open data will offer the &lt;em&gt;best value for money&lt;/em&gt;.&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;Most open data is not publicy accessible, and available upon request. Our real curatorial advantage is that we know where it is and how to get this request processed.&lt;/li&gt;
&lt;li&gt;Most European open data comes from tax authorities, meteorological
offices, managers of transport infrastructure, and other
governmental bodies whose data needs are very different from yours.
Their data must be carefully evaluated, re-processed, and if
necessary, imputed to be usable for your scientific, business or
policy goals.&lt;/li&gt;
&lt;li&gt;The use of open science data is problematic in different ways:
usually understanding the data documentation requires
domain-specific specialist knowledge. &lt;a href=&#34;https://eviota.eu/data/open-science/&#34;&gt;Open science
data&lt;/a&gt; is even more scattered and difficult to
access than technically open, but not public governmental data.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;data-integration&#34;&gt;From Datasets to Data Integration, Data to Information&lt;/h2&gt;
&lt;p&gt;“Data is only potential information, raw and unprocessed, prior to
anyone actually being informed by it.” ^[2]&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;We are building simple databases and supporting APIs that release
the data without restrictions, in a tidy format that is easy to join
with other data, or easy to join into databases, together with
standardized metadata.&lt;/li&gt;
&lt;/ul&gt;
















&lt;figure  id=&#34;figure-our-service-flow-and-value-chain&#34;&gt;
  &lt;div class=&#34;d-flex justify-content-center&#34;&gt;
    &lt;div class=&#34;w-100&#34; &gt;&lt;img alt=&#34;Our service flow and value chain&#34; srcset=&#34;
               /media/img/slides/automated_observatory_value_chain_huf9c0a6d9b150a8fdeb42cadf99abee90_616274_c18a97f00bbcac322614b6c2d55783f6.webp 400w,
               /media/img/slides/automated_observatory_value_chain_huf9c0a6d9b150a8fdeb42cadf99abee90_616274_8b655e803b41b817a8093a37ccd19689.webp 760w,
               /media/img/slides/automated_observatory_value_chain_huf9c0a6d9b150a8fdeb42cadf99abee90_616274_1200x1200_fit_q75_h2_lanczos.webp 1200w&#34;
               src=&#34;https://eviota.eu/media/img/slides/automated_observatory_value_chain_huf9c0a6d9b150a8fdeb42cadf99abee90_616274_c18a97f00bbcac322614b6c2d55783f6.webp&#34;
               width=&#34;760&#34;
               height=&#34;428&#34;
               loading=&#34;lazy&#34; data-zoomable /&gt;&lt;/div&gt;
  &lt;/div&gt;&lt;figcaption&gt;
      Our service flow and value chain
    &lt;/figcaption&gt;&lt;/figure&gt;
&lt;h2 id=&#34;open-data-faq&#34;&gt;FAQ&lt;/h2&gt;
&lt;h3 id=&#34;why-downloading-does-not-work&#34;&gt;Why Downloading Does Not Work?&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Most open data is not available on the internet.&lt;/li&gt;
&lt;li&gt;If it is available, it is not in a form that you can easily import into a spreadsheet application like Excel or OpenOffice, or into a statistical application like SPSS or STATA.&lt;/li&gt;
&lt;li&gt;Even the data quality of trusted web sources, like the Eurostat website, can be very low. Eurostat just publishes what it gets from governments, and often has no mandate to fix errors.  The data is full with missing information, and in the case of regional statistics, faulty region codes and region names that make matching your data or placing them on a map impossible.&lt;/li&gt;
&lt;li&gt;Adjusting euros with millions of euros, correctly translating dollars to euros, pounds to kilograms requires plenty of work. This is a very error-prone process when done by humans.&lt;/li&gt;
&lt;/ul&gt;
&lt;h3 id=&#34;can-open-data-be-used-in-machine-learning-and-ai&#34;&gt;Can Open Data be Used in Machine Learning and AI?&lt;/h3&gt;
&lt;ul&gt;
&lt;li&gt;Most public and open data sources have many missing observations; machine learning models usually cannot hanlde missingness. These points must be carefully imputed with approximations, which can be very challenging when the data has geographical dimension.&lt;/li&gt;
&lt;li&gt;Removing missing values makes samples extremely biased and your model will learn from omissions, not information.&lt;/li&gt;
&lt;/ul&gt;
&lt;h2 id=&#34;photo-credits&#34;&gt;Photo Credits&lt;/h2&gt;
&lt;p&gt;&lt;em&gt;What&amp;rsquo;s the Problem with Open Data?&lt;/em&gt; illustration is a photo by &lt;a href=&#34;https://unsplash.com/photos/8hJQKRIQZMY&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Cristina Gottardi&lt;/a&gt;
&lt;em&gt;How We Add Value?&lt;/em&gt; illustration is a photo by &lt;a href=&#34;https://unsplash.com/photos/IEiAmhXehwE&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Nana Smirnova&lt;/a&gt;.
&lt;em&gt;Is There Value Left in It?&lt;/em&gt; is a photo by &lt;a href=&#34;https://unsplash.com/photos/GcnPjvqRL18&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Imelda&lt;/a&gt;
&lt;em&gt;Datasets Should Work Together to Give Information&lt;/em&gt; is a photo by &lt;a href=&#34;https://unsplash.com/photos/huRn8ECqADI&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Lucas Santos&lt;/a&gt;&lt;/p&gt;
&lt;h2 id=&#34;footnote-references&#34;&gt;Footnote References&lt;/h2&gt;
&lt;p&gt;[1] Pomerantz, Jeffrey. 2021. “Metadata.” MIT Press essential knowledge
series. MIT Press. Cambridge, Massachusetts ; London, England : The MIT
Press, [2015]&lt;/p&gt;
&lt;p&gt;[2] Pomerantz, Jeffrey. 2021. “Metadata.” MIT Press essential knowledge
series. MIT Press. Cambridge, Massachusetts ; London, England : The MIT
Press, [2015]&lt;/p&gt;
</description>
    </item>
    
    <item>
      <title>Reprex Open Data Day 2021</title>
      <link>https://eviota.eu/talk/reprex-open-data-day-2021/</link>
      <pubDate>Sat, 06 Mar 2021 15:30:00 +0200</pubDate>
      <guid>https://eviota.eu/talk/reprex-open-data-day-2021/</guid>
      <description>&lt;p&gt;&lt;a href=&#34;https://opendataday.org/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Open Data Day&lt;/a&gt;  is an annual celebration of open data all over the world. It is an opportunity to show the benefits of open data and encourage the adoption of open data policies in government, business, and civil society. Reprex is a start-up that utilizes open data with open-source reproducible research: please challenge us with your data requests and participate in our web events.&lt;/p&gt;
&lt;p&gt;The &lt;code&gt;Reprex Open Data Day 2021&lt;/code&gt; will be two informal conversations based on a series of run up introductory blogposts centered around two themes. Because important guests became ill in the last days, we are going to consolidate the two talks into one with less structure.  We want to create an informal, inclusive, collaborative online event on International Open Data Day 2021. Please, grab a tea, coffee, or even a beer, and join us for an informal conversation. We hope that we will finish the afternoon with ideas on new, open-data driven collaborations.&lt;/p&gt;
&lt;p&gt;&lt;code&gt;9.30 EST / 15.30 CET&lt;/code&gt;:  &lt;strong&gt;Open collaboration in business, policy and science.&lt;/strong&gt;   Creating evidence-based policy, business strategy or scientific research with small contributions with independent components with incentives.  Short introduction with examples:  joining environmental sensory data and public opinion data on maps; creating harmonized datasets across the Arab world.  Survey harmonization, mapping, data products.  &lt;strong&gt;Scaling up open collaboration: making small organizations competitive with big tech in the big data era.&lt;/strong&gt;  Data sharing, data pooling, data altruism and observatories. The new European trustworthy AI and data governance agenda.&lt;/p&gt;
&lt;p&gt;You can &lt;a href=&#34;https://eviota.eu/presentations/reprex_open_data_day_2021.html#/reprex&#34;&gt;click through&lt;/a&gt; a short presentation to familiarize yourself with our topics.&lt;/p&gt;
&lt;p&gt;See you &lt;a href=&#34;https://meet.jit.si/ReprexOpenDataDay2021&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;here&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Case studies:&lt;/strong&gt;&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;We are connecting raw survey data about Climate Awareness in Eurobarometer surveys.  Here is the &lt;a href=&#34;https://rpubs.com/antaldaniel/734594&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;reproduction code&lt;/a&gt; (&lt;em&gt;intermediate to advanced R needed&lt;/em&gt;.) You should use the &lt;em&gt;development&lt;/em&gt; version of our &lt;a href=&#34;retroharmonize.dataobservatory.eu&#34;&gt;retroharmonize&lt;/a&gt; package at &lt;a href=&#34;https://github.com/antaldaniel/retroharmonize&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;github.com/antaldaniel/retroharmonize&lt;/a&gt;&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;We are tracking changes in the boundaries of provinces, states, counties, parishes with our regions open source software &amp;ndash; &lt;a href=&#34;https://rpubs.com/antaldaniel/regions-OOD21&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;reproduction code here&lt;/a&gt;. You will need our &lt;a href=&#34;regions.dataobservatory.eu&#34;&gt;regions&lt;/a&gt; package which is available on CRAN or in the rOpenGov GitHub repo.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;We will talk about how to join this with air pollution data and put it on the map with &lt;a href=&#34;https://dataandlyrics.com/post/2021-03-03-ood_interview_maps/&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Milos Popovic&lt;/a&gt;, who prepared this nice choropleth animation.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;















&lt;figure  &gt;
  &lt;div class=&#34;d-flex justify-content-center&#34;&gt;
    &lt;div class=&#34;w-100&#34; &gt;&lt;img src=&#34;https://eviota.eu/media/gif/eu_climate_change.gif&#34; alt=&#34;Milos Popovic&amp;amp;rsquo;s maps made from the case study.&#34; loading=&#34;lazy&#34; data-zoomable /&gt;&lt;/div&gt;
  &lt;/div&gt;&lt;/figure&gt;
&lt;/p&gt;
&lt;ol start=&#34;4&#34;&gt;
&lt;li&gt;We will discuss data observatories (permanent data collection programs), open collaboration (open-source inspired way of cooperation among small and large independent actors) and data altruism.&lt;/li&gt;
&lt;/ol&gt;
&lt;p&gt;Any questions: send Daniel a message on &lt;a href=&#34;https://keybase.io/antaldaniel&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;Keybase&lt;/a&gt;, Whatsapp or &lt;a href=&#34;https://dataandlyrics.com/#contact&#34; target=&#34;_blank&#34; rel=&#34;noopener&#34;&gt;email&lt;/a&gt;.&lt;/p&gt;
&lt;blockquote class=&#34;twitter-tweet&#34;&gt;&lt;p lang=&#34;en&#34; dir=&#34;ltr&#34;&gt;Hello on International &lt;a href=&#34;https://twitter.com/hashtag/OpenDataDay2021?src=hash&amp;amp;ref_src=twsrc%5Etfw&#34;&gt;#OpenDataDay2021&lt;/a&gt; from🌷 the Hague!&lt;br&gt;- We have brought some new data to the light about 🌡climate change awareness &lt;br&gt;- We created some tutorials how to harmonize survey and geographical data&lt;br&gt;- Join us at 9.30 EST/15.30 CET 👇&lt;a href=&#34;https://t.co/7J7pvi3sPC&#34;&gt;https://t.co/7J7pvi3sPC&lt;/a&gt; &lt;a href=&#34;https://twitter.com/hashtag/ODD2021?src=hash&amp;amp;ref_src=twsrc%5Etfw&#34;&gt;#ODD2021&lt;/a&gt; &lt;a href=&#34;https://t.co/DwkGQaDhW1&#34;&gt;pic.twitter.com/DwkGQaDhW1&lt;/a&gt;&lt;/p&gt;&amp;mdash; dataandlyrics (@dataandlyrics) &lt;a href=&#34;https://twitter.com/dataandlyrics/status/1368149535436996609?ref_src=twsrc%5Etfw&#34;&gt;March 6, 2021&lt;/a&gt;&lt;/blockquote&gt; &lt;script async src=&#34;https://platform.twitter.com/widgets.js&#34; charset=&#34;utf-8&#34;&gt;&lt;/script&gt;
</description>
    </item>
    
  </channel>
</rss>
