open tech digital cholera
TRANSCRIPT
![Page 1: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/1.jpg)
Digital CholeraPeter Wells - @peterkwells
OpenTech - June 2015
![Page 2: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/2.jpg)
CholeraThe last time I was in this building I went to a talk on an early example of data analysis and data visualisation.
John Snow famously traced a fatal cholera epidemic in Soho in 1854 to a local water pump.
Because of cholera in the pump the water was not safe to use.
Read more about John Snow: http://en.wikipedia.org/wiki/John_Snow_%28physician%29@peterkwells
![Page 3: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/3.jpg)
Cholera and infrastructureThe Soho outbreak started at a water pump, it could have been a water reservoir.
The cholera bacteria would spread and contaminate the water downstream. An entire set of water infrastructure could have been contaminated.
The water would not have been safe to use. Yet water is essential to life.
Image CC-BY-2.0 by Woodley Wonderworks: https://www.flickr.com/photos/wwworks/@peterkwells
![Page 4: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/4.jpg)
Safe waterAs a society we invest in water infrastructure. We have:
- inspections- alerting systems- purification- education
We put more focus at the top of the infrastructure, on water producers and distributors, than we do on water users.
The goal is to make water that’s safe for people to use.
A Doctor from the World Health Organisation
@peterkwells
![Page 5: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/5.jpg)
Get to the digital...
@peterkwells
![Page 6: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/6.jpg)
Open AddressesOrganisations have to buy lists of UK addresses, licensing is complicated, the quality isn’t great, the data doesn’t meet all the needs.
It’s hard to build new services.Open Addresses explored whether it was possible to build a new UK address list, to make things simpler and make addresses more widely used.
@peterkwells
![Page 7: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/7.jpg)
Addressing needsDenmark had a 1000% increase in the organisations that use address data by making address data simpler to use.
We discovered other needs and benefits:
- people who move into new houses need their addresses to be published faster- people name their houses and need other people to know about it- people need it to be easier to enter addresses on websites- (I could go on…)
@peterkwells
More and better services that would make life a little bit easier
![Page 8: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/8.jpg)
Getting addressesAs well as understanding the needs we had to find data.
There are 26-40m addresses in the UK.
The Land Registry publishes over 18 million addresses in the Price Paid Dataset. Sounds great!
@peterkwellsAside: we also did some neat stuff on mathematical inference for addresses. Check out www.openaddressesuk.org...
![Page 9: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/9.jpg)
Land Registry says no...
Image from Owen Boswarva: http://mapgubbins.tumblr.com/post/107499166390/it-was-all-a-dream-land-registrys-price-paid@peterkwells
![Page 10: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/10.jpg)
Third Party Rights are complex and can be fatalAddress datasets can include third-party database rights:
1. if the data was directly copied from an existing address database2. if an existing list of addresses (obtained through another route) was corrected orvalidated based on an existing address database
Unauthorised use of third party rights creates risk for both data publishers and consumers.
The service can simply…... stop.
@peterkwells
![Page 11: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/11.jpg)
Third party rights, they’re everywhere!As we inspected other datasets we saw similar issues with unauthorised rights:
- websites for data capture that used third party address products- datasets that had been cleansed with third party address products- a clean website followed by automated back-end validation
Even with submission guidelines, provenance tracking and takedown policies the legal position for Open Addresses was really complex.
We made a :(@peterkwells
![Page 12: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/12.jpg)
LightbulbIt is complicated to determine if unauthorised third party rights exist. You need to inspect the data and how it was produced
@peterkwells
Image by Richard Rutter: https://www.flickr.com/photos/clagnut/
![Page 13: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/13.jpg)
Safe water - a repriseAs a society we invest in water infrastructure:
- inspections- alerting systems- purification- education
We put more focus at the top of the infrastructure, on water producers and distributors, than we do on water users.
The goal is to make water that’s safe for people to use.
Image CC-BY-2.0 by Woodley Wonderworks: https://www.flickr.com/photos/wwworks/@peterkwells
A Doctor from the World Health Organisation
![Page 14: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/14.jpg)
Digital cholera
@peterkwells
Copyright is a good thing (don’t believe me? ask a musician) so I’m using a harsh metaphor, but the metaphor is useful.
Don’t take away my copyright!
![Page 15: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/15.jpg)
Digital cholera
@peterkwells
The water may be infected with cholera.
Therefore we inspect it to see if the water is safe to use.
Land Registry address data may be infected with digital cholera.
Therefore we inspect it to see if the data is safe to use.
We learnt it wasn’t so we didn’t….
![Page 16: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/16.jpg)
Digital cholera
@peterkwells
Not just about unauthorised third party rights.
Inappropriate releases of personal data.
Incomplete data.
Incorrect data.
Remember it’s a metaphor.
![Page 17: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/17.jpg)
Digital cholera
@peterkwells
Can we learn more from how society learnt to deal with cholera in water?
![Page 18: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/18.jpg)
Alerting system?
@peterkwells
We’ve told Land Registry of the problem(s).
We’ve published articles to alert others.
We’re here.
Should this be better?
![Page 19: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/19.jpg)
Purification?
@peterkwells
Tricky. There is no equivalent of a purification tablet.
We need to cleanse data infrastructure of digital cholera or we need to rebuild it.
It is simplest if the data is kept pure by whoever creates and maintains it.
Just as with water.
![Page 20: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/20.jpg)
Education
@peterkwells
The ODI already have a wealth of education material and are including the thinking and learning from Open Addresses in some future work:
Send your ideas more here:http://theodi.org/who-owns-our-data-infrastructure?
![Page 21: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/21.jpg)
Water is essential to life so we invest in maintaining our water infrastructure to make water safe to use.
Data gives us more and better services. It is is essential to life. We need to invest in maintaining useful data infrastructure to make data safe to use.
@peterkwells
![Page 22: Open tech digital cholera](https://reader035.vdocument.in/reader035/viewer/2022062420/55c5d11cbb61eba0798b4660/html5/thumbnails/22.jpg)
@peterkwellsImage by Don Graham: https://www.flickr.com/photos/23155134@N06/
If we don’t look after our data infrastructure we risk simply ending up with some rusty and unused data pumps….