Getting to Know CKAN, 24 June 2015


Upload: steven-de-costa

Post on 07-Aug-2015

TRANSCRIPT

Page 1: Getting to Know CKAN, 24 June 2015

STEVEN DE COSTA, Steering Group Member

@STARL3N

GETTING TO KNOW CKAN

Page 2: Getting to Know CKAN, 24 June 2015
Page 3: Getting to Know CKAN, 24 June 2015

Introduction 3

Link Digital: Executive Director
CKAN Association: Steering Group Member
Open Knowledge Australia: Co-Secretary & Treasurer
Data Shades: Founder
GovHack: National Organiser
AWS User Group: CBR Organiser

Page 4: Getting to Know CKAN, 24 June 2015


Page 5: Getting to Know CKAN, 24 June 2015


Page 6: Getting to Know CKAN, 24 June 2015

Data portal software:
1. Open Source
2. Large and expanding installation base within Government worldwide
3. Expanding use cases in the wider data ecosystem
4. Python web app, PostgreSQL DB
5. Built for machines, custodians and end users

WHAT IS CKAN?

Page 7: Getting to Know CKAN, 24 June 2015

LEARN MORE AT CKAN.org

Page 8: Getting to Know CKAN, 24 June 2015

DOWNLOAD FROM github.com/ckan/ckan

Page 9: Getting to Know CKAN, 24 June 2015

1. >> Organisations (optionally with sub-organisations)
2. >> >> Datasets
3. >> >> >> Resources

4. >> Platform Custodian
5. >> >> Organisation Custodian, Editor or Member

6. >> Published or Private datasets

CKAN STRUCTURE
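
The nesting on this slide (organisations, optionally with sub-organisations, holding datasets that hold resources, each dataset published or private) can be sketched as plain records. This is only an illustration: the class and field names below are assumptions made for the example and are not CKAN's internal data model.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Resource:
    name: str
    url: str


@dataclass
class Dataset:
    title: str
    private: bool = True  # assumption for this sketch: private until published
    resources: List[Resource] = field(default_factory=list)


@dataclass
class Organisation:
    name: str
    datasets: List[Dataset] = field(default_factory=list)
    sub_organisations: List["Organisation"] = field(default_factory=list)

    def published_datasets(self) -> List[Dataset]:
        """Collect published datasets here and in all sub-organisations."""
        found = [d for d in self.datasets if not d.private]
        for child in self.sub_organisations:
            found.extend(child.published_datasets())
        return found
```

Walking the tree this way mirrors how a visitor only sees published datasets, while private ones remain visible to the owning organisation.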

Page 10: Getting to Know CKAN, 24 June 2015

1. >> Constitution
2. >> >> Parliamentary Legislation and Acts (Jurisdiction = Platform)
3. >> >> >> Ministries (Organisation)
4. >> >> >> >> Programs (Sub-Organisations)
5. >> >> >> >> >> Projects (Datasets)
6. >> >> >> >> >> >> Outcomes (Resources)

CKAN USE CASE PARADIGM

Page 11: Getting to Know CKAN, 24 June 2015

1. User registration
2. User management
3. Custodian workflows (manage datasets and data resources)
4. Directory browse by organisation or group
5. Faceted search for multiple fields (supporting end user discovery)
6. Resource views to preview data (a recently improved feature)
7. Metadata view

CKAN UI

Page 12: Getting to Know CKAN, 24 June 2015

1. Create an organisation (usually done by platform owner)
2. Log in as a member of the organisation
3. Click ‘add dataset’
4. Step 1: Add a title, description and other metadata
5. Step 2: Add resources (links to data or upload data files for hosting)
6. Step 3: Add any additional info

CKAN CUSTODIAN WORKFLOW
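
The same workflow can also be driven through CKAN's Action API rather than the web form. The sketch below builds, but does not send, a `package_create` request. The site URL, API key, organisation name and dataset values are placeholders, and the helper name `build_package_create_request` is made up for this example.

```python
import json
import urllib.request


def build_package_create_request(site_url, api_key, name, title, notes,
                                 owner_org, resources):
    """Build (but do not send) a POST request for the package_create action."""
    payload = {
        "name": name,            # URL-safe dataset identifier
        "title": title,          # step 1: title
        "notes": notes,          # step 1: description (markdown allowed)
        "owner_org": owner_org,  # the organisation the custodian belongs to
        "private": True,         # keep private until ready to publish
        "resources": resources,  # step 2: list of {"name": ..., "url": ...}
    }
    return urllib.request.Request(
        site_url.rstrip("/") + "/api/3/action/package_create",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json",
                 "Authorization": api_key},
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would submit it; that step is left out here so the sketch can be inspected without a live CKAN site.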

Page 13: Getting to Know CKAN, 24 June 2015

1. Title
2. Description: using markdown as required.
3. Keywords: some keywords (or tags) that describe your data.
4. License (required): a dropdown of available licenses for data.gov.au (the default is Creative Commons Attribution 3.0 Australia).
5. Organisation: a dropdown of organisations you can publish to. Most users can only publish to a single organisation. This will be automatically filled in.
6. Visibility: whether the dataset will be viewable to all users once complete. The default is private.
7. Geospatial Coverage (required): inherited from organisation metadata, this is the area which the data covers. It can be a point/polygon (Well-known text), an administrative boundary API, or a reference URL (website address) from the National Gazetteer. Gazetteer reference URLs can be found by searching for a place at http://www.ga.gov.au/place-names/, then clicking through to the most appropriate location "Reference ID", and then copying and pasting the URL from the page into the Geospatial field in data.gov.au.

CKAN METADATA FOR DATA.GOV.AU

Page 14: Getting to Know CKAN, 24 June 2015

8. Temporal Coverage From / To (required): the span of time from/to which the data is applicable. If the data applies only to a single point in time you should only fill in the Temporal Coverage From field.
9. Language: the language in which the dataset is published. The default is English.
10. Data Status (required): the status of the data with regard to whether it is kept updated (active, yes) or historic (inactive, no).
11. Update Frequency (required): how often the dataset is updated, e.g. Daily, Weekly, Never. (For remote machine-readable files this field will be used to fetch new versions of this data.)
12. Expose User Contact Information: display additional contact information for the dataset.
13. AGIFT Function/Theme: the AGIFT top level government function to which the dataset relates.
14. Publisher: name of Agency/publishing organisation. The default is set to the organisation’s name.
15. Jurisdiction: name of the jurisdiction in which the dataset belongs. The default is set to the organisation’s jurisdiction.

CKAN METADATA FOR DATA.GOV.AU
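
A custodian preparing records in bulk may want to check the fields marked "(required)" on these two slides before submitting. The sketch below does that check. The dictionary keys are hypothetical names chosen for the example and are not the field identifiers used by data.gov.au.

```python
# Fields the slides mark as "(required)". Key names are illustrative only.
REQUIRED_FIELDS = (
    "license",
    "geospatial_coverage",
    "temporal_coverage_from",
    "data_status",
    "update_frequency",
)


def missing_required(metadata):
    """Return the required keys that are absent or blank in a metadata dict."""
    missing = []
    for key in REQUIRED_FIELDS:
        value = metadata.get(key)
        if value is None or not str(value).strip():
            missing.append(key)
    return missing


record = {
    "title": "Rainfall 2015",
    "license": "Creative Commons Attribution 3.0 Australia",
    "geospatial_coverage": "POINT (149.13 -35.28)",  # Well-known text point
    "temporal_coverage_from": "2015-06-24",
    "data_status": "active",
}
```

Calling `missing_required(record)` on the sample record reports that only the update frequency still needs to be filled in.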

Page 15: Getting to Know CKAN, 24 June 2015

1. Join the dev mailing list (monitored by tech team): https://lists.okfn.org/mailman/listinfo/ckan-dev

2. Search Stack Overflow under CKAN: http://stackoverflow.com/search?q=CKAN

3. Check the roadmap on waffle.io: https://waffle.io/ckan/ideas-and-roadmap

4. Join a tech team meeting: http://ckan.org/about/technical-team/

WHERE TO GET HELP

Page 16: Getting to Know CKAN, 24 June 2015

CKAN API DOCUMENTATION

Page 17: Getting to Know CKAN, 24 June 2015

1. Get JSON-formatted lists of a site’s datasets, groups or other CKAN objects

2. Get a full JSON representation of a dataset, resource or other object

3. Search for packages or resources matching a query

4. Create, update and delete datasets, resources and other objects

5. Get an activity stream of recently changed datasets on a site

CKAN API
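
Each capability listed on this slide corresponds to an action under `/api/3/action/` on a CKAN site. A minimal sketch of how the read-only calls are addressed follows. The helper `action_url` and the host `demo.example.org` are assumptions for illustration, while the action names are ones provided by CKAN's Action API.

```python
from urllib.parse import urlencode


def action_url(site_url, action, **params):
    """Build the URL for a CKAN Action API call with optional query parameters."""
    base = f"{site_url.rstrip('/')}/api/3/action/{action}"
    return f"{base}?{urlencode(params)}" if params else base


SITE = "https://demo.example.org"  # placeholder host

EXAMPLES = {
    "list datasets": action_url(SITE, "package_list"),
    "list groups": action_url(SITE, "group_list"),
    "full dataset record": action_url(SITE, "package_show", id="rainfall-2015"),
    "search datasets": action_url(SITE, "package_search", q="open data", rows=5),
    "recent activity": action_url(SITE, "recently_changed_packages_activity_list"),
}
```

Fetching any of these URLs from a live site returns a JSON object whose `success` flag and `result` payload can be read with any HTTP client. Creating, updating and deleting objects use POST requests with an API key instead.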

Page 18: Getting to Know CKAN, 24 June 2015

1. CKAN as an Information Asset Register

2. FileStore – For hosting of data and resources

3. DataStore - provides a database for structured storage of data together with a powerful Web-accessible Data API

4. License Selection (machine ready?)

5. Harvesting

A FEW MORE POINTS
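
For the DataStore point, the Data API is reached through the `datastore_search` action, which takes a resource id plus optional filters and a row limit. The sketch below only constructs the query URL. The helper name, host and resource id are placeholders, and the `state` column is an invented example field.

```python
import json
from urllib.parse import urlencode


def datastore_search_url(site_url, resource_id, filters=None, limit=5):
    """Build a datastore_search URL; filters is a dict of column -> value."""
    params = {"resource_id": resource_id, "limit": limit}
    if filters:
        # The filters parameter is sent as a JSON-encoded object.
        params["filters"] = json.dumps(filters, sort_keys=True)
    return (site_url.rstrip("/")
            + "/api/3/action/datastore_search?"
            + urlencode(params))


url = datastore_search_url(
    "https://demo.example.org", "abc-123", filters={"state": "ACT"}
)
```

Because the DataStore holds the rows in a database, a filter like this is evaluated server-side and only the matching records are returned.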

Page 19: Getting to Know CKAN, 24 June 2015

Delivering the world’s best open data management system

The purpose of the CKAN Association is to support sustainable growth and development of CKAN while also protecting the interests of the CKAN community.

The Association values a healthy and thriving community which continues to deliver the best open data management system in the world.

CKAN ASSOCIATION: STATEMENT OF PURPOSE

Page 20: Getting to Know CKAN, 24 June 2015

Community interests, or needs, can be generalised as:
1. Users need an enterprise level open data management system tailored to meet their needs now and into the future.
2. Individual contributors need a project that is rewarding to work for, inclusive and active.

COMMUNITY INTERESTS

Page 21: Getting to Know CKAN, 24 June 2015

Who owns or directly manages the CKAN project? The project, its releases and future direction are cooperatively managed by its community of users and contributors. Association delegates, staff and office holders may be active within the project but will exercise no more or less influence than any other contributor or user.

More information about CKAN and how to contribute can be found at CKAN.org.

DELEGATION TO SERVE AND PROTECT

Page 22: Getting to Know CKAN, 24 June 2015

What is the current structure of the CKAN Association? See http://ckan.org

The steering group carries on a number of business activities. This includes raising revenue, managing resources and directing projects or programs of activity relevant to the CKAN Association’s statement of purpose.

COMPLETENESS OF PURPOSE

Page 23: Getting to Know CKAN, 24 June 2015

FORMAL MEMBERSHIPS

Page 24: Getting to Know CKAN, 24 June 2015

It's easy to find people willing to fund or work on the flashier parts of CKAN, and there is some funding for core work such as meetings and preparing releases. But there are a few important things that aren't getting addressed.

Fix Security Issues
There are a small number of known issues that high-profile sites work around by implementing workflows and tight controls over users and data published. Everyone else would benefit from a more-secure-by-default CKAN.

Clear Technical Debt
CKAN has some parts that should be replaced or removed. This work doesn't add any new features and, worse, threatens to break upgrades for existing sites and extensions. It is necessary work though. Unless performance and technical limitations are addressed they will grow and start turning users away from CKAN.

Ref: https://github.com/ckan/ideas-and-roadmap/issues/152 - Raised by Ian Ward, Tech Team Lead

HOW MEMBERSHIPS BENEFIT CKAN

Page 25: Getting to Know CKAN, 24 June 2015

1. Large installation base of significant platforms
2. Open Source projects favoured by technical influencers, driving adoption
3. Historical association with Open Knowledge adds significant ‘open’ credibility
4. Support via Open Knowledge opens direct and indirect funding support
5. Global awareness of CKAN due to many years in the market
6. CKAN Association is established with strong representatives from three continents
7. Established, experienced and professional technical team
8. Extension model allows for customisations
9. CKAN can be integrated with WordPress or Drupal
10. Free to use
11. A number of private companies now specialise in CKAN services
12. Public roadmap for future features

OUR STRENGTHS

Page 26: Getting to Know CKAN, 24 June 2015

1. Current reliance on Open Knowledge funded contributors
2. Technical debt (Python framework needs replacement)
3. Relatively* low activity on roadmap feature development
4. Community and Communications Team lacks resources
5. Technical team is relatively* small
6. Relatively* small number of private companies specialise in CKAN services

* Compare with projects with such significant adoption in enterprises

OUR WEAKNESSES

Page 27: Getting to Know CKAN, 24 June 2015

1. Partnerships within data and civic app sector
2. Many markets where CKAN is being considered
3. Markets are growing
4. CKAN is a technically interesting project for potential contributors
5. Attractive to internal technical teams within Government
6. Use case ‘distributions’ to meet new verticals
7. Use case extensions can increase the value of current installations
8. Regular update releases supporting a vendor community
9. Vendor community can grow awareness, demand and installations of CKAN

OUR OPPORTUNITIES

Page 28: Getting to Know CKAN, 24 June 2015

10. An effective communications effort can grow awareness of the project
11. A modified license would allow CKAN to be integrated with commercial applications
12. Revenue from formal membership tiers can fund business activities
13. In-kind contributions from formal membership can advance business activities
14. Domain specific working groups can align the project with various global activities and market segments
15. Ensuring broad interoperability to widen use cases and adoption within various solution
16. Direct alignment with OK initiatives, such as OpenSpending, for increasing the value of CKAN

OUR OPPORTUNITIES

Page 29: Getting to Know CKAN, 24 June 2015

1. Alternatives to CKAN could gain in popularity, degrading contributor interest
2. Project is overly reliant on support from Open Knowledge
3. Possibility of user churn out, degrading the user base
4. Supply of enterprise services fails to meet demand for services, killing adoption
5. Project could fork in too many directions if it doesn’t evolve to meet user needs over the long term
6. Contributors churn out, leaving the community without new contributors entering

OUR THREATS

Page 30: Getting to Know CKAN, 24 June 2015

PRIVATE SECTOR OPPORTUNITIES

Page 31: Getting to Know CKAN, 24 June 2015

The DataCo Vision Today 31

DataCo brings together the supply and demand sides of the market place, which stimulates the private sector and ensures access by the public to geographical information. Part of our role is to help the contributor’s disparate data resources mesh, cooperate and function uniformly to create high-quality products.

Page 32: Getting to Know CKAN, 24 June 2015

Challenges to The DataCo Vision 32

• Public conversation on data has moved beyond enterprise ICT and large monopolistic supply chains.

• The convergence between public cloud, IoT and analytics is raising the public awareness of Big Data’s value to the economy as a platform for innovation.

• The convergence between open source and open government has raised awareness surrounding open data as a driver for economic redistributions of value from the few to the many.

• Data is a special kind of economic good which people are learning to take for granted as open by default.

Page 33: Getting to Know CKAN, 24 June 2015

The DataCo Vision Tomorrow 33

DataCo’s role is to help disparate data resources mesh, cooperate and function uniformly to create high-quality products. We supply this data under a variety of commercial arrangements which enable our customers to develop products which they can rely on for high quality geographical information.

Page 34: Getting to Know CKAN, 24 June 2015

Foundations of Open Government 34

My Administration is committed to creating an unprecedented level of openness in Government. We will work together to ensure the public trust and establish a system of transparency, public participation, and collaboration. Openness will strengthen our democracy and promote efficiency and effectiveness in Government.

Sources: http://www.whitehouse.gov/the_press_office/TransparencyandOpenGovernment
Hope Poster by Shepard Fairey: http://en.wikipedia.org/wiki/File:Barack_Obama_Hope_poster.jpg

Page 35: Getting to Know CKAN, 24 June 2015

The role of Government in Australia 35

It should support civil society and its multiplicity of voices and activities.

It should provide the economic framework and the essential infrastructure for public and private enterprise.

Source: http://www.australiancollaboration.com.au/pdf/FactSheets/Role-government-FactSheet.pdf

Page 36: Getting to Know CKAN, 24 June 2015

Governance and Provenance in Singapore 36

Constitution: established 1965

Parliament: Policy, Legislation and Administrative Acts

Ministries: Prime Minister’s Office

Entities and Programs: Smart Nation

Projects: Data 101

Outcomes: Attendees

Page 37: Getting to Know CKAN, 24 June 2015

Collaboration is essential 37

Why is collaboration essential for:

innovation? < multidisciplinary skills and experience

digital transactions? < stakeholder and user participation

data classification? < early identification and ongoing governance

security? < peer review and bright lights

Page 38: Getting to Know CKAN, 24 June 2015

Business Cases: Theory of the firm & Transaction costs 38

Page 39: Getting to Know CKAN, 24 June 2015

Where is there work to do with Open Data? 39

Page 40: Getting to Know CKAN, 24 June 2015

Link Digital’s Network Map 40

Page 41: Getting to Know CKAN, 24 June 2015

Published Data Handbook 41

Page 42: Getting to Know CKAN, 24 June 2015

Published Data Handbook 42

Recipe Name What you’ll need Ingredients Method Perfect for…

Page 43: Getting to Know CKAN, 24 June 2015

Establishing Data Publishing Policy 43

Page 44: Getting to Know CKAN, 24 June 2015

Establishing Data Management Systems 44

Page 45: Getting to Know CKAN, 24 June 2015

Educating Data Publishers 45

Page 46: Getting to Know CKAN, 24 June 2015

Facilitation within tech communities 46

Page 47: Getting to Know CKAN, 24 June 2015

Establishing enterprise solution architecture 47

Page 48: Getting to Know CKAN, 24 June 2015

Extract, Transform and Load Connectors 48

Page 49: Getting to Know CKAN, 24 June 2015

Cleaning of Messy Data 49

Page 50: Getting to Know CKAN, 24 June 2015

Geospatial Integration 50

Page 51: Getting to Know CKAN, 24 June 2015

Resource View Integration 51

Page 52: Getting to Know CKAN, 24 June 2015

Enriched Data and Analytics 52

Page 53: Getting to Know CKAN, 24 June 2015

Types of business models 53

Specialist services

Managed Services

Product Integration, with monetisation options established with the platform owner

Page 54: Getting to Know CKAN, 24 June 2015

The perfect storm 54

Page 55: Getting to Know CKAN, 24 June 2015

Drupal interface 55

Page 56: Getting to Know CKAN, 24 June 2015

CKAN interface 56

Page 57: Getting to Know CKAN, 24 June 2015

What the DFMP does 57

Page 58: Getting to Know CKAN, 24 June 2015

Launched Globally as CKAN Galleries 58

Page 59: Getting to Know CKAN, 24 June 2015

Launched Globally as CKAN Galleries 59

Page 60: Getting to Know CKAN, 24 June 2015

Government as an API - Digital Transformation Office 60

Page 61: Getting to Know CKAN, 24 June 2015

Data.vic.gov.au on CKAN, Drupal and AWS 61

Page 62: Getting to Know CKAN, 24 June 2015

Classifying data 62

The three tiers are:

internal

whole of Government

open

Page 63: Getting to Know CKAN, 24 June 2015

Data.vic.gov.au data classification 63

Page 64: Getting to Know CKAN, 24 June 2015

The ‘go / no go’ gates for going open 64

Page 65: Getting to Know CKAN, 24 June 2015

Take Note: What is NOT good 65

Page 66: Getting to Know CKAN, 24 June 2015

Take Note: What is best… 66

Page 67: Getting to Know CKAN, 24 June 2015