<%BANNER%>

UFIR



Identifying Data Providers
www.ufl.edu ( Publisher's URL )
CITATION DOWNLOADS PDF VIEWER
Full Citation
STANDARD VIEW MARC VIEW
Permanent Link: http://ufdc.ufl.edu/IR00000963/00001
 Material Information
Title: Identifying Data Providers
Physical Description: Presentation
Creator: Conlon, Michael
Publisher: University of Florida
Place of Publication: Boulder, CO
Publication Date: May 15, 2012
 Notes
Acquisition: Collected for University of Florida's Institutional Repository by the UFIR Self-Submittal tool. Submitted by Michael Conlon.
Publication Status: Published
 Record Information
Source Institution: University of Florida Institutional Repository
Holding Location: University of Florida
Rights Management: All rights reserved by the submitter.
System ID: IR00000963:00001

Downloads
Full Text

PAGE 1

I DENTIFYING AND A PPROACHING D ATA P ROVIDERS ; F ULL L IFE C YCLE D ATA M ANAGEMENT Mike Conlon Kristi Holmes

PAGE 2

Data for VIVO What data will you need? Authoritative Sources Data owners, holders, stewards, providers Examples, pitfalls, successes The role of IT

PAGE 3

What Data will You Need? Data on People current and past positions, contact information, photos, awards, service activities, identifiers Organizations structure, identifiers, web sites Papers Grants Other s cholarly works books, chapters, abstracts, posters, presentations, art, music Mentoring relationships Patents Courses

PAGE 4

Automated Data Feeds Some data (most?) will come from automated feeds from existing systems Registrar, Faculty Reporting, Grants Management, Institutional Repository Some systems are relatively complete (all faculty, all data elements regarding positions), many are not Some systems have good data, many do not What is the tolerance for incomplete, incorrect data? What can be done to improve data quality?

PAGE 5

Manual Data Entry data Scripps Research Institute (250 faculty) Ponce Medical School (35 faculty) UF Agriculture (800 faculty) Some faculty will want to review/edit their data Central proxy edit Distributed proxy edit Self edit

PAGE 6

How Will You Decide? Planning Committee Identify and engage stakeholders Create a governance structure Strategy Lots of others

PAGE 7

Example: University of Florida

PAGE 8

University of Florida Data People from the University directory, UFID Positions from HR, PositionID Organizations from PeopleSoft, manual, DEPTID Grants from Division of Sponsored Research SponsorIDs AwardID Papers from PubMed, PubMedID from Thomson Reuters, DOI Photos from ID Cards, Business Services Division, UFID Courses from Registrar via Enterprise Data Warehouse, Course and Section Numbers Overview, research areas, awards, research interests, memberships, posters, abstracts, presentations, patents, software, featured in, teaching overview, service, education, keywords entered manually via central proxy, local proxies and self edit

PAGE 9

Talking to Data Stewards What motivates the data steward? We understand university procedure, policy, culture We have the support of the university We love, understand and respect data

PAGE 10

Example: University Registrar History Relationship Relationship to VIVO Providing Data

PAGE 11

Data Management Valid and invalid mconlon@ufl.edu is a valid email address jackie @@ufl.edu is not Validity can sometimes be defined by rules which can be checked by machines But names are difficult Mike Conlon (a valid name) A J Smith X (a valid name?)

PAGE 12

Correct and Incorrect Is this knowable? That is can we determine whether it is a correct or incorrect statement? Who can determine if this is knowable? Who can determine if this is correct? Most institutions have processes for providing authoritative sources for some data elements

PAGE 13

Authoritative Sources Hint: Trick question

PAGE 14

Legal Name HR is authoritative for legal name. HR must record and process the legal name for federal tax purposes. Legal names are changed through a court process and recorded via a W4 form by HR. Ex: James Bernard Machen President of UF Preferred Name The individual is authoritative for preferred name (but mediated by the institution?) Ex: Bernie Machen

PAGE 15

Who is authoritative? For each field in VIVO, determine who is authoritative a data provider, the individual The two field pattern often recurs: UF Business Email Enterprise Systems is authoritative Alternate Email Individual is authoritative (and VIVO becomes the system of record for this field)

PAGE 16

VIVO as a System of Record VIVO is a system of record for any data element for which the institution has no system of record. eraCommons ID Names Email addresses

PAGE 17

Editing Data values Data values should be edited in the systems of record When VIVO is not the system of record, the data element should not be editable in VIVO When data elements are not editable in VIVO, data flows should be in place to insure that values changed in source systems are mirrored in VIVO

PAGE 18

The Source System is Always Right Business Phone Number changed in University Directory IT process detects change, passes to VIVO Change is mirrored in VIVO within 60 minutes The university maintains an official directory of business phone numbers. Business Phone numbers can not be changed in VIVO. Business phone numbers are changed in the UF Directory. These changes are passed to VIVO so that VIVO is always up to date.

PAGE 19

The Source System is Always Right, Part 2 Office of the Registrar produces data on teaching each term IT process loads the data into VIVO Faculty profiles are updated in VIVO with teaching records The university registrar maintains official records of which person(s) taught which instances of which courses each term. Such information can not be edited in VIVO. Teaching information is produced each term. This data is loaded into VIVO and cross linked to the instructors, courses. The Registrar amends the teaching records well after the term is complete sometimes as much as six months after the term is complete, based on department reports, effort tracking and other input. These amendments are then made to VIVO.

PAGE 20

THE ROLE OF IT

PAGE 22

Data Flows at UF

PAGE 23

Does VIVO Give Back? a semantic web technology that allows When VIVO has data of interest to the institution, the institution can query VIVO to get the data