Accessibility Assistance

Skip to Content

  


Digital Services (DLC)
Smathers Libraries
University of Florida
P.O Box 117003
Gainesville, FL 32611 USA

P: 352.273.2900
F: 352.846.3702
UFDC@uflib.ufl.edu

An A B C, for Baby Patriots

Description: An A B C, for Baby Patriots, by Mary Frances Ames, 1899.

Collection: Baldwin Library of Historical Children's Literature Digital Collection

Ramón Figueroa Mexican & Cuban Film Poster Collection


Collection: Digital Library of the Caribbean

Drew Field Echoes

Description: Newspaper published at the Drew Field Air Force Base in Tampa, Florida.

Collection: Florida Digital Newspaper Library

Antique Maps, Historic Sanborn Maps, and Aerial Photography



Collection: Map & Imagery Library Digital Collections

Archie Carr and Sea Turtles

Description: Archie Carr attaching weather balloons to sea turtles.

Collection: University Archives Photograph Collection

Alfred Browning Parker

Description: Alfred Browning Parker, architectural drawings, from the University of Florida Architecture Archives

Collection: University of Florida Libraries Architecture Archives Collection

Digital Library Center: Department Goals for 2009-2010

The Digital Library Center provides a forward-thinking framework for expanding the UF Libraries in the information age. To meet current and future needs, the Digital Library Center advances collaborative interdisciplinary research by creating digital content; implementing and integrating multiple interoperable standards to ensure optimal access and preservation; and additional tools to enhance digital content and extend research possibilities.

The Digital Library Center facilitates and focuses the Libraries' development and integration of digital programs and services within and extending from the University of Florida.  The Digital Library Center develops digital content (digitization of analog materials in all formats and support for born digital materials) for inclusion in the University of Florida Digital Collections System, which is supported by the Digital Library Center. The University of Florida Digital Collections System features a robust standards-reliant infrastructure that allows for the automatic translation among multiple metadata standards (MODS/METS, MARC, DC) for maximized interoperabilty and allows for customized interfaces and views depending on the institution contributing the materials, the collection or project, and the material type.

Because digital content and collections are incomplete without context, the Digital Library Center undertakes collaborative scholarly research initiatives to create the necessary contextual supports through interdisciplinary research. The Digital Library Centers many research, content development, and technology assistance partnerships which cannot be adequately represented by project names and statistics alone.

Important elements include:

  • Standards for digital library projects, including criteria for selection, development and use;
  • Quality assurance in capture or conversion, and value-added enhancement through indexing or mark-up;
  • Digital preservation and archiving of master files and online access derivatives to ensure long-term preservation and maintenance of digital collections; 
  • Effective methods for connecting the Libraries' digital collections to researchers, students, and the public
  • Development of productive partnerships with individuals and groups at University of Florida, as well as the community beyond the University; and
  • Contextual development to ensure digital materials are properly presented, in terms of the user interface and in terms of the proper chronological, temporal, and cultural significance

Current Data on Digitization and the UF Digital Collections (7/5/2009, printable version here)


Categories for Goals:


Administrative and Operational Goals:

  • Continue to streamline and integrate operations through:
    • Seeking newspaper born-digital content
    • Automating processes: internal archiving with CNS' NSAM/Tivoli solution
    • Evaluate recent streamlining efforts (reduced TOC for select projects, faster newspaper QC) for further refinement
  • Establish three digitization production queues
    • See production related goals at the bottom of this page
  • Collaborative Projects; Goals:
    • Conduct one usability study on UFDC each year. (May-June 2009)
    • Initiate discussions with the University Press of Florida for possible collaboration
    • Collaborate with the Documents Department on a digitization project to serve all of the Selectives that UF supports as a Regional in FDLP, including materials that also support dLOC
    • Continue to support partners with ingest of partner files (UF partners, including: Harn, Herbarium HSC Archives; External partners) and liaising
    • Continue training programs:
    • Continue outreach programs, presentations, and tours
    • Continue collection page web design support including help pages and documentation
    • Contine design support for collateral materials (postcards, bookmarks, etc); all displays here

Technology Project Goals

UFDC & DLC Toolkit (Included are goals for 2009-2010 and beyond)

  • See the UFDC Development page for details
  • Priorities (see details in extended list after this):
    • Statistics and OAI-PMH corrections
    • Add text as an OAI-PMH format for single item view
    • myUFDC
    • Ability to enter coordinates via Google Map
    • Image Server
    • PRO and ALTO conversion
  • Priorities, details on list above:
    • Maintenance/Ongoing Support
      • UFDC and DLC toolkit and dLOC Toolkit support
        • Tweaks from DR training
        • Compact SQL Express (complete 6/16/09)
      • Ongoing finessing of MARCXML record feed into Endeca from UFDC
      • GoUFDC updates to absorb FDA prep tool by having UFDC METS reference METS and DAITSS schemas on UF's site (to have it work even if FCLA site is down and avoid need to embed DAITSS information or to fail if FCLA's site with the schema is down while FDA is accepting files)
    • Search:
      • Enhancements to Lucene indexing for UFDC collections
      • Add ability to customize searches at hierarchical levels. (i.e., newspaper title level for FDNL)
      • Show search terms in context in search results
      • Using .PRO files to highlight text on the page
      • Converting .PRO to ALTO
      • In search results on the item page, showing the @15 words before and after the search term(s)
      • Searching by address (Aerials grant)
      • Faceted searching in UFDC
      • Customizable user response pages by collection 
      • Z39.50/ZING SRU
    • myUFDC or bookbag functionality
      • User login system
      • Users should be able to logon, view existing book bag, and add items to book bag
      • From their book bag, they should be able to email and print
      • Viewing items in book bag should have most the same types of views as any browse/search
      • Users should be able to organize into folders
      • Users should be able to make their book-bag or a folder therein public, for others to see
    • Self-submission tool and on-line metadata editing capabilities
      • For authenticated users
      • Non-authenticated users: online metadata updates and self-submittal tool with vetting system (request by NHPRC grant)
      • Option to create records online even without items
      • Option to see records w/o items as citations (flag in record to show for all) or not show (default) to all, but to show under &dlc=yes
      • WYSIWYG for editing collection pages
        • Authenticated users can edits to pages in their collections
        • Easier collection page and supplemental page creation, contribution, and editing by CMs (libguides)
    • Implement new zooming image technology
      • Create a new zoomable image server based on Kakadu decoder
      • Include ability to read an ALTO file and highlight the corresponding text regions
    • Zoning for newspapers
      • Integrating the newspaper zoning tool into QC for copyright blur processing
      • After manual zoning for copyright information, refine to create auto-zoning tool within preQC that "guesses" newspaper zones for QC to use
      • Ongoing refinements to zoning
    • Born digital ingest: auto-split of PDF into TIFF files
    • BIB ingest optimization: MARCedit
    • UFDC Display Needs
      • Add finding aids / EAD
      • Slideshow displays (Treister and others)
      • Implement auto-translations for all of UFDC once primary terms are translated
    • Updates based on current usability testing
    • Eventual Workflow Goals:
      For the following below, programming will be needed for PreQC, UFDC loading/building, QC tool moved online and QC tool functioning, OCR processing, and notifications of load/ocr complete for archiving
      • During PreQC, all files are loaded to UFDC and can be seen using the "&dlc=yes" on the URL
      • QC is done online
      • Once QC is approved, item is live
      • OCR happens to files online after QC approval
    • Possible/Testing

Tracking & Importer Related (Included are goals for 2009-2010 only)

  • (4) Importer enhancements, which were priorities 4, removed per improvements in the DLC/dLOC toolkit that make them no longer relevant on 6/24/09. They are available here for record keeping purposes.
  • (1) Reports (completed 7/9/2009? This note added 7/9/2009)
    Ticket #31208 (and was #31165 and #28862)
    • Reports run weekly for processing stage complete (record created, preqc, qc, ocr, ufdc load, ufdc new, fda, archive) by
      • total overall for all of UFDC
      • collection
      • holding institution
      • person
    • Report display:
      • Raw total numbers for each of the above
      • Number of items in each status area (whatever is most recent of record created, preqc, qc, ocr, ufdc load, ufdc new, fda, archive)
      • List of items in each of the status areas (record created, preqc, qc, ocr, ufdc load, ufdc new, fda, archive) by collection
      • List of items in each of the status areas by holding institution
      • List of items in each of the status areas by person
      • Include where archived (CD/DVD numbers)
      • Include link to item using UFDC standard BIBVID URL format
  • (3) Auto-generated reports
      • Auto-created on a weekly basis for set, regular needs above
      • Auto-added to reporting space online (www.uflib.ufl.edu/digital/organization/goals/2009-2010trackingreports/) for all except the by person reports
      • By person reports should be added to DLC department space (under Admin\2009TrackingDBreports)
    • SQL scripts:
      • SQL scripts for running each of the above should be attached to the Grover ticket (as with #28862 and updated whenever the scripts are updated so that the reports can be easily and quickly run by anyone with DB access)
    • Available Reports
  • (2) Lister
    Ticket #31647
    • Repair Lister so that it works for the portable drives
    • Ticket put in on 6/16; Lister repaired by Mark Sullivan on 6/22/2009
  • (5) Automated ingest
    Ticket #31377
    1. From IA (BHL example, with sample code)
    2. From UF Grad School (ETDs)
    3. From GovDocs/FDLP
    4. From vendors
  • (6) Tracking DB enhancements
    • Sitting with DLC users (Nelda, Laurie, Randall, Lourdes, Dina, and Matt) to see what the different needs are for the Tracking DB interface and then combining and prioritizing those needs with the ones listed below
    • Making all fields tab-able
    • Setting defaults
    • Copyright
      • Removal of secondary publication date by copyright date
      • Public domain/permission granted as auto-selected default; tabbing goes directly to "permissions note" which is on same page with no additional pop-up window and tracking auto-sets date of permission granting to "date permissions information entered date"
      • If not in public domain, user has drop-down to choose "copyright protected" or "creative commons" with standard licenses auto-populated (and the standard information carrying through to the METS)
    • Archive DB design
      • Link BIBVID to CD/DVD
      • Design should accomodate FDA and any other archive information (CNS, cloud-style, wherever)
    • Moving Tracking DB online
      • This may end up being much later in terms of priorities for scheduling, but it is the eventual goal for Tracking to be online, so all changes should help build toward that goal or users should be informed of any potential impacts/issues
  • (7 or as optimal) Related Operational
    [These should be scheduled whenever is optimal for continuous work without impacting other priorities.]
    • FilmLog
      Ticket #29222
      • Adding search by title and country
      • Setting meeting with Preservation, Grants, DLC, and Systems to plan integration of data into catalog and to plan for possible adopt-a-reel program
      OCR automator
      Building from completed Ticket #30325
      • Combine TEXT and PRO job files into single JOB file for each package
      • Will need to look in several SAN locations once SAN is split
      • Log OCR errors to Tracking DB
      • Record OCR value for each package (average of value for each page) in Tracking DB
    • SuperC or VideoLan:
    • Minor assistance
      • Zipping of all aerials for download by flight or county
      • JPEG2000 header editing

Servers (Included are goals for 2009-2010 only)

  • Install and config of new UFDC SAN
  • Shared server config documentation
  • Partioning of DLC San for optimized speed/reduction of Replistor-related issues
  • Space alerts
  • Tivoli for internal archive

  • Updates with order of priority (added 7/4/2009)
    • Priority 1:
      • 31961 and 31912
        • UFDClinux1 drive failure; currently failed over UFDClinux2
    • Priority 2:
      • Install and config of new UFDC SAN space
      • Install and config of new DLC processing SAN space
    • Priority 3 and ongoing:
      • 31962
        • Testing for failures and updated and shared server install, config, and emergency procedures documentation
    • Priority 4
      • 27052:
        • Tivoli for internal archive, requires DLC processing SAN space to be corrected first
    • Priority 5
      • 31960:
        • PreQC Virtual Machine needed. This is to speed processing and increase efficiency, so this is to improve operations and not part of critical operations and thus has a lower priority.

Digitization Production Goals

Work on core ongoing projects in the three production queues (Newspapers, IR & institutionally related, and Main).

  • Newspapers; Goals:
    • Catch up to only one year behind in digitization production for Florida Newspapers (working from oldest and current year to narrow the gap)
    • Move 30 newspapers born digital (12 newspapers are now born digital)
    • As possible, integrate Caribbean Newspapers into queue
  • IR & Institutionally Related; Goals:
    • List of born digital serials and UF Publications chosen by selectors is here, goal is to move UF related materials from the Main queue here
    • Support Open Access by digitizing UF materials as chosen by selectors, including:
      • Florida Anthropologist
      • UPF Orange Grove books
    • ETDs: Increase to three departments that submit PILO to submit electronically. (PILO departments)
    • WID: Schedule for processing of Women in Development materials and processing that matches the planned schedule
  • Main Queue; Projects and Goals:
    • dLOC:
      • Meet all grant expectations.
      • Complete final report and all required paperwork for grant on time.
    • Caribbean Newspaper Digital Library (CNDL) Goals:
      • Develop copyright/permission request manual and training guides.
      • Acquire permissions to digitize more Caribbean newspaper titles
      • Digitize Caribbean Newspapers
      • Acquire born digital files from newspaper publishers
      • Move from Main into the Newspaper Queue
    • Everglades (grant period 1/09-12/11; digitization production 4/2009-11/2011 or 31 months); Goals:
      • Remain on schedule
        • Final total which should be 99,690 pages
        • 99,690 pages in 31 months requires an average of 3,216 pages per month; actual production per month will vary for the letterbooks and photos, which are more time consuming
        • On track in 7/09 with 11,791 pages loaded
      • Migrate FIU EDL materials into UFDC
      • Core Project Team:
        • Lourdes: .10 FTE
        • Jane: .10 FTE
        • Matt: .05 FTE
        • Laurie: .05FTE
        • Scan Techs: 2, each .375FTE
        • QC Techs: 2, each .25FTE
    • Grant Development, Project Development, & Prototypes:
      • Continue to support new projects to support the Libraries' collection development goals, including those that may soon be grant funded:
        • Florida Aerials
        • Digging into Data
        • Historic St. Augustine
        • Physical conservation and digital preservation for architectural drawings
        • Collaborative ASERL Project, "Intellectual Underpinnings of the American Civil War"
    • Smaller Projects:
      • Continue to support smaller projects, especially for exhibits and to share experience/cross-train
      • Florida History materials (Collaborative ASERL Project, "Intellectual Underpinnings of the American Civil War")
    • Older Projects:
      • Continue to complete older projects and migration of older project materials as identified for priority by collection managers. Projects needing completion include these and many others:
        • Baldwin Phase I and II materials, needs 1,706 volumes as of 7/2009 to meet total of 7,339 from grant reports
        • Herbarium Specimens
        • Migration of PALMM materials to UFDC (FEOL and FHP are all that remain as of 7/2009)
        • Samuel Proctor Oral Histories: all that were DARK and burned to DVD need to be pulled, copyright status changed to permissions grant per SPOHP, and loaded (all that were on the SAN have been fully processed)
        • Ephemeral Cities newspapers
        • NDNP newspapers
        • Continue to work through digitized microfilm backlog
    • Metadata corrections
      • Newspapers: dates for all and in proper serial hierarchy
        • Then, all standardized into English for automatic translations, but this requires all/majority of pages in translation prior
      • Serials:
        • All in uniform and proper serial hierarchy
      • Updates
        • Container as an element, mapping to box number from EAD
        • Diacritics: corrected, normalized to programmatically ensure corrections remain consistent in support of searching with and without diacritics with proper results
      • Specific projects/issues
        • FIRM Maps
        • FGS: Normalized as much as possible into sets without deviating from their frequent deviation
        • Caribbean Studies Association: Preference is for series title noting conference proceedings and year
        • More as found from catalog records (wrong OCLC, Aleph entered at some point)
        • Yulee: Records are only item level; need container (box) and folder title added for 2,400 Yulee items prior to June 2010 for Yulee day
        • Everglades Digital Library Founders Collection: on postcards, in QC label recto and verso for front and back instead of sequential page numbers

Last modified: Saturday September 17 2011 lnt