Software
I write software for libraries and archives. I try to live up to at least the former in Release Early, Release Often, so most of my code musings can be found on GitHub. The idea is things incubate there, and if they ever become anything more than a plaything they migrate to a place like CPAN, RubyForge, Python Cheeseshop, etc…
Here are some things I’ve worked on in the past that are on GitHub (courtesy of Kenny Katzgrau’s handy GitHub/BitBucket Project Lister for WordPress):
-
bagit
create BagIt style packages of digital content (26 watchers)
-
dflat
an implementation of the dflat and redd specifications from CDL for versioning of digital objects (16 watchers)
-
empirical-cloud
a little demo visualization of owl:sameAs links in billion triple challenge data (8 watchers)
-
dewey-crawler
simplistic crawler and serializer for linked data at dewey.info (7 watchers)
-
dev8d-linked-data
some experiments with linked data available from the dev8d conference (6 watchers)
-
chronam-widget
view on NDNP content using just HTML/JavaScript and the Chronicling America API (5 watchers)
-
bisac
top level BISAC subject vocabulary (4 watchers)
-
europeana-crawler
a simple crawler of the RDFa in Europeana (4 watchers)
-
data-gov-uk-harvester
tiny little project to harvest rdfa metadata from data.gov.uk (3 watchers)
-
databib-metadata
example html/metadata examples for databib (3 watchers)
-
django-sugar
Curated collection of all the sweet Django helpers/utilities developers create, and sometimes recreate too often. (2 watchers)
-
ead-finder
use Google to find public EAD XML documents (2 watchers)
-
bagit-ruby
Ruby Library and Command Line tools for bagit (1 watcher)
-
ckanext-storage
CKAN storage extension. (1 watcher)
-
alto-words
simplistic calculation of the ratio of dictionary words to all words in a METS Alto OCR file (1 watcher)
-
collection
Cooper-Hewitt's Collection Database (1 watcher)
-
beat
little experiment to look at links in LC bibliographic data (1 watcher)
-
django-pagination
A set of utilities for creating robust pagination tools throughout a django application. (1 watcher)
-
bootstrap
CSS toolkit from Twitter (1 watcher)
-
django-tastypie
Creating delicious APIs for Django apps since 2010. v1.0.0-beta (1 watcher)
-
aotycmp
hack to see what well reviewed albums-of-the-year are available on Spotify and Rdio (1 watcher)
-
fastcat
navigate wikipedia categories quickly in a local redis instance (1 watcher)
-
fido
Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is designed for simple integration into automated work-flows. (1 watcher)
-
congress-legislators
Members of the United States Congress, 1789-Present, in YAML, as well as committees and presidents. (0 watchers)
-
echochamber
download/visualize the connections between the followers of a given Twitter user (0 watchers)
-
emailz
turn mboxen into rdf, and visualize w/ d3 (0 watchers)
-
dpla-map
a simple pure html/javascript DPLA/GoogleMap mashup (0 watchers)
-
antiharassment-policy
Code4lib anti-harassment policy drafting space (0 watchers)
-
archivesspace
The ArchivesSpace archives management tool (0 watchers)
-
bell
Alexander Graham Bell Family Papers Metadata (0 watchers)