<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-4926451824261299693</id><updated>2012-02-01T04:31:18.179-08:00</updated><category term='preserv2'/><category term='digital library'/><category term='Atom'/><category term='blackboard'/><category term='Fedora'/><category term='#scape'/><category term='DROID'/><category term='ECDL2008'/><category term='#osdiii'/><category term='OAI-ORE'/><category term='risk analysis'/><category term='#opf'/><category term='ipres09'/><category term='Pronom'/><category term='#openplanets #opf #digitalpreservation'/><category term='linked data'/><category term='signature files'/><category term='iedemonstrator'/><category term='#sits #crig #dev8d #code4lib #sword #orcid #rest #apis #crud'/><category term='preservation watch'/><category term='#fileidentification'/><category term='web services'/><category term='digitization'/><category term='#digitalpreservation'/><category term='scape'/><category term='#sword2 #depositmo easychair'/><category term='agents'/><category term='EPrints'/><category term='CRIG'/><category term='preservation'/><category term='#preserv2'/><category term='opf'/><category term='github debian deb changelog'/><category term='dorsdl2'/><category term='#openplanets'/><category term='#crigshow'/><category term='Preserv'/><category term='OAI-PMH'/><category term='repository'/><category term='Open Repositories 2008'/><title type='text'>Daves thoughts on stuff</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>22</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-1386325288664783565</id><published>2012-01-20T08:27:00.001-08:00</published><updated>2012-01-20T08:42:12.124-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='github debian deb changelog'/><title type='text'>Making Debian Changelogs from Github repositories</title><content type='html'>One of the many things that irks me is the gap between good developers who put all their code on platforms such as GitHub, and those who then actually bother to put some effort into packaging up their code for easy platform installation. &lt;br /&gt;&lt;br /&gt;I have come to the realisation that this is mainly due to the pedantic nature of packaging formats and platform lock in. One such example is the exacting format of the debian changelog...&lt;br /&gt;&lt;br /&gt;&lt;a href="http://p2-registry.ecs.soton.ac.uk/opf/gh2ch/"&gt;GitHib2Changelog&lt;/a&gt; is a bit of code that I knocked together to help in this situation. It takes a GitHub repository URL and builds a debian changelog from the repository commits and tags.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://2.bp.blogspot.com/-dQQtRWJC1zw/TxmYH4qTEwI/AAAAAAAAAFE/RH2MrLzNCZs/s1600/gh2hc.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 580px;" src="http://2.bp.blogspot.com/-dQQtRWJC1zw/TxmYH4qTEwI/AAAAAAAAAFE/RH2MrLzNCZs/s1600/gh2hc.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5699754064650375938" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;By looking at the tags and commits it works out which commits are related to which tags (something GitHub APIv3 doesn't do) and then outputs this directly to you already formatted. &lt;br /&gt;&lt;br /&gt;The service is built in php, and is web based with both a pretty front end and API access. &lt;br /&gt;&lt;br /&gt;Ironically, since i've now committed the code to GitHub &lt;a href="https://github.com/davetaz/Github2ChangeLog"&gt;here&lt;/a&gt; I now need to use the service on itself and build the easy to install packages. More on that soon...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-1386325288664783565?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/1386325288664783565/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=1386325288664783565' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/1386325288664783565'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/1386325288664783565'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2012/01/making-debian-changlogs-from-github.html' title='Making Debian Changelogs from Github repositories'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/-dQQtRWJC1zw/TxmYH4qTEwI/AAAAAAAAAFE/RH2MrLzNCZs/s72-c/gh2hc.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-3103110777547409006</id><published>2012-01-19T06:51:00.000-08:00</published><updated>2012-01-20T08:39:08.926-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='#sword2 #depositmo easychair'/><title type='text'>DepositMOre - The Prototype</title><content type='html'>Building on the success of DepositMO and SWORDv2, I thought it would be a good idea to put a quick HTML5 client together to save myself some pain.&lt;br /&gt;&lt;br /&gt;The basic premise of this web-based client is to automatically search for "your stuff" in a number of ways and then allow it all to be submitted to a repository in one click. &lt;br /&gt;&lt;br /&gt;&lt;a href="http://1.bp.blogspot.com/-WDA-88sy11c/Txgvj2mr50I/AAAAAAAAAEI/Yfwv3R4oXhs/s1600/easychair_desktop.png"&gt;&lt;img style="float:right; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 320px; height: 162px;" src="http://1.bp.blogspot.com/-WDA-88sy11c/Txgvj2mr50I/AAAAAAAAAEI/Yfwv3R4oXhs/s320/easychair_desktop.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5699357621437065026"&gt;&lt;/a&gt;First target for me was www.easychair.org. This service is used as an online conference submission and review system. In a nut-shell if an author wants to get accepted into a conference, easychair is one system which they WILL have to battle with in order to submit their content. As a result there is a strong potential that easychair knows about many publications which should also be present in other systems.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://2.bp.blogspot.com/-uD3VFZnJ9VA/TxgwT1_4uQI/AAAAAAAAAEU/tvKLK2oAYr8/s1600/easychair_paper.png"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 320px; height: 174px;" src="http://2.bp.blogspot.com/-uD3VFZnJ9VA/TxgwT1_4uQI/AAAAAAAAAEU/tvKLK2oAYr8/s320/easychair_paper.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5699358445908048130"&gt;&lt;/a&gt;From the main screen in easychair it is possible to navigate and find the many conference publications which you have submitted. Each publication is tied to a conference and it can take a substantial number of clicks to navigate between each publication.&lt;br /&gt;&lt;br /&gt;&lt;h2&gt;DepositMOre&lt;/h2&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://3.bp.blogspot.com/-1vtThDXev9E/TxgxHmACtZI/AAAAAAAAAEg/jpP977aiIsU/s1600/deposit_more_login.png"&gt;&lt;img style="float:right; margin:0 0 10px 10px;cursor:pointer; cursor:hand; height: 200px;" src="http://3.bp.blogspot.com/-1vtThDXev9E/TxgxHmACtZI/AAAAAAAAAEg/jpP977aiIsU/s320/deposit_more_login.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5699359334966932882"&gt;&lt;/a&gt;DepositMOre is a modular system which is intended to be a home for many services which locate your publications. The first module to be developed is for easychair.&lt;br /&gt;&lt;br /&gt;By simply providing your login credentials to the DepositMOre system, it will not only list all your authored items from easychair but also check if these are present in your locally detected repository. If they are not deposited, and they should be then one click will do this for you. &lt;br /&gt;&lt;br /&gt;&lt;a href="http://3.bp.blogspot.com/-Jn9fKQenFlo/TxhEiufSZWI/AAAAAAAAAE4/eDUgULqIVpY/s1600/depositmore.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 640px;" src="http://3.bp.blogspot.com/-Jn9fKQenFlo/TxhEiufSZWI/AAAAAAAAAE4/eDUgULqIVpY/s1600/depositmore.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5699380691822863714" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;A combination of HTML5 and SWORD2 make this process quick and seemless! Multiple items can be submitted at once and as each are submitted you can instantly click a link to your item and can view it in the repository.&lt;br /&gt;&lt;br /&gt;The following video gives a demo of the prototype in action. We hope to continue development with the support of a funded project.&lt;br /&gt;&lt;br /&gt;&lt;iframe width="640" height="360" src="http://www.youtube.com/embed/m3Jk2nSvPys" frameborder="0" allowfullscreen&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;br /&gt;&lt;h2&gt;Technologies Used&lt;/h2&gt;&lt;br /&gt;&lt;ul&gt;&lt;br /&gt;&lt;li&gt;HTML/Javascript/JQuery/PHP&lt;/li&gt;&lt;br /&gt;&lt;li&gt;SWORD2 PHP Library - Stuart Lewis - https://github.com/stuartlewis/swordappv2-php-library/&lt;/li&gt;&lt;br /&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-3103110777547409006?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/3103110777547409006/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=3103110777547409006' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/3103110777547409006'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/3103110777547409006'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2012/01/depositmore-prototype.html' title='DepositMOre - The Prototype'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/-WDA-88sy11c/Txgvj2mr50I/AAAAAAAAAEI/Yfwv3R4oXhs/s72-c/easychair_desktop.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-7303220030282368396</id><published>2011-10-31T04:23:00.001-07:00</published><updated>2011-10-31T04:41:55.452-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='DROID'/><category scheme='http://www.blogger.com/atom/ns#' term='preservation watch'/><category scheme='http://www.blogger.com/atom/ns#' term='opf'/><category scheme='http://www.blogger.com/atom/ns#' term='scape'/><category scheme='http://www.blogger.com/atom/ns#' term='signature files'/><title type='text'>A little preservation watch tool for DROID users</title><content type='html'>&lt;div&gt;Ever wondered what has changed in each new signature file released for The National Archives DROID tool?&lt;/div&gt;&lt;br /&gt;&lt;div&gt;Want a way to find out what objects a new signature file might affect or reclassify?&lt;/div&gt;&lt;br /&gt;&lt;div style="font-size: 3em" align="center"&gt;&lt;a href="http://users.ecs.soton.ac.uk/dt2/droid/"&gt;DROID Sig Change Tool&lt;/a&gt;&lt;/div&gt;&lt;br /&gt;&lt;div&gt;I have collected together all the available DROID signature files (still want more) and produced a little preservation watch tool that surmises changes between signature file versions. A summary is produced which outlines the signatures and file formats added to each new signature file. Additionally by selecting any two of the signature files, a user is also able to compare two specific versions.&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;div&gt;Soon to be added will be the ability to subscribe to an email or RSS feed which alerts of new signature files and changes, allowing active preservation watch.&lt;/div&gt;&lt;br /&gt;&lt;div&gt;In order to tailor this more to a users n  eeds I'm contemplating allowing selection of specific extensions/formats which users care a lot about and producing an alerting service which focusses on changes to only these types.&lt;/div&gt;&lt;br /&gt;&lt;div&gt;Thoughts welcome...&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-7303220030282368396?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/7303220030282368396/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=7303220030282368396' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/7303220030282368396'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/7303220030282368396'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2011/10/little-preservation-watch-tool-for.html' title='A little preservation watch tool for DROID users'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-2816574899736050876</id><published>2011-07-18T06:01:00.001-07:00</published><updated>2011-07-18T06:50:25.940-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='#openplanets'/><category scheme='http://www.blogger.com/atom/ns#' term='#digitalpreservation'/><category scheme='http://www.blogger.com/atom/ns#' term='#opf'/><category scheme='http://www.blogger.com/atom/ns#' term='#fileidentification'/><category scheme='http://www.blogger.com/atom/ns#' term='#scape'/><title type='text'>More on File Identification Tools</title><content type='html'>Since I last wrote a post on this back in March I have started some work for the Open Planets Foundation. As I said in my previous post, I see no reason to have too many unmaintainable tools when we could just pick the best one... the problem is making this choice (for some). &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Which tool?&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;Simple - The one which is currently the most widely adopted... &lt;b&gt;file&lt;/b&gt;. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Opinions may vary on this however ALL of these arguments talk about the feature set of a particular tool or the slowness of the tool when scanning billions of files. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Feature Sets and Ease of Use&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;File&lt;/b&gt; is a very simple tool which offers a mime-type and limited metadata exposing of the file types about which it knows. It only accepts single file execution, however you can wildcard it's input in the linux shell and it executes extremely quickly. In my testing file took 2.375s to identify 1000 files, that's 421 files a second (see the comparison to DROID and FIDO @ http://www.openplanetsfoundation.org/blogs/2010-11-03-fido-%E2%80%93-high-performance-format-identifier-digital-objects#comment-75). &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Other tools offer more power in other ways, so DROID fits well with The National Archives (UK) Digital Continuity project, providing a PRONOM identifier and mapping back to tools which can perform many operations on these files. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;DROID&lt;/b&gt; is an ever improving tool as the underlying data (number of files it can identify) expands. However certain decisions have led to it becoming difficult in recent times to simply profile a single file. Rather it has become a tool which is now quiet heavily integrated with other systems rather than loosely coupled. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;FIDO&lt;/b&gt; resulted from this realisation that DROID was becoming too slow and painful to profile a single file. Originally written in python, FIDO performs the same classification as DROID (using the same signatures) in a much shorter time and provides the PRONOM PUID as output. FIDO provided a great proof of concept that this operation could be quick however suffers from the problem that someone has wrapped the current release (0.9.5) of FIDO in java. This slows it down significantly due to having to launch the JAVA VM! &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;FITS&lt;/b&gt; - The file identification tool set.&lt;/div&gt;&lt;div&gt;This final tool pretty much wraps everything, some really useful and detailed output can be gained as a result of running not only all the identification tools, but also classification tools like exiftool and ffmpeg. Lots of detail, really slow! Also, more so than any of the others, FITS suffers from the problem of being up to date more than the others. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The problem with FITS is that it wraps the other tools to provide one output. In the case of FITS they still wrap DROID v4 with a very old signature file. They chose DROID v4 as this was the last version with decent command line execution, which is (i'm guessing) the way they call it. FITS wraps a great number of other tools in it's distribution as well, such as the exiftool, which already have package managed versions, thus all the tools FITS uses are being constantly dated by new versions which then required the effort to wrap them. FITS is a great attempt at a very valuable tool, however the problems with the tools it bundles being updated constantly is likely to cause many maintainability problems. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Along with &lt;b&gt;all the other tools&lt;/b&gt;, FITS suffers from the fact that it doesn't update the DROID signature file (a format which hasn't changed) automatically. This is a simple way for a tool to keep up to date, we should not have to rely on the users doing this when the tool should just be doing it for you! People are lazy these days and expect the package to just work, or their to be an available update of the whole package (akin to App Store approaches). The users are right in this respect BTW! &lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;File&lt;/b&gt;, the tool I haven't mentioned for a while is packaged managed by every widely used platform now, so if there is an update, people are alerted to it. From this point it only takes one click to download the latest, greatest and fastest version. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Packaging&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;/b&gt;Like Weird Al's song "Albuquerque" I have finally got to the point (but not the conclusion) ... Packaging.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;In order to keep users happy we MUST learn to allow them to download and use tools which suit them. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Personally I'm fed up of JAVA integration, when to install a package you have to first install Maven, then install something else, then do this..... blah blah bored....&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;We need to start packaging cross platform tools inside one click install MSIs (windows), RPM (redhat) and DEB packages (Ubuntu/Debian and the rest of Linux). &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I don't care if these packages install dependencies but the user shouldn't have to take more than one step to install a tool. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Futher the tool should either self update, or the user should be prompted to update it via the package managing option which &lt;b&gt;their &lt;/b&gt;operating system already contains. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Using packages, FITS could simply be a very small meta package which depends on other packages, thus keeping the whole suit of tools up to date... independently. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Conclusion&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt; * Yes a tool should be fast&lt;/div&gt;&lt;div&gt; * Yes a tool should be feature rich&lt;/div&gt;&lt;div&gt; * Most of all, you should be able to install it and keep it up to date easily!&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;What's Next&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;The billion dollar question, for me I've done some performance testing of the various tools and decided that speed is due to features. As a package gets bloated and feature rich, it becomes slower! The faster it is the simpler it is. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;What i'd like to be is to make the fastest one (&lt;b&gt;file&lt;/b&gt;) feature rich without bloating it and slowing it down. Also file is already package managed which saves me what appears (according to the other tools) to be a very hard job.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Some investigation on this aim is likely to follow, along with some requirements gathering and classification on critical features before things move forward.   &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-2816574899736050876?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/2816574899736050876/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=2816574899736050876' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/2816574899736050876'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/2816574899736050876'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2011/07/more-on-file-identification-tools.html' title='More on File Identification Tools'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-2661869222401824269</id><published>2011-03-14T04:38:00.000-07:00</published><updated>2011-03-14T04:57:35.068-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='#openplanets #opf #digitalpreservation'/><title type='text'>Preservation Tools - Moving Forward</title><content type='html'>&lt;div&gt;Over the last number of years, JISC and other bodies have funded a number of digital preservation projects which have resulted in some really valuable contributions to the area... now is the time to realise the benefits of this work and provide a digital preservation experience to everyday users. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;To achieve this a not insignificant amount of work needs to be undertaken, namely to identify key applications and separate these from the complex systems into which they have been built. Alternatively many applications now need re-thinking and the best bits built into system which have super-ceded these applications. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;File Format Identification Tools&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;File format identification now has a number of tools available, each with their own advantages and disadvantages, in no particular order they are:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;DROID: &lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;Started out as a tool to identify file types and versions of those types. :)&lt;/li&gt;&lt;li&gt;Each file version was assigned an identifier which could be referenced and re-used. :)&lt;/li&gt;&lt;li&gt;Identification of file was done via "signature", not extension matching. :)&lt;/li&gt;&lt;li&gt;Became complex as it was adjusted to suit workflows and provide much more complex information which few people understand or want :(&lt;/li&gt;&lt;li&gt;Added complexity increased the time required for each file classification, no longer a simple tool :(&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;FIDO:&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;A new cut down client which takes the DROID signature files and does the simple stuff again :)&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;FILE: &lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;A built in Unix tool installed on every Unix based system in the world already! :)&lt;/li&gt;&lt;li&gt;Does not do version type identification :(&lt;/li&gt;&lt;li&gt;Does not provide a mime-type URI :(&lt;/li&gt;&lt;li&gt;Very quick to run :)&lt;/li&gt;&lt;li&gt;Has the capacity to add version type identification and there is a TODO in the code for it! :)&lt;/li&gt;&lt;/ul&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;With the PRONOM registry now looking at providing URIs for file versions, why can't we stop coding new tools and change the FILE library. This way it could handle the version information and feed back the URIs if people want them. I've looked briefly into this and the PRONOM signatures should be easy to transport and use with the file tool.  &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;If I get time I might well have a go at this and feed it back to the community. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-2661869222401824269?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/2661869222401824269/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=2661869222401824269' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/2661869222401824269'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/2661869222401824269'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2011/03/preservation-tools-moving-forward.html' title='Preservation Tools - Moving Forward'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-6566015602337854566</id><published>2011-03-04T09:05:00.000-08:00</published><updated>2011-03-04T09:20:23.880-08:00</updated><title type='text'>Installing Kinect on Ubuntu (A full guide)</title><content type='html'>1) sudo apt-get install libglut3-dev build-essential libusb-1.0-0-dev git-core&lt;div&gt;&lt;br /&gt;2)  mkdir ~/kinect &amp;amp;&amp;amp; cd ~/kinect&lt;/div&gt;&lt;div&gt;&lt;br /&gt;3) git clone https://github.com/OpenNI/OpenNI.git &lt;/div&gt;&lt;div&gt;&lt;br /&gt;4) cd OpenNI/Platform/Linux-x86/Build&lt;/div&gt;&lt;div&gt;&lt;br /&gt;5) make &amp;amp;&amp;amp; sudo make install&lt;/div&gt;&lt;div&gt;&lt;br /&gt;6) cd ~/kinect/ &lt;/div&gt;&lt;div&gt;&lt;br /&gt;7) git clone https://github.com/boilerbots/Sensor.git&lt;/div&gt;&lt;div&gt;&lt;br /&gt;8) cd Sensor&lt;/div&gt;&lt;div&gt;&lt;br /&gt;9) git checkout kinect&lt;/div&gt;&lt;div&gt;&lt;br /&gt;10) cd Platform/Linux-x86/Build&lt;/div&gt;&lt;div&gt;&lt;br /&gt;11) make &amp;amp;&amp;amp; sudo make install&lt;/div&gt;&lt;div&gt;&lt;br /&gt;12) go to this page at openNI to download the latest NITE release for your platform:&lt;a href="http://www.openni.org/downloadfiles/openni-compliant-middleware-binaries/34-stable" style="text-decoration: none; line-height: 1.4; font-family: arial; "&gt;NITE download page&lt;/a&gt; or for the impatient:&lt;br /&gt;&lt;a href="http://www.openni.org/downloadfiles/openni-compliant-middleware-binaries/stable/54-primesense-nite-beta-build-for-for-ubuntu-10-10-x86-32-bit-v1-3-0/download" style="text-decoration: none; line-height: 1.4; font-family: arial; "&gt;32-bit&lt;/a&gt;&lt;br /&gt;&lt;a href="http://www.openni.org/downloadfiles/openni-compliant-middleware-binaries/stable/53-primesense-nite-beta-build-for-for-ubuntu-10-10-x64-64-bit-v1-3-0/download" style="text-decoration: none; line-height: 1.4; font-family: arial; "&gt;64-bit&lt;/a&gt;&lt;br /&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;br /&gt;13) Save the NITE tarball to ~/kinect and untar it&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;br /&gt;14) cd ~/kinect/NITE/Nite-1.3.0.17/Data &lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;br /&gt;15) Open Sample-User.xml,Sample-Scene.xml and Sample-Tracking.xml and replace the existing License line with the line below: &lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: arial; line-height: 22px; "&gt;NOTE: this is case sensitive! &lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;span style="line-height: 1.4; font-family: arial; "&gt;&lt;br /&gt;&lt; vendor="PrimeSense" key="0KOIk2JeIBYClPWVnMoRKn5cdY4="&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;span style="line-height: 1.4; font-family: arial; "&gt;&lt;br /&gt;16) Repear step 15 and replace the existing MapOutputMode line with the line below in all 3 files.&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;span style="line-height: 1.4; font-family: arial; "&gt;&lt;br /&gt;&lt; xres="640" yres="480" fps="30"&gt;&lt;br /&gt;&lt;br /&gt;19) sudo niLicense PrimeSense 0KOIk2JeIBYClPWVnMoRKn5cdY4=&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;span style="line-height: 1.4; font-family: arial; "&gt;&lt;br /&gt;20) cd ~/kinect/NITE/Nite-1.3.0.17/&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;span style="line-height: 1.4; font-family: arial; "&gt;&lt;br /&gt;21) sudo ./install.bash&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;span style="line-height: 1.4; font-family: arial; "&gt;&lt;br /&gt;22) make &amp;amp;&amp;amp; sudo make install&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;span style="line-height: 1.4; font-family: arial; "&gt;&lt;br /&gt;23) cd ~/kinect/NITE/Nite-1.3.0.17/Samples/Bin&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;span style="line-height: 1.4; font-family: arial; "&gt;&lt;br /&gt;24) sudo adduser YOURNAME video&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;span style="line-height: 1.4; font-family: arial; "&gt;&lt;br /&gt;25) nano /usr/etc/primesense/XnVHandGenerator/Nite.ini by uncommenting the two config parameters it contains&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;span style="line-height: 1.4; font-family: arial; "&gt;&lt;br /&gt;26) sudo nano /etc/udev/rules.d/51-kinect.rules&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;span style="line-height: 1.4; font-family: arial; "&gt;&lt;br /&gt;# ATTR{product}=="Xbox NUI Motor"&lt;br /&gt;SUBSYSTEM=="usb", ATTR{idVendor}=="045e", ATTR{idProduct}=="02b0", MODE="0666"&lt;br /&gt;# ATTR{product}=="Xbox NUI Audio"&lt;br /&gt;SUBSYSTEM=="usb", ATTR{idVendor}=="045e", ATTR{idProduct}=="02ad", MODE="0666"&lt;br /&gt;# ATTR{product}=="Xbox NUI Camera"&lt;br /&gt;SUBSYSTEM=="usb", ATTR{idVendor}=="045e", ATTR{idProduct}=="02ae", MODE="0666"&lt;br /&gt;&lt;br /&gt;27) sudo /etc/init.d/udev restart&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;span style="line-height: 1.4; font-family: arial; "&gt;&lt;br /&gt;28) cd ~kinect/Nite-1.3.0.17/Samples/Bin/&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="line-height: 18px; font-family: arial; "&gt;&lt;span style="line-height: 1.4; font-family: arial; "&gt;&lt;br /&gt;29) ./Sample-PointViewer and PLAY&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-6566015602337854566?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/6566015602337854566/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=6566015602337854566' title='17 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/6566015602337854566'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/6566015602337854566'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2011/03/installing-kinect-on-ubuntu-full-guide.html' title='Installing Kinect on Ubuntu (A full guide)'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>17</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-318178988423514450</id><published>2010-11-22T07:13:00.000-08:00</published><updated>2010-11-22T09:11:58.998-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='#sits #crig #dev8d #code4lib #sword #orcid #rest #apis #crud'/><title type='text'>Hot Topics in Scholarly Systems</title><content type='html'>Since I last wrote a blog post the world has been going through some harsh times where cutbacks and simplifications have been essential. The phrase "Throw money at it" no longer applies to anything and all of a sudden organisations as well as people seem far more keen to share than before (although we are still not fully open and sharing, mostly it's organisations wanting stuff without sharing themselves, but we'll get there).&lt;br /&gt;&lt;br /&gt;Anyway enough of that, what is actually happening?&lt;br /&gt;&lt;br /&gt;Well I am very proud to be at the forefront of an international effort to hold a series of scholarly technology meetings focussed on solving institutional problems. These meetings, known as the Scholarly Information Technical Summit (SITS) meetings, are being held in alongside many international conferences over the next 2 years and are being backed by all the major international funding bodies. See  http://bit.ly/Scholarly_Infrastructure_Technical_Summit for more info.&lt;br /&gt;&lt;br /&gt;There have now been 2 meetings, although SITS only came about because the first one was so successful. Each meeting conforms to the Open Agenda (see wikipedia) principal and is chaired likewise. This leads to the agenda being very pertinent to the people in the room and often creates conversation critical to the forward momentum of some of the technologies discussed. In the next few paragraphs I'm going to try and summerise the hot topics from the first meeting:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;SWORD&lt;/span&gt; - Put stuff in a repository&lt;br /&gt;&lt;br /&gt;SWORD has undoubtedly been a huge success, it's simple and well supported by many publishers and publishing software (including most notably the Microsoft office suite via the author add-in tool http://research.microsoft.com/authoring). There are however some problems which the community wants to address without making it more complex:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Packaging Formats - What exactly do you submit in your SWORD bundle, how should it be formed. There was no clear consensus other than we feel endpoints should try to support a multitude of formats depending on their users.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Endpoints are hard to find, for both users and the software, this could do with being addressed either via negotiation or meta tags of some sort.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;URIs in the returned package are not well specified to say what they mean or what they should mean.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Not a complete CRUD model&lt;/li&gt;&lt;li&gt;No levels of compliance any more&lt;/li&gt;&lt;li&gt;SWORD uses basic auth (too basic?)&lt;/li&gt;&lt;/ul&gt;The general call was that these points need addressing without making the SIMPLE (that's what the S stands for) too complex. CRUD looks interesting.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;OUTCOME: &lt;/span&gt;A follow on &lt;span style="font-weight: bold;"&gt;SWORD&lt;/span&gt; project has been funded by JISC (UK) along with a number of complementary (but separate) projects including &lt;span style="font-weight: bold;"&gt;DepositMO &lt;/span&gt;(http://blogs.ecs.soton.ac.uk/depositmo) and &lt;span style="font-weight: bold;"&gt;SONEX&lt;/span&gt; (http://sonexworkgroup.blogspot.com/).&lt;br /&gt;&lt;br /&gt;Personally i'm involved in DepositMO which intends to use SWORD (+CRUD) at it's core and extend this even further (outside of SWORD) to be fully interactive with the users. More can be found on the levels of conformance via the DepositMO blog (http://blogs.ecs.soton.ac.uk/depositmo).&lt;br /&gt;&lt;br /&gt;Package guidelines are to be set out by the new SWORD project along with tight definitions on what URIs mean and what it means to CRUD those URIs.&lt;br /&gt;&lt;br /&gt;Being written in to both projects I hope to bring not only technical knowledge to the table but also real world usages.&lt;br /&gt;&lt;br /&gt;There was also a call to look into technologies like OAuth and it's usages in SWORD, however this was a minor part of what became a major conversation at the second meeting.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;Inverse Sword&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;span style="font-weight: bold;"&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;This conversation started on workflows and a discussion on the opportunities for common workflows and their impact. &lt;span style="font-weight: bold;"&gt;&lt;span style="font-weight: bold;"&gt;&lt;span style="font-weight: bold;"&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;The problem is that workflows tend to be very specific and quiet heavy weight in their approach to a problem, often constrained by the domain. This is the advantage of SWORD, it doesn't specify one, just a technique for transferring stuff. So what about reverse SWORD where you request a URI and a packaging format you want.&lt;br /&gt;&lt;br /&gt;This basically then re-inforced the conversation on what it meant to have SWORD endpoints supporting full CRUD using content negotiation to agree on packaging formats.  Clearly something to take forward... as it was!&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Storage for Digital Repositories&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;Question was (not from me): What is their beyond the Akubra (now DuraCloud) and my two projects (one of which has been finished)? &lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;It is clear that there are now a whole range of storage options and technologies with infinite numbers of APIs, luckily many of the cloud providers use the S3 API (which is good!). So what rules languages are there for expressing &lt;span style="font-weight: bold;"&gt;where&lt;/span&gt; things should be stored?&lt;br /&gt;&lt;br /&gt;I briefly explained the EPrints implementation (labelled as mine but it isn't, it's EPrints property) which uses lightweight plug-ins to communicate with each service. These plug-ins implement 4 API calls (Store, Retrieve, Delete and one other necessary I won't bother explaining here). There is then an XML/XSLT based policy file which dictates which plug-ins are used to store what. Each file is then stored and metadata adjusted to state where it is stored in case policy changes. Upon a policy change, the files can be re-arranged to their correct locations again. This can also handle changes in storage architecture and whole services being off-lined. Advantage with this approach, which the community likes, is that you can use any number of storage solutions simultaneously and store as many copies of files on different ones as you like. For more see http://eprints.ecs.soton.ac.uk/17084/.&lt;br /&gt;&lt;br /&gt;The actions from this were that others were going to look at this implementation to see if this rule based language could apply on other repository platforms. Further it would be nice to have some good reference architectures available from vendors.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Services and Configuration Languages &lt;/span&gt;(was Common Platforms/Tools on the day)&lt;br /&gt;&lt;br /&gt;This was an interesting conversation which started around the idea of being able to re-use technologies by re-using/calling code libraries directly. The problem is here (as I see it) the number of coding environments and versions of these environments available.&lt;br /&gt;&lt;br /&gt;The solution is REST (not SOAP) APIs on the web and abstraction APIs in the code (e.g. SOLR) which enable you to call functions from (say) the command line, without having to &lt;span style="font-style: italic;"&gt;understand&lt;/span&gt; the code.&lt;br /&gt;&lt;br /&gt;David Flanders perhaps summed it up best, there are levels of interaction, some easier than others:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Core System (hard)&lt;/li&gt;&lt;li&gt;Exposing structured data&lt;/li&gt;&lt;li&gt;End user interfaces (including APIs)&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;XML for configuration is a bit of a sticking point with users, but you need a machine readable language to configure the machine. Perhaps the point is here only use XML if you need it otherwise simple config files with "=" signs in is fine.&lt;br /&gt;&lt;br /&gt;There is no real answer to this question other than try and keep it simple... stupid.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Author IDs (URIs)&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;/span&gt;Yes it's our favourite topic raising its ugly head again!&lt;br /&gt;&lt;br /&gt;It is clear that there are many efforts in this area, none of which have fully succeeded yet&lt;span style="font-weight: bold;"&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;. &lt;/span&gt;There is still much interest in this area however and it is clear that we should be prepared to handle multiple IDs for a single author and be able to align them (if allowed) at a later stage.&lt;br /&gt;&lt;br /&gt;Currently the project to watch is ORCID which is a continuation of a previous project by Thompson (which failed commercially in this project).&lt;br /&gt;&lt;br /&gt;The consensus was however that we are not wrong to mint URIs for our authors in our repositories.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Conclusions&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;Identification/Authorisation is a problem, can technologies like OAuth not only help with authorisation but also with identification? This could be a very interesting area.&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;SWORD being taken forward is a very positive outcome of the first SITS meeting.&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;Simple services with simple APIs are so much more effective than "project centric" solutions and bloatware.&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;Simple services are usable by lots of people!&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-318178988423514450?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/318178988423514450/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=318178988423514450' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/318178988423514450'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/318178988423514450'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2010/11/hot-topics-in-scholarly-systems.html' title='Hot Topics in Scholarly Systems'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-1418019467012539878</id><published>2009-10-05T14:18:00.000-07:00</published><updated>2009-10-05T14:52:13.431-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='ipres09'/><title type='text'>#ipres09: Are You Ready? Assessing Whether Organizations are Prepared for Digital Preservation.</title><content type='html'>Planets (mainly Tesella) did an online survey targetd at libraries adn archives in europe and then opened it up to 2000 others. Got 206 responses (70% european), 1/3 libraries, 1/4 archvies. Roles - 15% DP, 16% preservation in general, 22% in curation, 16% in IT, rest other.&lt;br /&gt;&lt;br /&gt;Being a DP survey the majority said they were aware of DP issues. 1/6 haven't really thought that much about it and 1/2 don't have a DP policy. Interesting stat, if you have a DP policy in place your are 3 times more likely to have a budget in place. However the majority of those budgets are project budgets and not institutional budgets to actively perform DP in the organisation. &lt;br /&gt;&lt;br /&gt;Also in the survey people are having to preserve all sorts of resources including databases. &lt;br /&gt;&lt;br /&gt;Do people have control over the formats they can accept. National archives - Yes, National Archives (No). Others are pretty balanced between yes and no. &lt;br /&gt;&lt;br /&gt;Amount of content: most say that they have less than 100Tb of content to preserve but people see it expanding to much larger quantities [when it won't be any more scary to preserve than it is now, we'll soon have 1Tb memory keys - heard it here first].&lt;br /&gt;&lt;br /&gt;85% of people are trying to solve the problem and are looking to use plug-and-play components as people don't want to replace their current systems. &lt;br /&gt;&lt;br /&gt;Most important thing in preservation is being able to maintain authenticity [possible european stance]. Emulation is less important. In the middle of the importance graph was the the importance of using metadata standards, but no one can decide what standards to use. &lt;br /&gt;&lt;br /&gt;[Good survey, probably worth looking up and answering it yourself, especially if you are involved in a DP project! How do the aims of your project align with the view of the wider community?]&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-1418019467012539878?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/1418019467012539878/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=1418019467012539878' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/1418019467012539878'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/1418019467012539878'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2009/10/ipres09-are-you-ready-assessing-whether.html' title='#ipres09: Are You Ready? Assessing Whether Organizations are Prepared for Digital Preservation.'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-3194344172771760744</id><published>2009-10-05T14:00:00.000-07:00</published><updated>2009-10-05T14:16:20.033-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='ipres09'/><title type='text'>#iPres09: An Emergent Micro-Services Approach to Digital Curation Infrastructure.</title><content type='html'>More stuff, smaller budget, more technology [good start]. We can gain though with digital tech through redundancy, meaning (through context), utility (through service) and value through use [totally agree with the last point esp. wrt OA publishing].&lt;br /&gt;&lt;br /&gt;Projects imperatives: Do more with less, focus on the content, not the systems in which the content is managed [agree, focus on the data, not the software which people care less about]. This leads to the goal of the presentation which is micro-services, small services which provide maximum gain when they can be looped together, replaced, re-written etc.&lt;br /&gt;&lt;br /&gt;There are 12 currently in 4 layers - Storage, Charecterisation/Inventory, Service (index, search, migration), Value (Usage) - [Good slide this] "LOCKSS, Lots of description keeps stuff meaningful, Lots of services keeps stuff useful, Lots of uses keeps stuff valuable".&lt;br /&gt;&lt;br /&gt;Now describing some of the technologies which are being used. Simple restful api's to storage [however he hasn't mentioned the high level software they are using, based upon the first slide in the presentation said to use proven technology]. Emphasizing that storage has moved on (as it always will) and you can gain a lot of benefit though specialist products. &lt;br /&gt;&lt;br /&gt;Storage is the main phase they have done and they have a few other mirco-services done. [Like the aim of it all however there are already lots of tools to provide each it is just the keeping them apart which is hard]&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-3194344172771760744?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/3194344172771760744/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=3194344172771760744' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/3194344172771760744'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/3194344172771760744'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2009/10/ipres09-emergent-micro-services.html' title='#iPres09: An Emergent Micro-Services Approach to Digital Curation Infrastructure.'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-5274314513105060713</id><published>2009-10-05T13:39:00.000-07:00</published><updated>2009-10-05T13:58:58.106-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='ipres09'/><title type='text'>#iPres09: e-Infrastructure and digital preservation: challenges and outlook</title><content type='html'>e-infrastructure: Starts by defining infrastructure (see wikipedia) and e-infrastructure specific to a collection of European digital repositories. So basically we are looking at opportunities to build and supply services which are applicable to these repositories. &lt;br /&gt;&lt;br /&gt;Background: EU is supplying lots of support for this and in germany they are researching national approaches, identifying activities and assign tasks to "expert" institutions. By introducing the current fields of project he is outlining that there is still a significant mismatch between the scale of the problem and the amount of effort being expended. From this he outlines that there is a significant lack of common approaches to solving problems. [I don't think this will ever go away, unless there is a mandate, and even then not everyone will want to sign up].&lt;br /&gt;&lt;br /&gt;[Lots of argument] Funding is focused on many individual projects and thus doubles up the the argument that there are no commons. This led leads to a slide about interoperability and standards and the lack of them. [Which again, i don't think will ever go away and I think that we should be appreciative that people tend to pick XML to encode their data in, this makes it interoperable right]. &lt;br /&gt;&lt;br /&gt;[This is a start of project presentation, I don't seem to see that much output. They have some simple models as diagrams, again though at this stage it is hard to see how they are not just another project which will come up with (another) set of standards which no one will then want to adopt.] &lt;br /&gt;&lt;br /&gt;Giving a set of examples now where they are going to re-use and extend existing software/projects. The goals are good, in terms of concrete steps for global infrastructure for registries, data formats, software deposits and risk management. [Just not sure how achievable all this is based upon the fact it has been the aim of many projects already]&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-5274314513105060713?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/5274314513105060713/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=5274314513105060713' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/5274314513105060713'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/5274314513105060713'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2009/10/ipres09-e-infrastructure-and-digital.html' title='#iPres09: e-Infrastructure and digital preservation: challenges and outlook'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-1374762836853461443</id><published>2009-09-08T07:01:00.000-07:00</published><updated>2009-09-08T07:26:38.474-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='preservation'/><category scheme='http://www.blogger.com/atom/ns#' term='linked data'/><category scheme='http://www.blogger.com/atom/ns#' term='digitization'/><title type='text'>Thoughts on digitization, data deluge and linking</title><content type='html'>It's been a while since I've put a post up and this is probably due to being busy and also trying to tidy up a lot of stuff before starting on new projects.&lt;br /&gt;&lt;br /&gt;In this post then: &lt;b&gt;Digitisation&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;I never really gathered how big the area of digitisation is and how many non repository people are actively involved in digitisation. There are a great many projects &gt;50 who are digitising resources and these include national libraries. Items being digitised include everything from postcards and newspapers to full books and old journals.&lt;br /&gt;&lt;br /&gt;So what's the problem here ... simple ... how many people are digitising the same things? Yes I know that there is so much out there that this is unlikely to be the case however it brings me nicely to the problem of information overload. There is already more valuable information on the internet than we can possibly handle effectively, so how do you ensure that any resources you digitize for open access usage on the web can be found and used? &lt;br /&gt;&lt;br /&gt;I don't normally say this but perhaps we should look at physical libraries for the answer. Libraries are a very good central point where you can find publications related to all subject areas, and if your local library does not have a copy then it will try and find a copy somewhere else. &lt;br /&gt;&lt;br /&gt;How then does this map onto the web? Web sites become the library and links become the references to additional items or items this site does not contain, simple right? Unfortunately with 50+ projects I can count already, this leads to 50+ different web sites all with differing information presented in different ways. Due to the presentation of each web site being totally different this means that in fact they are not a library - that pride themselves on the standard way to organise resources -&lt;br /&gt;thus web sites become books. Thus to find resources we have to rely on search engines and federation. Thus we are back to where we started and we have a problem with  information overload. &lt;br /&gt;&lt;br /&gt;Unfotunately I don't have an answer to this problem, however I do know that links hold the key to the solution. Each website at the moment is simply an island of infromation, what is desperately required is the authors and community to establish links to these resources. If digitisation houses are curating refereed resources then the simplist way to link to these would be to put information about them on wikipedia.&lt;br /&gt;&lt;br /&gt;This would be my final point then, wikipedia is actually a good thing, simply because of the the community aspect. However it also provides many other huge benefits:&lt;br /&gt;&lt;ul&gt;&lt;br /&gt;&lt;li&gt;External resources such as photoes have to have a licience&lt;/li&gt;&lt;br /&gt;&lt;li&gt;In annotating a page/item you create links and establish facts which are available by semantic wikipedia (dbpedia)&lt;/li&gt;&lt;br /&gt;&lt;li&gt;Wikipedia is an easy way to establish your presence on the link data web (linkeddata.org)&lt;/li&gt;&lt;br /&gt;&lt;/ul&gt;&lt;br /&gt;&lt;br /&gt;So if you are digitising books by an author, add this link to their wikipedia page. If you are digitising a collection of World War images, add links to some of these to wikipedia and flikr.&lt;br /&gt;&lt;br /&gt;Establish links and help yourself to help everyone else.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-1374762836853461443?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/1374762836853461443/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=1374762836853461443' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/1374762836853461443'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/1374762836853461443'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2009/09/thoughts-on-digitization-data-deluge.html' title='Thoughts on digitization, data deluge and linking'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-3246944847930521768</id><published>2009-05-07T08:43:00.000-07:00</published><updated>2009-05-07T08:57:51.019-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='preservation'/><category scheme='http://www.blogger.com/atom/ns#' term='risk analysis'/><category scheme='http://www.blogger.com/atom/ns#' term='preserv2'/><title type='text'>File Format Risk Analysis! How hard can it be?</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_Q9BtIMtZDxc/SgMEefAnE5I/AAAAAAAAAAM/4fO-XHgzdZ4/s1600-h/risk_analysis.png"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 320px; height: 168px;" src="http://2.bp.blogspot.com/_Q9BtIMtZDxc/SgMEefAnE5I/AAAAAAAAAAM/4fO-XHgzdZ4/s320/risk_analysis.png" border="0" alt=""id="BLOGGER_PHOTO_ID_5333111305503445906" /&gt;&lt;/a&gt;&lt;br /&gt;The trouble with this is the sheer number of different formats out there (PDF,DOC,DOCX....) and the number of versions each of these has, each of which has it's own problems.&lt;br /&gt;&lt;br /&gt;Enter the P2 data registry which now contains over 40,000 facts relating solely to format types. I have been working on this for a while now and a full list of services can be found on the &lt;a href="http://p2-registry.ecs.soton.ac.uk"&gt;registry homepage&lt;/a&gt;. Although the &lt;a href="http://p2-registry.ecs.soton.ac.uk/SPARQL"&gt;SPARQL Endpoint&lt;/a&gt; is a great place to start it doesn't help you much with risk analysis.&lt;br /&gt;&lt;br /&gt;First thing is to determine which properties of a format or format super-type (PDF 1.6 has a super type PDF for example) are important when analyzing risk. Do you consider age, number of software tools, quality of documentation etc? And how do you add weightings to all of these properties as a factor of overall risk? &lt;br /&gt;&lt;br /&gt;Using the registry as a starting point I have drawn up a profile involving these properties as well as many others which each return a high, medium or low risk level. These are then sum-mated and normalised to give a single risk score.&lt;br /&gt;&lt;br /&gt;Enough of me talking: The live service (PDF 1.3) can be found &lt;a href="http://p2-registry.ecs.soton.ac.uk/risk_summary.php?format=http://nationalarchives.gov.uk/pronom/Format/616"&gt;here&lt;/a&gt; and &lt;a href="http://p2-registry.ecs.soton.ac.uk/risk_summary.php?format=http://nationalarchives.gov.uk/pronom/Format/617"&gt;here&lt;/a&gt; for PDF 1.4.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-3246944847930521768?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/3246944847930521768/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=3246944847930521768' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/3246944847930521768'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/3246944847930521768'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2009/05/file-format-risk-analysis-how-hard-can.html' title='File Format Risk Analysis! How hard can it be?'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_Q9BtIMtZDxc/SgMEefAnE5I/AAAAAAAAAAM/4fO-XHgzdZ4/s72-c/risk_analysis.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-3516378215565453655</id><published>2009-04-08T08:21:00.000-07:00</published><updated>2009-04-08T08:44:51.243-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='#preserv2'/><title type='text'>Less talk, less code, more data - The Preserv2 Data Registry</title><content type='html'>Yes, less talk more code (oxfordrepo.blogspot.com) is a good saying but i'm going to argue in this post that in fact we need more data! Having a ton of available services and a load of highly complex and well considered data models is all well and good but without data all of these services are useless; A repository is not a repository until it has something in it (Harnad). &lt;br /&gt;&lt;br /&gt;If we look outside of the repository community for a minute we find the web community we are accumulating a whole ton of data, wikipedia being the main point of reference here. Yet in the repository community we are not harnessing this open linked data model to enhance our data. &lt;br /&gt;&lt;br /&gt;I have been working in the area of digital preservation for a while now and the PRONOM file format registry (TNA UK) has been my friend for many years now and contains some valuable data. However I am concerned with the way I see it progressing. The main thing I use the PRONOM registry for is as a complement to DROID for file format information, and the data here is not even that complete. I am concerned however at the size of the new data model and the sheer effort which is going to be required to fill it with the data which it specifies. &lt;br /&gt;&lt;br /&gt;Why not looked to the linked data web to see how to tie a series of smaller systems together to make a much more powerful and easier to maintain one! &lt;br /&gt;&lt;br /&gt;This is where I have started with the preserv2 registry available at &lt;a href="http://p2-registry.ecs.soton.ac.uk/"&gt;http://p2-registry.ecs.soton.ac.uk/&lt;/a&gt;. &lt;br /&gt;&lt;br /&gt;The preserv2 registry is a semantic knowledge base (RDF triples based) with an SPARQL endpoint, RESTful services and a basic browser. Currently the data is focussed on file formats and is basically made up of the PRONOM database ported from a complex XML schema into simple RDF triples. On top of this i'm beginning to add data from dbpedia (wikipedia RDF'd) and making links between the PRONOM data and the dbpedia data! &lt;br /&gt;&lt;br /&gt;Already this is helping is ascertain a greater knowledge base and the cost of gathering and compiling this data is very low. Other than that the registry took me less than a week to construct! &lt;br /&gt;&lt;br /&gt;So "Go forth and make links" (Wendy Hall) is exactly what I'm now doing. With enough data you will be able to make complex OWL-S rules that can be used to deduce accurately facts such as formats which are at risk.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-3516378215565453655?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/3516378215565453655/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=3516378215565453655' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/3516378215565453655'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/3516378215565453655'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2009/04/talk-code-data-preserv2-data-registry.html' title='Less talk, less code, more data - The Preserv2 Data Registry'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-7166652851684473013</id><published>2009-01-21T05:45:00.000-08:00</published><updated>2009-01-21T05:50:13.043-08:00</updated><title type='text'>EPrints 3.2 - Amazon S3/Cloudfront Plug-in</title><content type='html'>A quick post to say that we have just successfully tested an EPrints 3.2 (svn) install with the new Storage Controller plugged into Amazon S3! &lt;br /&gt;&lt;br /&gt;This has quiet a lot of implications for both EPrints and other projects wanting to provide external services which operate on objects in a repository. We hope to bring people more news on this at the upcoming Open Repositories 2009 conference in Atlanta. &lt;br /&gt;&lt;br /&gt;For more information on this all check out storage section on the Preserv2 website @ &lt;a href="http://www.preserv.org.uk" target="_blank"&gt;www.preserv.org.uk&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-7166652851684473013?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/7166652851684473013/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=7166652851684473013' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/7166652851684473013'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/7166652851684473013'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2009/01/eprints-32-amazon-s3cloudfront-plug-in.html' title='EPrints 3.2 - Amazon S3/Cloudfront Plug-in'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-3694637630389146162</id><published>2008-09-18T02:14:00.001-07:00</published><updated>2008-09-18T02:15:39.108-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='digital library'/><category scheme='http://www.blogger.com/atom/ns#' term='repository'/><category scheme='http://www.blogger.com/atom/ns#' term='dorsdl2'/><category scheme='http://www.blogger.com/atom/ns#' term='ECDL2008'/><title type='text'>Institutions hate repositories... one simple reason.</title><content type='html'>Open access is not enough!&lt;br /&gt;&lt;br /&gt;People want to give Open Access to some of their materials at their institution however the IR software is seen as a means to manage all Institutional content and not just that which is Open Access and part of the external image of the Institution.&lt;br /&gt;The problem exists in the other direction as well where repository software is trying to solve these problems, thus people are not likely to use this software until it is included.&lt;br /&gt;&lt;br /&gt;So what do we end up with...&lt;br /&gt;&lt;br /&gt;Lots of Repository Islands which aren't interoperable with each other!&lt;br /&gt;&lt;br /&gt;So if we solve the access and copyright issue will people use the software? errrr No. At this point the software is an all in solution and not a service which can be utilised by current institutional practise ... Give up...?&lt;br /&gt;&lt;br /&gt;No!&lt;br /&gt;&lt;br /&gt;Focus on providing a service, e.g. something which can manage your Digital Resources and enable this to plug to existing institutional services. Some softwares would argue they support this already. OK good, so don't try and solve the problem if it is just an integration issue.&lt;br /&gt;&lt;br /&gt;To the repositories: Decouple! Build a set of services, build ways of plugging services together and allow the community to pic 'n' mix.&lt;br /&gt;&lt;br /&gt;To the institution: You already have access control systems ask your Information/Computer Systems department. You probably already have a Content Management System for educational resources for students (Blackboard? - Integrates with an LDAP server), these use external services to manage access and authentication! Here's a few services for you... LDAP, Radius, Eduroam, Domain Controller. &lt;br /&gt; &lt;br /&gt;&lt;/rant&gt; Till next time!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-3694637630389146162?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/3694637630389146162/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=3694637630389146162' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/3694637630389146162'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/3694637630389146162'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2008/09/institutions-hate-repositories-one.html' title='Institutions hate repositories... one simple reason.'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-185258925070698172</id><published>2008-07-19T16:43:00.001-07:00</published><updated>2008-07-19T16:44:00.436-07:00</updated><title type='text'>Is winning in Casino's a Bad Thing</title><content type='html'>&lt;div xmlns='http://www.w3.org/1999/xhtml'&gt;Totally off my usual topics but by playing short games in the Casinos in Vegas and quitting while ahead i'm up by just over $100. It's not a lot but considering i've only been playing 1c machines and $5 - $10 blackjack I think that's quiet cool! However since I haven't lost yet does that mean i'll now want to continue... could it get addictive. Considering i'm going to Atlantic City next week is this bad news!&lt;br/&gt;&lt;br/&gt;As for the technical note, follow the rules of blackjack on wikipedia (http://en.wikipedia.org/wiki/Blackjack) and make sure you buy in enough for 10-12 hands at minimum bet, and never bet more unless wikipedia says you should! Also when you are up, 3-4 decks worth of rounds... get out! &lt;br/&gt;&lt;br/&gt;This may not be the longest game in the world but you take the money off the Casino! &lt;br/&gt;Thank-you Venitian/Palazzo Las Vegas!&lt;br/&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-185258925070698172?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/185258925070698172/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=185258925070698172' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/185258925070698172'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/185258925070698172'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2008/07/is-winning-in-casinos-bad-thing.html' title='Is winning in Casino&amp;#39;s a Bad Thing'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-6139828693528407847</id><published>2008-07-16T13:19:00.000-07:00</published><updated>2008-07-16T13:46:45.675-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='#crigshow'/><category scheme='http://www.blogger.com/atom/ns#' term='agents'/><category scheme='http://www.blogger.com/atom/ns#' term='web services'/><title type='text'>#crigshow - Conference 2 - Worldcomp</title><content type='html'>&lt;span style="font-weight:bold;"&gt;Agents and Web Services... Why no collaboration?&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Out of all the presentations at worldcomp this one struck me as one of the most obvious but not covered areas for research in computer science. Probably the most well known agent system is that used by the travel industry where they have standard ways of interfacing with each other to find details of travel and hotels available on a global scale. This is no mean feat with the number of companies there are hooking into this network. &lt;br /&gt;&lt;br /&gt;So why doesn't the same exist for web services or if there is such a system why isn't everyone in the open community using it? &lt;br /&gt;&lt;br /&gt;Surely the point of web services is for people to discover and use them in their own scenarios just like the agents in the travel industry do. OK so maybe the problem lies in the fact that there are so many communities that there will never be a specific use case or framework and thus hosting a generic web service network becomes infinitely hard with the number of different APIs and Implementations. &lt;br /&gt;&lt;br /&gt;OK so if you are going to use Agents in Web Services what issues do you need to consider? Also what do you gain through doing this?&lt;br /&gt;&lt;br /&gt;One of the key ideas which came out of a talk at worldcomp is to use Agents to be the intelligent front to a web service. This enables an agent to track of a set of web services including information about a specific web service such as availability, versions, changing cost and and offline copy if the service allows this. So the agent becomes a Rendezvous Point for a series of web services. &lt;br /&gt;&lt;br /&gt;So why aren't we seeing more collaboration between the Agent community and the Web Services community?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-6139828693528407847?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/6139828693528407847/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=6139828693528407847' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/6139828693528407847'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/6139828693528407847'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2008/07/crigshow-conference-2-worldcomp.html' title='#crigshow - Conference 2 - Worldcomp'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-8889383351258675109</id><published>2008-07-14T13:00:00.000-07:00</published><updated>2008-07-14T13:11:44.933-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='#osdiii'/><category scheme='http://www.blogger.com/atom/ns#' term='#crigshow'/><category scheme='http://www.blogger.com/atom/ns#' term='repository'/><category scheme='http://www.blogger.com/atom/ns#' term='blackboard'/><title type='text'>#crigshow - Conference 1 - Oscelot</title><content type='html'>This open source day (#osdiii) hosted by Oscelot was an unconferene which soon became based heavily around the Blackboard platform. This was expected as the majority of people attending it were then going on to attend the BbWorld conference. With the title of the conference being Open Source and yet the main topic being that of a Closed Source product this gave an opening for the CRIG team to promote the wider Open Source community to those who are focused on Blackboard use cases. &lt;br /&gt;&lt;br /&gt;The day was a success for the team as we promoted good practices in web development, standards, resource management and the fact that the people who manage an eLearning platform has a responsibility to the content they hold. &lt;br /&gt;&lt;br /&gt;From our point of view, we discovered: If blackboard is the industry leader in learning management systems then the repository community is big problems when it comes to archiving these resources by the current methodologies each community practices. &lt;br /&gt;&lt;br /&gt;More Collaboration and Awareness please!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-8889383351258675109?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/8889383351258675109/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=8889383351258675109' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/8889383351258675109'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/8889383351258675109'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2008/07/crigshow-conference-1-oscelot.html' title='#crigshow - Conference 1 - Oscelot'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-6418976742344708314</id><published>2008-06-27T05:56:00.000-07:00</published><updated>2008-06-27T06:06:02.849-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='EPrints'/><category scheme='http://www.blogger.com/atom/ns#' term='DROID'/><category scheme='http://www.blogger.com/atom/ns#' term='OAI-ORE'/><category scheme='http://www.blogger.com/atom/ns#' term='Atom'/><category scheme='http://www.blogger.com/atom/ns#' term='OAI-PMH'/><category scheme='http://www.blogger.com/atom/ns#' term='Pronom'/><title type='text'>OAI-PMH + OAI-ORE (Atom) + Pronom Droid = Pretty</title><content type='html'>I've just finished writing a wrapper (very simple!) which takes a &lt;a href="http://www.openarchives.org/ore/"&gt;OAI-ORE&lt;/a&gt; Resource Map in Atom Format and classifies the objects which are listed in the Aggregation using the National Archives (UK) technical registry (&lt;a href="http://www.nationalarchives.gov.uk/pronom/"&gt;Pronom&lt;/a&gt;). &lt;br /&gt;&lt;br /&gt;The wrapper provides a simple front end to the &lt;a href="http://droid.sourceforge.net/wiki/index.php/Introduction"&gt;DROID tool&lt;/a&gt;, it takes an &lt;a href="http://www.openarchives.org/pmh/"&gt;OAI-PHM&lt;/a&gt; URI and requests the latest resource maps in atom format (ore-atom) and creates a list of the resources which are passed to DROID to classify directly. &lt;br /&gt;&lt;br /&gt;The wrapper requires OAI-PMH as it requests all records which have been modified since it last did a parse of the repository. This way the wrapper can be scheduled to run once a day/week/month etc. &lt;br /&gt;&lt;br /&gt;A single DROID xml file comes back as the output. &lt;br /&gt;&lt;br /&gt;This is all working with EPrints repository software currently.&lt;br /&gt;&lt;br /&gt;Next stage is to do something useful with the output xml in terms of providing useful data back to the repository manager.&lt;br /&gt;&lt;br /&gt;Total lines of source code for the wrapper: 302 :)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-6418976742344708314?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/6418976742344708314/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=6418976742344708314' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/6418976742344708314'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/6418976742344708314'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2008/06/oai-pmh-oai-ore-atom-pronom-droid.html' title='OAI-PMH + OAI-ORE (Atom) + Pronom Droid = Pretty'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-6344307351894496505</id><published>2008-06-08T04:43:00.001-07:00</published><updated>2008-06-08T05:01:23.252-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='EPrints'/><category scheme='http://www.blogger.com/atom/ns#' term='Open Repositories 2008'/><category scheme='http://www.blogger.com/atom/ns#' term='Fedora'/><category scheme='http://www.blogger.com/atom/ns#' term='Preserv'/><category scheme='http://www.blogger.com/atom/ns#' term='CRIG'/><title type='text'>Repository Software is Dead</title><content type='html'>Repository Software for digital collections as we know it supplies the complete solution to the client, thus without the software you cannot access any of the data in your repository. This is a bad thing for object reuse and digital preservation! &lt;br /&gt;&lt;br /&gt;Many people at conferences such as &lt;a href="http://or08.ecs.soton.ac.uk"&gt;Open Repositories 2008&lt;/a&gt; and from workgroups like &lt;a href="http://www.ukoln.ac.uk/repositories/digirep/index/CRIG"&gt;CRIG&lt;/a&gt; have been talking for a long while about the importance of Interoperability. However, if you get rid of the need for the interoperability and use a standard specification for accessing simple data objects (pdfs and their metadata), then you don't need interoperability! &lt;br /&gt;&lt;br /&gt;So this leads me to the fact that &lt;a href="http://www.eprints.org"&gt;EPrints&lt;/a&gt;, &lt;a href="http://www.fedora-commons.org/"&gt;Fedora&lt;/a&gt; and hopefully at some point &lt;a href="http://www.dspace.org"&gt;DSpace&lt;/a&gt; are abstracting their database and storage layers to support use of any type of storage platform. Thanks goes &lt;a href="http://www.sun.com"&gt;SUN Microsystems&lt;/a&gt; preservation action group and open storage group for pushing this work from a commercial perspective. But we need to go further than this to get rid of the need for interoperability.&lt;br /&gt;&lt;br /&gt;From &lt;a href="http://or08.ecs.soton.ac.uk"&gt;Open Repositories 2008&lt;/a&gt;, myself and a college &lt;a href="http://oxfordrepo.blogspot.com/"&gt;Ben O'Steen&lt;/a&gt; from Oxford University proved how &lt;a href="http://www.ope anarchives.org/ore/"&gt;OAI-ORE (OAI specification for Object Reuse and Exchange)&lt;/a&gt; can be used to enable high level repository interoperability. &lt;a href="http://www.preserv.org.uk/?page=oai-ore"&gt;This work won us $5000&lt;/a&gt; but more importantly got the community thinking about the true power of a specification like OAI-ORE. Ben and I are now hoping to push this work down to the low level storage such that the objects within an ORE map (documents and metadata) can be directly referenced without the need for the current repository layer. For this to happen &lt;b&gt;all objects need to be stored in their simplest form - NO WRAPPER FORMATS ALLOWED at the lowest level&lt;/b&gt;. &lt;br /&gt;&lt;br /&gt;From recent talks with Sandy Payette and Les Carr (Fedora and EPrints respectively) I am envisaging that the current repository software becomes classified as repository service software which is able to manage low level objects but is not specifically required to access these objects. So current services which plug into the repository software can act directly on the objects.&lt;br /&gt;&lt;br /&gt;A couple of problems to solve, security and consistency of cached data. All especially applicable if you have more than one piece of repository service software modifying your objects.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-6344307351894496505?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/6344307351894496505/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=6344307351894496505' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/6344307351894496505'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/6344307351894496505'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2008/06/repository-software-is-dead.html' title='Repository Software is Dead'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-8557063529511560836</id><published>2008-06-08T04:35:00.000-07:00</published><updated>2008-06-08T04:43:01.445-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='iedemonstrator'/><category scheme='http://www.blogger.com/atom/ns#' term='CRIG'/><title type='text'>CRIG / IEDemonstator After Thoughts</title><content type='html'>IEDemonstrator is a really bad name for a project as it just says Microsoft to me but I'm fairly it isn't anything to do with that most stable of web browsers.&lt;br /&gt;&lt;br /&gt;From the workshop it has become clear to me that discussing a specification for service interaction globally is going to be impossible. This could be due to the fact that SOAP did such a good job of it and no one wants to use anything else (enough sarcasm??). I think many people left the workshop with a much better idea at how HTTP error codes (which have been around years) already go most of the way to solving a web service model. We also realised quickly that any specification would have to be built specifically for pay services (e.g. make use of the 402 code), this would then encourage companies/institutions to supply reliable services which last more than 4 years (cough AHDS cough).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-8557063529511560836?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/8557063529511560836/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=8557063529511560836' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/8557063529511560836'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/8557063529511560836'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2008/06/crig-iedemonstator-after-thoughts.html' title='CRIG / IEDemonstator After Thoughts'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4926451824261299693.post-1214878339545743666</id><published>2008-06-06T04:43:00.000-07:00</published><updated>2008-06-08T04:35:42.631-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='iedemonstrator'/><category scheme='http://www.blogger.com/atom/ns#' term='CRIG'/><title type='text'>First Post - CRIG DRY Workshop</title><content type='html'>Well there's a surprise!&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.ukoln.ac.uk/repositories/digirep/index/CRIG_DRY_Workshop"&gt;CRIG DRY Workshop&lt;/a&gt; in Bath is where I am now. So what's happening:&lt;br /&gt;&lt;br /&gt;People have been talking about services and proposed projects to provide authoritative and complete services to users/agents/repositories. A couple of themes have come out morning session for me:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.w3.org/2004/02/skos/"&gt;SKOS: &lt;/a&gt; A lot of projects (incl. &lt;a href="http://lcsh.info"&gt;Library of Congress&lt;/a&gt;) are using this RDF language to describe subject and properties. Each provides access to this information in so many different ways it is hard to see how to interact in a constant manor.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Service Interaction&lt;/b&gt; (read on as the name is not that descriptive)&lt;br /&gt;&lt;br /&gt;This moves us on from the Open Storage stuff i've been working on (again more later in another blog post) into how we facilitate the use of services and discover how to interact with these services. We are pushing for the use of http codes! CRIG it. &lt;br /&gt;&lt;br /&gt;Tis it for now....&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/4926451824261299693-1214878339545743666?l=davetaz-blog.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://davetaz-blog.blogspot.com/feeds/1214878339545743666/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=4926451824261299693&amp;postID=1214878339545743666' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/1214878339545743666'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4926451824261299693/posts/default/1214878339545743666'/><link rel='alternate' type='text/html' href='http://davetaz-blog.blogspot.com/2008/06/first-post-crig-dry-workshop.html' title='First Post - CRIG DRY Workshop'/><author><name>davetaz</name><uri>http://www.blogger.com/profile/08986118073170254013</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
