OASIS Mailing List ArchivesView the OASIS mailing list archive below
or browse/search using MarkMail.

 


Help: OASIS Mailing Lists Help | MarkMail Help

regrep message

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]


Subject: RE: [regrep] [Namespace Manager Proposal] FW: XMLNDR [Broken URL] RE: XMLNDR PURLs, Handles, URLs, URNs & Namespaces


Link is now restored:

http://lists.oasis-open.org/archives/regrep/200201/msg00061.html

Kind Regards,
Joseph Chiusano
Booz Allen Hamilton
O: 703-902-6923
C: 202-251-0731
Visit us online@ http://www.boozallen.com
 

> -----Original Message-----
> From: Robin Cover [mailto:robin@oasis-open.org] 
> Sent: Saturday, September 03, 2005 9:09 PM
> To: Chiusano Joseph
> Cc: ebXML Regrep
> Subject: Re: [regrep] [Namespace Manager Proposal] FW: XMLNDR 
> [Broken URL] RE: XMLNDR PURLs, Handles, URLs, URNs & Namespaces
> 
> On Fri, 2 Sep 2005, Chiusano Joseph wrote:
> 
> > Back in January 2002 I submitted a proposal to our list for a 
> > "Namespace Manager" function. The archive link is forever gone, and 
> > Robin Cover asked me to send him a summary of the 
> highlights of that 
> > proposal. I am sending this to refresh our collective 
> memories of this 
> > proposal, as we look toward version 4.0 (hint hint;)
> > 
> > Ironically (as you can see from the right side of the 
> e-mail subject) 
> > I sent the archive URL in support of a discussion on persistent 
> > identifiers on the "U.S. Federal XML Naming and Design Rules" 
> > listserv, and subsequently discovered that it was a broken link.:p
> > 
> > To those in the U.S.: Happy Labor Day weekend.
> > 
> > Joe
> > 
> > Joseph Chiusano
> > Booz Allen Hamilton
> > O: 703-902-6923
> > C: 202-251-0731
> > Visit us online@ http://www.boozallen.com
> [... see 
> http://lists.oasis-open.org/archives/regrep/200509/msg00010.html ]
> 
> Joe and TC,
> 
> 1) It does appear that some 2000-2500 messages (est.) are now missing
>    from the TC's email list archive, for years 2002 and earlier
> 
> 2) I have communicated my concern to the OASIS administration, hopeful
>    that we can have the resources restored to their canonical URIs
>    within a short time.  Subsequent to posting my note referenced in
>    "3)", I observed that at least some of the messages are stored
>    on the WayBack machine, including Joe's lost posting from 
> the regrep
>    archive
> 
> 3) I have conveyed a message of [unofficial] apology to the 
> FED-XML-NDR
>    community within which this discovery of AWOL resources was
>    discovered:
> 
>    
> https://fed-xml-ndr.core.gov/servlets/ReadMsg?list=listserv&msgNo=449
> 
> 4) In principle, I think OASIS is committed to the practice of
>    "not breaking links" to resources for which the organization has
>    published dereferenceable URIs.  What may be needed is a policy
>    which implements, in tangible business terms, a strategy to
>    ensure that links are not broken, to monitor the websites for
>    resources gone AWOL, and to repair inadvertently broken links.
> 
> 5) The larger discussion about "broken links" on the FED-XML-NDR list
>    prompted me to write up a renewed appeal for technologists to
>    insist upon instititional commitments to the preservation of
>    public resources and the preservation of links to resources
>    identified with dereferenceable URIs.
> 
>    I append a portion of this note below, asking for the participation
>    of members on the ebXML Regrep TC, in any forum where you
>    participate, in raising consciousness about the role (I believe)
>    we play in correcting this widespread problem of resource
>    deterioration.  The deplorable situation will change, it 
> seems to me,
>    only when all of us demand commitment, policy, and vigilance on the
>    part of jurisdictions and institutions responsible for persistent
>    resources and links.
> 
>    If the perspective adopted in the note below is defensible (you
>    be the judge! - but it's optional reading), there is fundamentally
>    no excuse, in mid 2005, for broken links.  Radically revising our
>    common expectations and calling public institutions to account for
>    data loss are probably prerequisite to seeing substantial changes.
>    Shrugging our shoulders and saying "That's the Web" is no solution.
> 
> Thanks,
> 
> Robin Cover
> 
> ================= Note on resource "persistence" follows:
> 
> [...]
> 
> As a matter of personal judgment, I believe there is now 
> consensus technical opinion that accessible Web technologies, 
> informed by sound principles of Web Architecture [2], are now 
> *more than adequate* to support both resource and link 
> persistence: institutions (as URI owners) that exhibit, 
> through public behavior, doubt about the adequacy of this 
> available technology, are apparently:
> 
> a) uninformed [e.g., unaware that HTTP and free server software
>    provide the means to manage resource/link persistence]
> b) unwilling to commit (axiologically) to core principles and
>    values about resource/link persistence
> c) unwilling to commit resources to ensure resource/link
>    persistence
> d) incompetent in one or more ways possibly related to "a)" - "c)"
> e) did I miss anything in a-e ?  Maybe also: "inattentive"
>    as a sub-species of a-c
> 
> Although the terminology used to describe URI persistence is 
> a bit problematic sometimes [3], I believe there is now 
> general agreement with principles articulated in the W3C Web 
> Architecture document, viz.,
> 
>   [We share a] "social expectation that once a URI identifies
>   a particular resource, it should continue indefinitely to
>   refer to that resource."
> 
> combined with resource persistence (a dereferenced URI gets 
> you the resource consistently and predictably):
> 
>   "A URI owner SHOULD provide representations of the identified
>   resource consistently and predictably."
> 
> These values and principles were expressed in a poorly-titled 
> [4] document "Cool URIs don't change", written in 1998 [5].
> 
> The core assertion is captured in the first forty-eight words 
> of that 1998 TBL memo:
> 
>   "What makes a cool URI?
>   A cool URI is one which does not change.
>   What sorts of URI change?
>   URIs don't change: people change them.
>   There are no reasons at all in theory for people to change
>   URIs (or stop maintaining documents), but millions of
>   reasons in practice."
> 
> The plausible excuses for breaking links commonly offered by 
> people in 1998 are now, in 2005, simply lame excuses:
> 
> * "We just reorganized our website to make it better."
> * "We have so much material that we can't keep track of what is
>    out of date and what is confidential and what is valid and
>    so we thought we'd better just turn the whole lot off."
> * "Well, we found we had to move the files..."
> * "We used to use a cgi script for this and now we use a binary
>    program."
> * "I didn't think URLs have to be persistent - that was URNs."
> * "We would like to, but we just don't have the right tools."
> 
> Since 1998, web resources, tools, and technologies have 
> improved dramatically, making it easier than ever to support 
> the preservation and integrity of Web-accessible resources.
> 
> * the cost of disk storage has plummeted faster than for
>   semiconductors, according to Kryder's Law ("Moore's Law for
>   Storage"), -- the "doubling of processor speed every 18 months
>   is a snail's pace compared with rising hard-disk capacity"
>   One can get a 300 gigabyte Barracuda for just $150 [6].
> 
> * HTTP (1.0, 1.1) [7] is now widely supported by open source (free,
>   publicly available) server software which provides web site
>   administrators (as URI owners) with rich tools for managing
>   predictable, stable, and "indefinite/persistent" support
>   for published URIs, even in the face of moving web sites
>   or otherwise re-architecting enterprise web server
>   topologies.  See for example the free Apache Server [8], which
>   has binaries for UNIXes and Win32, with support for rule-based
>   URL rewriting, mapping URLs to filesystem locations, content
>   negotiation, use of virtual hosts, etc. [9]
> 
> * generic web analysis and monitoring tools now make it easy to
>   detect the existence of broken links; their statistical modules
>   help interpret the results for web site administrators, providing
>   early notification when something has gone wrong
> 
> In the context of this FED-XML-NDR discussion, within earshot 
> of some very important (government) policy-makers, I am 
> thrilled to read statements of general concern about "Broken 
> URLs."  It's a plague and a scourge upon the Web, as is 
> commonly lamented.
> Part of the solution, I believe, is to reshape public 
> expectations based upon the conviction that there are NO (or 
> few) valid excuses for breaking links.  Institutions that 
> elect to use BAD software ('broken as designed') which cannot 
> even be configured to maintain persistent links/resources 
> should be called to public account.  Institutions 
> (jurisdictions) that demonstrate negligible commitment to 
> preservation of resources and links should likewise be called 
> to public account.  IMO.
> 
> In addition to individual and institutional commitment to 
> resource persistence, it will help to design architectures 
> within which the preservation of resources and links is 
> consciously planned for, and optimized, rather than ignored.  
> I'm happy for Owen Ambur's statements of support, and for 
> Todd Vincent's words, "The system I have been describing 
> accomplishes this requirement." [10]
> 
> As as aside, I have noted the increasing popularity of RDDL 
> (Resource Directory Description Language) as a method of 
> providing something informative when designers want to create 
> http (scheme) URI namespace names that do not directly 
> resolve under HTTP/DNS to the XML Schema (resource). The RDDL 
> "namespace document" is a simple [enhanced] XHTML document 
> that lives at the end of a dereferenced http URI namespace, 
> convening information about the nature and purpose(s) of the 
> resource associates with the namespace URI, including 
> typically, (URI-reference) hyperlinks to schemas, 
> specifications, editor contacts, etc.
> 
> See http://xml.coverpages.org/rddl.html
> 
> Thanks,
> 
> Robin Cover
> [speaking only personally and unofficially] XML Cover Pages 
> http://xml.coverpages.org/
> 
> * please send corrections to this message via email:
>   robin@oasis-open.org
> 
> ========= References:
> 
> [1] 
> https://fed-xml-ndr.core.gov/servlets/ReadMsg?list=listserv&msgNo=448
>     
> https://fed-xml-ndr.core.gov/servlets/ReadMsg?list=listserv&msgNo=445
>     
> https://fed-xml-ndr.core.gov/servlets/ReadMsg?list=listserv&msgNo=442
> 
>     
> https://fed-xml-ndr.core.gov/servlets/ReadMsg?list=listserv&msgNo=443
> 
>     "I just found out that the URL I provided is now broken;
>     I will try and find out from OASIS how to retrieve it from
>     their architectures (it's broken in the OASIS ebXML
>     Registry listserv public archives as well). Sorry for the
>     inconvenience."
> 
> [2] URI persistence
>     Architecture of the World Wide Web, Volume One
>     W3C Recommendation 15 December 2004
>     http://www.w3.org/TR/webarch/#URI-persistence
> 
> [3] "persistence" in connection with "link", "URI", "resource",
>     and "representation" is potentially a confusing term because
>     it does not clearly specify what "persists."  The phrases
>     "stability," "predictability," and "consistent" are used
>     in the W3C Web Architecture document to address the notion
>     of persistence. "Policy and commitment on the part of the
>     URI owner" are also foundational concepts.  Excerpt:
>     
>     "confidence in interactions via the Web depends on stability
>     and predictability. For an information resource, persistence
>     depends on the consistency of representations...Although
>     persistence in this case [Representation Management] is
>     observable as a result of representation retrieval, the term
>     URI persistence is used to describe the desirable property
>     that, once associated with a resource, a URI should continue
>     indefinitely to refer to that resource...
> 
>     Good practice: Consistent representation. A URI owner SHOULD
>     provide representations of the identified resource
>     consistently and predictably.
>     
>     URI persistence is a matter of policy and commitment on the
>     part of the URI owner...
>     
>     HTTP [RFC 2616] has been designed to help manage URI persistence.
>     For example, HTTP redirection (using the 3xx response codes)
>     permits servers to tell an agent that further action needs to
>     be taken by the agent in order to fulfill the request
>     (for example, a new URI is associated with the resource)...
>     In addition, content negotiation also promotes consistency, as
>     a site manager is not required to define new URIs when adding
>     support for a new format specification
> 
>     For more discussion about URI persistence, see "Cool URIs
>     don't change" 1998, by Tim BL [at]
>     http://www.w3.org/Provider/Style/URI.html 
> 
>     A "consistent representation" that provides stability and
>     predictability for a URI-dereferenceable resource means
>     not only indefinite association of a URI with a resource,
>     but indefinite support for retrieval via dereferencing.
> 
>     In this context we are not concerned with URIs that are
>     not initially intended by URI owners as identifiers for
>     dereferenceable resources, per:
>     http://www.w3.org/TR/webarch/#representation-management
>     Representation Management: "Just because representations
>     are available does not mean that it is always desirable
>     to retrieve them..."
> 
> [4] the authors of the W3C Web Architecture document
>     acknowledge that the title "Cool URIs don't change"
>     is infelicitous: "Note that the title is somewhat misleading.
>     It is not the URIs that change, it is what they identify."
>     http://www.w3.org/TR/webarch/#Cool
> 
> [5] "Cool URIs don't change" 1998, by Tim BL
>     http://www.w3.org/Provider/Style/URI.html
> 
> [6] cost of disk/mass storage
> http://en.wikipedia.org/wiki/Kryder%27s_Law
> http://en.wikipedia.org/wiki/Moore's_law
> http://www.storagereview.com/
> 
> ST3300831AS 300GB Barracuda  $150 [2005-09-03, online] 
> 8-gigabyte 1-inch microdrives 60-gigabyte 1.8-inch Slim Bling 
> hard drive
> 
> http://tinyurl.com/extmh
> Kryder's Law, by Chip Walter
> The doubling of processor speed every 18 months is a snail's 
> pace compared with rising hard-disk capacity, and Mark Kryder 
> plans to squeeze in even more bits... By 1998, when Kryder 
> joined Seagate to form its advanced research center, the DSSC 
> had set an even loftier target: crowd 100 gigabits into a 
> square inch by the early 21st century. In 2005, just seven 
> years later, Seagate began shipping 110-gigabit drives. 
> Inside of a decade and a half, hard disks had increased their 
> capacity 1,000-fold, a rate that Intel founder Gordon Moore 
> himself has called "flabbergasting."
> 
> [7] http://www.ietf.org/rfc/rfc2616.txt
>     http://www.w3.org/1999/07/HTTP-PressRelease  July 07, 1999
>    "World Wide Web Consortium Supports HTTP/1.1 Reaching IETF 
> Draft Standard"
>  
> 
> [8] http://httpd.apache.org/
>     Apache HTTP Server Project "The Number One HTTP Server on 
> the Internet"
> 
>     The Apache HTTP Server Project is a collaborative software
>     development effort aimed at creating a robust,
>     commercial-grade, featureful, and freely-available source
>     code implementation of an HTTP (Web) server... Apache has
>     been the most popular web server on the Internet since
>     April of 1996. The February 2005 Netcraft Web Server Survey
>     found that more than 68% of the web sites on the Internet
>     are using Apache, thus making it more widely used than all
>     other web servers combined."
> 
> [9] http://httpd.apache.org/docs/2.0/misc/rewriteguide.html
>     http://httpd.apache.org/docs/2.0/mod/mod_rewrite.html
>     URL Rewriting Guide
>     
>     A versatile and powerful set of tools for: forcing the use of
>     canonical hostnames; creating a homogeneous and consistent
>     URL layout over all WWW servers on a Intranet webcluster;
>     redirecting *just* all homedirs on one webserver to another
>     webserver; redirect homedir URLs to another webserver when
>     the requesting user does not stay in the local domain;
>     filesystem reorganization; redirecting failing requests on
>     webserver A to webserver B; extended redirection supporting
>     character escaping mechanisms; time-dependent rewriting;
>     load balancing; on-the-fly content-regeneration; seamless
>     transformation from static to dynamic; mass virtual hosting;
>     use of an external rewriting engine; etc
>     
>     http://httpd.apache.org/docs/2.0/content-negotiation.html
>     Content Negotiation
>     
>     http://httpd.apache.org/docs/2.0/urlmapping.html
>     Mapping URLs to Filesystem Locations
>     
>     http://httpd.apache.org/docs/2.0/vhosts/
>     Apache Virtual Host documentation
> 
> [10] statements from Owen Ambur and Todd Vincent
> https://fed-xml-ndr.core.gov/servlets/ReadMsg?list=listserv&msgNo=444
> https://fed-xml-ndr.core.gov/servlets/ReadMsg?list=listserv&msgNo=446
> 
> 
> 


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] | [List Home]