Network Working Group                                  J. Foster, Editor
Request for Comments: 1689             University of Newcastle upon Tyne
RARE Technical Report: 13                                    August 1994
FYI: 25
Category: Informational


                            A Status Report
                                   on
           Networked Information Retrieval: Tools and Groups


     Produced as a collaborative effort by the Joint IETF/RARE/CNI
        Networked Information Retrieval - Working Group (NIR-WG)

Status of this Memo

   This memo provides information for the Internet community.  This memo
   does not specify an Internet standard of any kind.  Distribution of
   this memo is unlimited.

Abstract

   The purpose of this report is to increase the awareness of Networked
   Information Retrieval by bringing together in one place information
   about the various networked information retrieval tools, their
   developers, interested organisations, and other activities that
   relate to the production, dissemination, and support of NIR tools.
   NIR Tools covered include Archie, WAIS, gopher and World Wide Web.

Table of Contents

    1.   Introduction ..............................................   2
    2.   How the information was collected .........................   3
    3.   What is covered? ..........................................   3
    4.   Updating information ......................................   5
    5.   Overview of the types of NIR Tool .........................   5
    6.   NIR Tools .................................................   9
    7.   NIR Groups ................................................ 123
    8.   Security Considerations ................................... 180
    9.   Acknowledgements .......................................... 180
   10.   Author's Address .......................................... 180
   11.   Appendix A: NIR Tool Template ............................. 181
   12.   Appendix B: NIR Group Template ............................ 188
   13.   Appendix C: Email Lists and Newsgroups .................... 192
   14.   Appendix D: Coming Attractions ............................ 207
   15.   Appendix E: Extinct Critters (Tools) ...................... 222
   16.   Appendix F: Extinct Critters (Groups) ..................... 222


Foster                                                          [Page 1]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


1.   Introduction

   As the network has grown, along with it there has been an increase in
   the number of software tools and applications to navigate the network
   and make use of the many, varied resources which are part of the
   network.  Within the past two and a half years we have seen a
   widespread adoption of tools such as the archie servers, the Wide
   Area Information Servers (WAIS), the Internet gopher, and the
   Worldwide Web (WWW).  In addition to the acceptance of these tools
   there are also diverse efforts to enhance and customise these tools
   to meet the needs of particular network communities.

   There are many organisations and associations that are focusing on
   the proliferating resources and tools for networked information
   retrieval (NIR).  The Networked Information Retrieval Group is a
   cooperative effort of three major players in the field of NIR: The
   Internet Engineering Task Force (IETF), the Association of European
   Research Networks (RARE) and the Coalition for Networked Information
   (CNI), specifically tasked to collect and disseminate information
   about the tools and to discuss and encourage cooperative development
   of current and future tools.

   The purpose of this report is to increase the awareness of NIR by
   bringing together in one place information about the various
   networked information retrieval tools, their developers, interested
   organisations, and other activities that relate to the production,
   dissemination, and support of NIR tools.  The intention is to make
   this a "living document".  It will be held on-line so that each
   section may be updated separately as appropriate.  In addition, it is
   intended that the full document will be updated once a year so that
   it provides a "snapshot" report on activities in this area.

   Whilst the NIR tools in this report are being used on a wide variety
   of information sources including files and databases there remains
   much that is currently not accessible by these means.  On the other
   hand, the majority of the NIR Tools described here are freely
   available to the networked Research and Education community.  Tools
   for accessing specialised datasets are often only available at a
   cost.

   It should be noted that in many ways networked information retrieval
   is in its infancy compared with traditional information retrieval
   systems.  Thesaurus construction, boolean searching and
   classification control are issues which are under discussion for the
   popular NIR Tools but as yet are not in widespread use.  However it
   should be said that, with the vast amount of effort that is currently
   going into the NIR field, rapid progress is being made.  Much work is
   currently being done on expanding some of the NIR tools to include


Foster                                                          [Page 2]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


   handling of multimedia information services.  Progress has also been
   made in the discussions on classifying and cataloguing electronic
   information resources.

2.  How the information was collected

   The information contained in this report was collected over the
   network from the contacts for each NIR Tool or Group using two
   templates:

     - the NIR Tool Template, included in Appendix A;
     - the NIR Group Template, included in Appendix B.

   The contents of these templates were discussed by the NIR WG in
   Boston (July, 1992) and subsequently on the email list.  (See the
   Section on the NIR-WG for details of how to join this mailing list.)
   The initial draft report was discussed at the NIR Working Group in
   Washington (November, 1992) and updated and added to at subsequent WG
   meetings.  Before the final submission as an RFC the individual
   templates were reviewed by independent reviewers from around the
   world.  Their efforts are acknowledged in Section 9.

   The NIR Tool template was used to collect the information necessary
   to identify and track the development of networked information
   retrieval tools.  This template asked for information such as how and
   where to get the software for each NIR Tool, documentation,
   demonstration sites, etc.  The main part of the template has been
   completed by the main individual responsible for the tool.  Sections
   of the template (e.g., on clients) may have required completion by
   others.

   The NIR Group template requested information on the aim and purpose
   of the group, the current tasks being undertaken, mailing lists,
   document archives, etc.

3.  What is covered?

   In the current report you will find information on the following NIR
   tools:

      Alex
      archie
      gopher
      Hytelnet
      Netfind
      Prospero
      Veronica
      WAIS  (including freeWAIS)


Foster                                                          [Page 3]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


      WHOIS
      World Wide Web  (including MOSAIC)
      X.500 White Pages

      Appendix D covers "Forthcoming Attractions":
         Hyper-G
         Soft Pages
         WHOIS++

 and the following NIR Groups:

      CNI          Coalition for Networked Information (CNI)
                   Architectures and Standards
                   Directories and Resource Information Services
                   TopNode for Networked Information Resources,
                                                  Services and Tools

      CNIDR        Clearinghouse for Networked Information Discovery
                                                       and Retrieval

      IETF         Integrated Directory Services (IDS)
                   Integration of Internet Information Resources (IIIR)
                   Networked Information Retrieval (NIR)
                      joint IETF/RARE WG
                   Network Information Services Infrastructure (NISI)
                   OSI-Directory Service (OSI-DS)
                   Uniform Resource Identifiers (URI)
                   Whois and Network Information Lookup Service (WNILS)

      IRTF         Internet Research Task Force Research Group on
                     Resource Discovery and Directory Service (IRTF-RD)

      NISO         Z39.50 Implementors Group

      RARE         Information Services and User Support Working Group
                     (ISUS)

      USMARC/OCLC  USMARC Advisory Group; OCLC Internet Resources
                        Cataloging Experiment (USMARC/OCLC)

   Appendix C contains a list of the relevant email lists and Appendix D
   contains information on "Coming Attractions" which are NIR tools not
   yet in widespread use.


Foster                                                          [Page 4]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


4.  Updating Information

   Updates on and additions to the information contained in this report
   are welcome. CNIDR have agreed to host the report and to accept
   updates to individual templates from the template maintainers.  Send
   updates using the appropriate template (from Appendix A or Appendix B
   of this report) to:

   nir-updates@cnidr.org

   The current templates and this report may be retrieved from the UK
   Mailbase Server:

   Via anonymous ftp (use your email address as the password):

     URL: ftp://mailbase.ac.uk/pub/lists/nir/files/tool.template
     URL: ftp://mailbase.ac.uk/pub/lists/nir/files/group.template
     URL: ftp://mailbase.ac.uk/pub/lists/nir/files/nir.status.report

   or via gopher or World Wide Web to mailbase.ac.uk

   or via email:

     Mail to:  mailbase@mailbase.ac.uk

  Text of the message:

     send nir tool.template
     send nir group.template
     send nir nir.status.report

5.  Overview of the types of NIR Tools

   The following is an overview of major networked information retrieval
   (NIR) tools available on the Internet.  There are many excellent
   books which discuss the Internet and NIR Tools in detail.  Such books
   include "The Whole Internet User's Guide and Catalog" by Ed Krol and
   published by O'Reilly and Associates, Inc and "The Internet Guide for
   New Users" by Daniel Dearn and published by Meckler.

   The number of these NIR tools is large and growing quickly.  Certain
   techniques reappear regularly and seemingly different tools may
   perform similar tasks, allowing a simple classification of projects
   encompassing most of the existing tools and services.


Foster                                                          [Page 5]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


   The classification presented here is only one possible ordering.  The
   goal is to define in broad outlines what can be done with particular
   tools, realizing that users will always find novel unanticipated ways
   of applying them.

   Interactive Information Delivery Services (Gopher, World Wide Web)

      Basic Internet services such as electronic mail and anonymous FTP
      can be used to share information across the Internet, but neither
      allows simple browsing and neither is particularly easy for the
      newcomer to learn to use.  Gopher and the World Wide Web (W3) are
      two recent developments that attempt to make it easier to
      distribute information over the Internet.  Both allow the user to
      browse information across the network without the necessity of
      logging in or knowing in advance where to look for information.

      The Gopher project was first developed at the University of
      Minnesota to provide a simple campus-wide on-line information
      system.  Gopher represents information as a simple hierarchy of
      menus and files.  It has limited capability to recognize different
      types of files, allowing, for example, the display of selected
      types of image files.  Gateways to other services are provided
      (usually in a manner that is transparent to the user).  The
      underlying Gopher protocol is simple, and has facilitated the
      creation of freely available clients for use on a variety of
      hardware platforms and operating systems.  The more recent Gopher+
      protocol adds the ability to provide documents in alternate forms
      (PDF, PostScript, RTF, Word).  These features and the ease of
      installing and administering gopher servers has led to an
      explosive growth of gopher sites since its initial deployment.  As
      of November 1993, there were over 2200 known servers.

      World Wide Web relies on hypertext; formatted documents are
      displayed, and hypertext links within the document can be selected
      to travel from the current document to another.  W3 allows a user
      to annotate documents (using hypertext links), provides gateways
      to other services, and has multimedia support (for example, on
      appropriate hardware platforms it can intermix text and images in
      a displayed document).  There is a range of free W3 clients,
      supporting many environments.  World Wide Web was originally
      developed at CERN for the High Energy Physics Community.

      Gopher and WWW share a maintenance problem in that there is no
      automated way to update links to other documents when those
      documents are moved or removed.


Foster                                                          [Page 6]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


   Directory Services (WHOIS, X.500)

      Directory Service tools are intended to provide a lookup service
      for locating information about users (often referred to as White
      Pages), or services and service providers (Yellow Pages).  For
      example, a White Pages service might be used to locate an
      electronic mail address, given a name and organization, while a
      Yellow Pages service could be used to locate an online library
      catalog or file archive site.

      One of the first directory services deployed on the Internet was
      WHOIS, a simple White Pages service created to track key network
      contacts for the early DARPA-sponsored incarnation of the
      Internet.  A number of sites currently operate WHOIS servers,
      based on a range of extensions and enhancements to the original
      model.  WHOIS enjoys the advantages of simplicity and the presence
      of WHOIS client software on a preponderance of Internet-connected
      hosts.  Work is underway on a more powerful protocol, known as
      WHOIS++, which is backwards-compatible with WHOIS.

      The X.500 Directory Service is a much more ambitious Directory
      project that has been under development for a number of years
      under the aegis of ISO/OSI.  Implementations, concerned primarily
      with White pages services, are available in the public domain and
      from commercial sources.  There are LDAP based X.500 clients
      available for most major platforms, as well as a LDAP based gopher
      gateway to X.500.

      Despite years of effort, there is still no single White Pages
      Directory Service for the entire Internet; Yellow Pages services
      remain even less well developed and deployed.  The cost of setting
      up the service is one obstacle; maintaining the required databases
      is even more daunting.

   Indexing Services (archie, Veronica, online library catalogs)

      There are several Internet-based projects that build indexed
      catalogs of information to facilitate searching and retrieval.
      The first such services provided network access to library card
      catalogs, with more recent projects indexing network-based
      information.

   archie:

      The archie service began as a simple project to catalog the
      contents of hundreds of ftp-accessible online file archives.  The
      archie service gathers location information, name, and other
      details describing such files and creates an index database.


Foster                                                          [Page 7]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


      Users can contact an archie server and search this database for
      files they require.

      The archie service is accessible through a range of access
      methods, including telnet, stand-alone client programs running on
      a user's own machine, gopher, WWW, or via electronic mail.  The
      initial implementation of archie tracks over 2,100,000 filenames
      on over 1,200 sites around the world (as of November 1993).  There
      are about 30 (geographically distributed) archie servers.  Both
      commercial and freely available versions of the archie client
      software are available.

      Work continues on extending the archie service to provide
      additional types of information.  The latest version is being used
      to provide a prototype Yellow Pages service and directories of
      online library catalogs and electronic mailing lists.

   Veronica:

      Veronica arose as an attempt to do for the world of Gopher what
      archie did for the world of ftp.  A central server periodically
      scans the complete menu hierarchies of Gopher servers appearing on
      an ever-expanding list (over 2000 sites as of November 1993).  The
      resulting index is provided by a veronica server and can be
      accessed by any gopher client.

   Online library catalogs:

      A large number of libraries make their computerized library
      catalogs available over the Internet.  Most are available through
      telnet sessions in which the user connects to a specific address
      and logs in using a specific login name.  Some are also available
      through other tools, such as Gopher.

   Text-based Indexing Services (WAIS)

   WAIS:

      Wide Area Information Servers (WAIS) is a system for indexing and
      serving information in a network-based environment.  It is
      distinct from indexing tools such as archie and veronica in that
      it is used to index text-based target documents on a server, as
      well as descriptions of the contents of a server.

      A WAIS server allows the administrator to set up an index of the
      documents (or resources) to be published.  The user employs a WAIS
      client to attach to a particular WAIS server, and specifies a
      search pattern which is matched against the server's index.  In


Foster                                                          [Page 8]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


      early WAIS clients, searches are specified as simple natural-
      language queries; common ("stop") words are removed, and Boolean
      "ORs" are implicitly added between the remaining list of words.
      Matching documents are rank-ordered according to a simple
      statistical weighting scheme which attempts to indicate likely
      relevance.  The user may choose to view selected documents, or
      further refine the search.  The results of one search may be used
      to successively refine future searches ("relevance feedback").
      Gopher clients can also access WAIS servers via a transparent
      gateway.

      Both freely available and commercial versions of WAIS servers and
      clients are available.  Current work is attempting to add Boolean
      expressions and proximity and field specifications to queries.

      There are currently (as of November 1993) some 500 registered WAIS
      databases with an estimated 2000 additional databases that are not
      yet registered.  There are approximately another 100 commercial
      WAIS databases.

6.  NIR Tools

   This section contains detailed information about the various NIR
   Tools.  It is ordered alphabetically.

 =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=

 ALEX

 Date template updated or checked:  19th March, 1994
 By: Name:             Vincent Cate
     Email address:    vac@cs.cmu.edu

 ----------------------------------------------------------------------

 NIR Tool Name:      Alex

 Brief Description of Tool:

   OVERVIEW:

      The Alex filesystem provides users and applications transparent
      read access to files in anonymous FTP sites on the Internet.
      Today there are thousands of anonymous FTP sites with a total of a
      few millions of files and roughly a terabyte of data.  The
      standard approach to accessing these files involves logging in to
      the remote machine.  This means that an application can not access
      remote files like local files.  This also means that users do not


Foster                                                          [Page 9]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


      have any of their aliases or local tools available.  Users who
      want to use an application on a remote file first have to manually
      make a local copy of the file.  There is no mechanism for
      automatically updating this local copy when the remote file
      changes.  The users must keep track of where they get their files
      from and check to see if there are updates, and then fetch these.
      In this approach many different users at the same site may have
      made copies of the same remote file each using up disk space for
      the same data.

      Alex addresses the problems with the existing approach while
      remaining within the existing FTP protocol so that the large
      collection of currently available files can be used.  To get
      reasonable performance long term file caching is used.  Thus
      consistency is an issue.  Traditional solutions to the cache
      consistency problem do not work in the Internet FTP domain:
      callbacks are not an option as the FTP protocol has no provisions
      for this and polling over the Internet is slow.  Therefore, Alex
      relaxes file cache consistency semantics, on a per file basis, and
      uses special caching algorithms that take into account the
      properties of the files and of the network to allow a simple
      stateless filesystem to scale to the size of the Internet.

   USER'S VIEW:

      To a user or application, Alex is just a normal filesystem.  Any
      command that works on local files will work on Alex files.  Since
      Alex is a real filesystem, nothing needs to be recompiled and no
      libraries are changed.  Thus, users can apply all of their
      existing skills and tools for using files.

      The user sees a filesystem with a hierarchical name space.  At the
      top level (/alex) there are top-level Internet domains like "edu",
      "com", "uk", and "jp".  Each component of the hostname becomes a
      directory name. Then the remote path is added at the end.  If the
      user does a "ls /alex/edu/berkeley" he sees some machine names
      such as "ucbvax" and "sprite" and some directories on
      berkeley.edu.  From the "ls" it is not clear what is where.  The
      user may or may not be aware of host boundaries.

   INFORMATION PROVIDER'S VIEW:

      Alex is implemented as a user level NFS server.  NFS was chosen
      because it makes it easy to add Alex to a wide range of machines.
      Most machines can simply use the mount command.


Foster                                                         [Page 10]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


      The model of usage is that there is one Alex server running at
      each institution (though this is not required in any way).  Users
      mount the local server which caches files for users at that site.

      Any information put into any anonymous FTP site becomes available
      via Alex.

 ----------------------------------------------------------------------

 Primary Contact(s):

  Name:                 Vincent Cate

  Email address:        vac@cs.cmu.edu

  Postal Address:       School of Computer Science
                        5000 Forbes Ave.
                        Pittsburgh PA, 15213

  Telephone:            +1-412-268-3077

  Fax:                  +1-412-681-1998

 ----------------------------------------------------------------------

 Help Line:

  At this time Alex is a one person project (Vince).

 ----------------------------------------------------------------------

 Related Working Groups:

  Maybe the FTP working group.

 ----------------------------------------------------------------------

 Sponsoring Organization / Funding source:

  Defense Advanced Research Projects Agency, Information Science and
  Technology Office, under the title "Research on Parallel Computing,"
  ARPA Order No.  7330.  Work furnished in connection with this research
  is provided under prime contract MDA972-90-C-0035 issued by DARPA/CMO
  to Carnegie Mellon University.  Vincent Cate is supported by an "Intel
  foundation graduate fellowship".

 ----------------------------------------------------------------------


Foster                                                         [Page 11]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


 Mailing Lists:

  Address:              alex-servers@cs.cmu.edu

  Administration:       alex-servers-request@cs.cmu.edu


  Description:          alex-servers is for people setting up an Alex
                        fileserver.

  Archive:              alex.sp.cs.cmu.edu (128.2.209.13)

 ----------------------------------------------------------------------

 News groups:

  None.

 ----------------------------------------------------------------------

 Protocols:

  What is supported:   Any machine that can NFS mount a fileserver.

  What it runs over:   Unix machine and FTP

  Other NIR tools this interworks with:

   Uses FTP sites.

   WAIS can be used to index files in Alex
    (this was done for ftpable-readmes and cs-techreports WAIS servers)

      New versions of archie can output Alex paths.

 Future plans:         Graduate from CMU.

 ----------------------------------------------------------------------

 Servers:

  Date completed or updated:    19 March 1994
  By: Name:                     Vincent Cate

  Platform:                     UNIX

  Primary Contact:
  Name:                         Vincent Cate


Foster                                                         [Page 12]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


  Email address:                vac@cs.cmu.edu
  Telephone:                    +1-412-268-3077

  Server software available from:  alex.sp.cs.cmu.edu

  Location of more information:
   No other place to go to.

  Latest version number:
   New versions all the time.

  Brief Scope and Characteristics:
   This software is known to still contain bugs.

  Approximate number of such servers in use:
   200.

  General comments:
   You can use lpr, make, grep, more, etc. on files around the world.

 ----------------------------------------------------------------------

 Clients:

  You just do an NFS mount of the server.  No client software
  is needed.

 ----------------------------------------------------------------------

 Demonstration sites:

  Site name:   alex.sp.cs.cmu.edu

  Access details - do the following as root:
   mkdir /alex
   mount -o timeo=30,retrans=300,soft,intr alex.sp.cs.cmu.edu:/ /alex

  Example use:
   ln -s /alex/edu/cs/cmu/sp/alex/links alexlinks
   cd alexlinks
   ls
   cd cs-tr
   cd ls
   cd purdue
   ls
   lpr TR758.PS


Foster                                                         [Page 13]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


  If you like Alex and want to use it regularly please find, or set up,
  an Alex fileserver at/near your site.

 ----------------------------------------------------------------------

 Documentation:

  ftp://alex.sp.cs.cmu.edu/www/alex.html
  ftp://alex.sp.cs.cmu.edu/doc/intro.ps
  ftp://alex.sp.cs.cmu.edu/doc/NIR.Tool
  ftp://alex.sp.cs.cmu.edu/doc/alex.post

 ----------------------------------------------------------------------

 Bibliography:

  @InProceedings{cate:alex,
  author =      "Vincent Cate",
  title =       "Alex - a Global Filesystem",
  booktitle =   "Proceedings of the Usenix File Systems Workshop",
  year =        1992,
  pages =       "1--11",
  month =       may,
  place =       "Ann Arbor, MI",
  keyword =     "distributed file system, wide-area file system"

 ----------------------------------------------------------------------

 Other Information:

  FTP to alex.sp.cs.cmu.edu and "cd to doc".  Get the "README" or
  anything else there.  A current version of this document may be there
  and called "NIR.Tool".  In Alex this file is named
  "/alex/edu/cmu/cs/sp/alex/doc/NIR.Tool".

 =-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=


Foster                                                         [Page 14]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


 ARCHIE

 Date template updated or checked:       1 March, 1994
 By: Name:                               Peter Deutsch
     Email address:                      peterd@bunyip.com

 -----------------------------------------------------------------------

 NIR Tool Name:                  archie

 Brief Description of Tool:

   The archie system is a tool for gathering, indexing and serving
   information from around the Internet.  The current version serves a
   collection of filenames found at anonymous FTP sites, as well as a
   smaller collection of text descriptions for software, data and other
   information found at anonymous FTP archives.  Additional databases
   are under development.

   User's View:

      Users run a client program to connect to an archie server and
      issue search commands to find information in an archie database.
      In the case of an anonymous FTP filename, this information can
      then be used to fetch the file directly from the archive site
      using the `ftp' command.  To the user, archie could be seen as a
      `secondary source' of information which, because of the high cost
      of locating and serving, would not otherwise be available.

      The user searches the archie databases through either a telnet
      session to a machine running an archie server, or by using a
      stand-alone client program (which uses the Prospero protocol for
      sending and receiving requests).  There is also an email interface
      which allows users to send and receive search requests via
      electronic mail.

      Freely available archie clients exist for most operating systems
      and can be fetched using anonymous FTP from most of the current
      archie servers.  There are also gateways to the archie system from
      many other NIR tools, including Gopher, WAIS and WWW.  An X.500
      interface to archie is currently under development.

   Information Provider's View:

      There are two types of information providers who would be
      interested in archie.  Primary information providers are
      interested in having a summary of the information provided by
      their service tracked by an archie server.  Secondary service


Foster                                                         [Page 15]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


      providers, or those sites wishing to provide a "value-added"
      service for the Internet can elect to run an archie server at
      their site to provide a useful service to users, to raise the
      profile of their institution on the Internet, or to provide market
      differentiation (for commercial service providers).

      The archie system is of particular utility serving information
      where there are many sites to be searched and/or where the cost of
      searching each site is high.

      For example, there are currently over 1,200 anonymous FTP sites on
      the Internet, and the number continues to grow.  Searching for a
      specific filename at a single site may involve scanning hundreds,
      or even thousands of filenames.  Thus, most operators of anonymous
      FTP archives welcome the fact that archie indexes and serves the
      names of all files available from each site tracked.

   Information Types Supported:

      The archie system allows the gathering and serving of arbitrary
      information types, although the current system serves only
      freeform text and a dedicated text format for filename listings.
      Internally, the archie system now supports a WAIS search engine
      and frontends for Gopher, WWW and WHOIS++ for accessing archie
      information through Gopher clients is now being tested.
      Additional collections of information to be served by the archie
      software will be announced.

 -----------------------------------------------------------------------

 Primary Contact(s):

  Name:                   Archie Group, Bunyip Information Systems Inc.

  Email address:          info@bunyip.com

  Postal  Address:        Bunyip Information Systems Inc.,
                          310 St-Catherine St. West, suite 202,
                          Montreal, QC
                          CANADA H2X 2A1

  Telephone:              +1-514-875-8611
  Fax:                    +1-514-875-8134

 -----------------------------------------------------------------------

 Help Line:       for archie server system and telnet client


Foster                                                         [Page 16]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


  Name:           Archie Group, Bunyip Information Systems Inc.

  Email address:  info@bunyip.com

  Telephone:      +1-514-875-8611


  Level of support offered:
                  o commercial support for server
                   (primarily for systems maintainers)

                  o voluntary helpdesk support for freeware clients

                  o volunteer helpdesk support for Internet information
                   gathering tools in general

  Hours available:        - server system:
                           email:                24 hour support
                           phone support:        9-5 EST

                         - helpdesk consultation: as time permits

 ----------------------------------------------------------------------

 Related Working Groups:

  IETF, IIIR, WNILS, URI.

 ----------------------------------------------------------------------

 Sponsoring Organisation / Funding source:

  Bunyip Information Systems Inc.

  Funded by licensing of archie software and development contracts from
  sponsors.  Additional information services based upon this software
  are now being tested.

 ----------------------------------------------------------------------

 Mailing Lists:

  Address:              archie-people@bunyip.com

  Administration:       archie-people-request@bunyip.com

  Description:


Foster                                                         [Page 17]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


   This mailing list is for people interested in the archie project and
   its future developments.  Announcements of upgrades, new services,
   etc. are made to this list.

  Archive:              none

                        -------------------

  Address:              archie-maint@bunyip.com

  Administration:       archie-maint-request@bunyip.com

  Description:

   This mailing list is for people who operate and maintain archie
   servers.  Announcements of bug fixes, new releases and discussion of
   new features are carried out on this list.

  Archive:
   "archives.cc.mcgill.ca:/pub/mailing-lists/archie-maint"

                        -------------------

  Address:              iafa@bunyip.com

  Administration:       iafa-request@bunyip.com

  Description:

   This mailing list is for people who are involved in the Internet
   Anonymous FTP Archives Working Group of the IETF.  This group was
   involved in standardizing the encoding of information at anonymous
   FTP archives and thus is of interest to operators and users of the
   archie system.  It came to completion in November, 1992 and produced
   two documents which have been presented to the IETF as informational
   RFCs.

  Archive:              "archives.cc.mcgill.ca:/pub/mailing-lists/iafa"

 ----------------------------------------------------------------------

 News groups:

  Name:                 comp.archives.admin


Foster                                                         [Page 18]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


  Description:

  This newsgroup is for operators and maintainers of Internet archives.
  Announcements and discussions of issues related to archie are
  presented here, as well as discussions of more general issues
  relating to archiving and Internet services.

  Archive:              not known

                        -------------------

  Name:                 alt.internet.services

  Description:

  This newsgroup is for people interested in Internet-related services,
  with a focus at the user level.  Announcements and discussions of
  issues related to archie are presented here, as well as discussions
  of more general issues relating to Internet services.

  Archive:              not known

 ----------------------------------------------------------------------

   Protocols:

   What is supported:

      The current archie system clients use the Prospero protocol for
      communication with the search engine on the archie server.  Freely
      available clients are available which include source to perform
      this communication for those wishing to implement additional
      clients.

      The archie server is capable of building arbitrary databases,
      using arbitrary search and access engines and the current release
      ships with the public domain implementation of WAIS.  We expect
      future archie servers to serve information using this protocol.
      The current server system assumes the TCP/IP protocol suite is
      available, and in particular the ftp protocol for data gathering.

      The archie system can be accessed through systems operating the
      Gopher, WAIS and WWW (HDDL) protocols.  A gateway from the X.500
      system is under development.


Foster                                                         [Page 19]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


   What it runs over:

      The Prospero protocol implementation runs over its own
      implementation of a reliable datagram protocol based upon UDP.
      Data gathering runs over the TCP/IP protocol suite.

   Other NIR tools this interworks with:

      Prospero, Gopher, WAIS, WWW.

   Future plans:

      The archie system became a commercial product in October, 1992,
      marketed by Bunyip Information Systems Inc.  The company plans to
      market additional data gathering modules to allow the server code
      to build additional types of databases.  Work is also underway to
      integrate extensions to WHOIS to allow the building and
      maintaining of White Pages (names) directories.  The company is
      also working on other Internet information tools that will work
      with the archie system.

 -----------------------------------------------------------------------

 Servers:

  Date completed or updated:    1 November, 1993
  By: Name:                     Peter Deutsch
      Email address:            peterd@bunyip.com

  Platform:                 Sun SPARC running SunOS 4.1 or later.
                            IBM RS6000 running AIX version 3.2 or later.
                            for additional UNIX platforms, contact
                            Bunyip Information Systems details.

  Primary Contact:
  Name:                         Alan Emtage
  Email address:                bajan@bunyip.com
  Telephone:                    +1-514-398-8611

  Server software available from:
   Bunyip Information Systems Inc.
   email:  info@bunyip.com

  Location of more information:

  Additional information on the archie product line is available from
  the anonymous ftp archives on the various archie server sites. Try
  "archie.ans.net", "archie.sura.net", "archie.au", etc.


Foster                                                         [Page 20]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


  Latest version number:          archie 3.1

  Brief Scope and Characteristics:

  This is the  commercial inmplementation of the archie system,
  replacing a version done as a Masters project at McGill University
  during the period 1990-1992.  It comes with an archie telnet client
  that offers a number of minor improvements over earlier versions.
  Additional releases, with a number of additional improvements, are
  planned in the coming months.

  Approximate number of such servers in use:
   Currently about 27 (not all are publicly available)

  General comments:

  Most users access archie through a freeware or public domain client
  program.  These are available from most archie servers via anonymous
  FTP.  Check out the archie directory on any of the publicly available
  archie servers or the banner message when logging into any of the
  archie telnet clients for more details.

 ----------------------------------------------------------------------

 Clients:

  Date completed or updated:    1 November, 1993
  By: Name:                     Peter Deutsch
      Email address:            peterd@bunyip.com

  Platform:                     command line shell, written in C. Works
                                with both UNIX and MSDOS/OS2 shells.

  Primary Contact:
  Name:                         Brendan Kehoe
  Email address:                brendan@cygnus.com
  Telephone:                    not known

  Client software available from: most archie server hosts and major
                                  Internet archives. Look for filename
                                  "c-archie-1.3.2.tar.Z".

  Location of more information:   Packaged with software.

  Latest version number:          1.3.2


Foster                                                         [Page 21]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


  Brief Scope and Characteristics:

  This program provides a simple command line interface to the archie
  server system, using the Prospero protocol.  Written in C, it has
  been ported to MSDOS and OS2.

  General comments:

  This program should not be confused with the archie system telnet
  interface, which is a program that runs on the archie server itself.

  Future plans:  Not known

                        -------------------

  Date completed or updated:    1 November, 1993
  By: Name:                     Peter Deutsch
      Email address:            peterd@bunyip.com

  Platform:                     command line shell, written in Perl.
                                Works with both UNIX and MSDOS/OS2
                                shells.

  Primary Contact:
  Name:                         Khun Yee Fung
  Email address:                clipper@csd.uwo.ca
  Telephone:                    not known

  Client software available from: most archie server hosts and major
                                  Internet archives. Look for filename
                                  "perl-archie-3.8.tar.Z".


  Location of more information:   Packaged with software.

  Latest version number:          3.8

  Brief Scope and Characteristics:

  This program provides a simple command line interface to the archie
  server system, using the Prospero protocol.  Written in Perl.

  General comments:

  This program should not be confused with the archie system telnet
  interface, which is a program that runs on the archie server itself.

  Future plans:  Not known


Foster                                                         [Page 22]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


                        -------------------

  Date completed or updated:      1 November, 1993
  By: Name:                       Peter Deutsch
  Email address:                  peterd@bunyip.com

  Platform:                       archie client program for VMS systems.

  Primary Contact:
  Name:                           Brendan Kehoe
  Email address:                  brendan@cygnus.com
  Telephone:                      not known

  Client software available from: most archie server hosts and major
                                  Internet archives. Look for filename
                                  "archie-vms.com".

  Location of more information:   Packaged with software.

  Latest version number:          not known.

  Brief Scope and Characteristics:

  This program provides a simple command line interface to the archie
  server system for users of VMS.

  General comments:

  This program should not be confused with the archie system telnet
  interface, which is a program that runs on the archie server itself.

  Future plans:  Not known

                        -------------------

  Date completed or updated:      1 November, 1993
  By: Name:                       Peter Deutsch
  Email address:                  peterd@bunyip.com

  Platform:                       Xwindows client (X11R4)

  Primary Contact:
  Name:                           George Ferguson
  Email address:                  ferguson@cs.rochester.edu
  Telephone:                      not known

  Client software available from: cs.rochester.edu, most archie server
                                  hosts and major Internet archives.


Foster                                                         [Page 23]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


                                  Look for file "xarchie-1.3.tar.Z".

  Location of more information:   Packaged with software.

  Latest version number:          xarchie-1.3

  Brief Scope and Characteristics:

  This program provides an Xwindows client that allows users to search
  the archie anonymous FTP database.  Also included is the capability of
  fetching files (using ftp).

  General comments:               none.

  Future plans:  Not known

                        -------------------

  Date completed or updated:      1 November, 1993
  By: Name:                       Peter Deutsch
  Email address:                  peterd@bunyip.com

  Platform:                       NeXTStep client.

  Primary Contact:
  Name:                          Scott Stark
  Email address:                 me@superc.che.udel.edu
  Telephone:                     not known

  Client software available from: most archie server hosts and major
                                  Internet archives. Look for file
                                  "NeXTArchie.tar.Z".

  Location of more information:   Packaged with software.

  Latest version number:

  Brief Scope and Characteristics:

  This program provides a NeXTStep client that allows users to search
  the archie anonymous FTP database.  Also included is the capability
  of fetching files (using ftp).

  General comments:               none.

  Future plans:  Not known

 ----------------------------------------------------------------------


Foster                                                         [Page 24]

RFC 1689   Networked Information Retrieval: Tools and Groups August 1994


 Demonstration sites:

  Site name:      any one of:

    archie.rutgers.edu   128.6.18.15     (Rutgers University)
    archie.unl.edu       129.93.1.14     (University of Nebraska in
                                          Lincoln)
    archie.sura.net      128.167.254.179 (SURAnet archie server)
    archie.ans.net       147.225.1.2     (ANS archie server)
    archie.au            139.130.4.6     (Australian server)
    archie.funet.fi      128.214.6.100   (European server in Finland)
    archie.doc.ic.ac.uk  146.169.11.3    (UK/England server)
    archie.cs.huji.ac.il 132.65.6.15     (Israel server)
    archie.wide.ad.jp    133.4.3.6       (Japanese server)

  Client software should be supported at all of these sites.
  Additional sites are available. Use the "sites" command in the archie
  telnet interface at any of the above sites for a more complete lists.

  Access detail