|
The purpose
of these guidelines is to show how it is now possible
for local historians to put 'free format' databases on-line
without suffering the large costs that this currently
entails when using commercial organisations. Examples
are provided of four systems under development. An outline
is given of the basic requirements to build such a system
and groups wishing to learn more are invited to contact
the author
for more detailed information.
1. Introduction Top Most web sites created by local historians consist of static pages (just like this one) where the user can navigate between pages but the content of each page remains fixed. This is fine in many cases where the site contains a relatively small amount of information. However once a lot of information is involved some means is needed to select and display the information that is relevant at the time. One solution can be found on most E-commerce sites. For example, on a travel agent's web site a potential holiday-maker might type in the holiday locations and dates they are interested in. The software on the web server (the travel agent's computer) then finds which holidays and dates on offer match the user's requirements and the web server sends just these details to the holiday-maker's screen. This is a simple example of just one of many uses of on-line databases. Until recently the necessary software and access to web servers to do this were expensive and consequently outside the reach of the local historian who wished to provide access to historical data such as census information or indexes to publications. However, free, legal and robust software is now available for this type of application and most importantly a few of the free Internet Service Providers (ISPs) now permit the necessary software and data to be loaded on their web servers. The author has four such applications under development and it is suggested that the methods employed may be of interest to local history groups who might like to try something similar. It should be noted that the data in these particular applications are treated as 'free format' - that is each item (or record) in the database is not split into separate fields as in the tables in a relational database such as MS Access or Lotus Approach. These guidelines
set out the basic principles of the method and no attempt
has been made to give full details as each case will have
slightly different requirements and potential users are
invited to contact the author
to discuss their particular application. 2. A User's View Top User Interface A key consideration of any computer system is to make it as 'user friendly' as possible. When a user opens one of the authors on-line database applications described below they see a simple web page which is similar to the Google search engine as it contains little more than just a Search Text Input Box. Just as in Google the user types into the input box relevant key words or phrases for the items they wish to search for. The user then presses the Enter key (or clicks on the Submit Query button). After a few seconds all items that match the query are displayed in the user's browser (up to a preset limit of typically 200 items). This is different from Google which normally displays only ten items at a time. The total number of 'hits' or matches are displayed at the top of the output. The opportunity to try out three of these applications is provided below. Applications Under Development Items displayed from a search will have been found in the potentially huge data file the developer has installed on the web server belonging to their Internet Service Provider. Each item returned is technically a 'paragraph' in the data file. Here are some examples taken from the data file for projects which are currently under development. Note they are simply examples of individual data items and not the result of any particular search. A.
General Index to the Transactions of the
Bristol and Gloucestershire Archaeological Society (1876-1991)
B.
List of Contents of the Transactions of
the Bristol and Gloucestershire Archaeological Society
(1876-2002)
C.
Coaley Censuses 1841 - 1901
D.
A Historical Gazetteer of Cheltenham
There are about 134,000 items in the TBGAS General Index, over 2000 articles listed in the TBGAS contents list and about 5400 items in the Coaley Censuses. The Historical Gazetteer of Cheltenham in its printed form runs to about 200 pages. More Detailed Instructions for Carrying Out Searches Click here to visit the Search page for the General Index to the Transactions of the Bristol and Gloucestershire Archaeological Society (1876-1991) to try out the system for yourself. On this page you will see basic instructions on how to carry out a search. The instructions are also shown below.
Local Use of a Database System Although envisaged
to normally operate on-line the software and data files
can of course be used locally on a computer that is set
up to act as a web server. In this case all activities
take place purely on the local computer without the need
for any connection to the Internet by telephone or any
other means. The disadvantage of this is that all updates
to the data and software must be carried out at each location
that is running the application. However, this is a small
price to pay for the database always appearing to be on-line
without the cost or inconvenience of the telephone tied
up continuously. The relevant software is free and can
be obtained for Windows from the IndigoPerl
web site. 3. Outline Technical Description Top The system relies on four main things over and above the usual things such as access to the Internet and tools for editing files. The four key requirements are:
After the
data file the key to the whole process is the CGI script
which is effectively a computer program. The script accepts
the search key words and phrases from the box on the web
page in item 2 above. It then searches through the data
file for items that match the search criteria. Finally
it sends a count of the items found and the items themselves
(a series of paragraphs) to be displayed on the user's
web browser such as Internet Explorer. A general purpose
script has been written by the author such that normally
only minor changes are required to adapt it for any new
application. 4. Building Your Own System Top The last section gave the four key requirements and this section considers how a group might set about meeting these.
More About the the Data file and Searching it. The potential long length of the items in the data file should not normally be a problem as it is envisaged that other computer programs such as relational databases like MS Access and Lotus Approach may be used to generate the data file. Such an application may be described as a free format database even though the 'free format' data may actually be derived from several fields from an Access database as in fact is the case for examples B, C and D in Section 2 above. In a museum catalogue the inclusion of an extra keyword in a search like book, photograph or painting will (more or less) guarantee that the items returned will be that type or object. Similarly, for example the keyword 1997. (note the dot) will return all the items added to the collection in 1997 where the accessions for that year have identifiers 1997.1, 1997.2 etc. The method under discussion is not generally suitable for the 'rule based' queries which are possible with relational databases such as Access where, for example, you could count (or at least estimate!) the number of inhabitants over 50 from census data. It is possible
to manipulate and display on-line data that is stored
in tables in relational databases like Access. The MySQL
package is an extremely powerful and robust piece of free
software which does this. It is based on the industry
standard SQL language (pronounced Sequel) that has been
used on mainframes for many years. Unfortunately, there
appears to be only a few ISPs, at present, who offer MySQL
facilities free of change although a number provide it
as a 'paid for' service similar to the provision of CGI
facilities discussed above. 5. Concluding Remarks Top The improved availability of facilities to run CGI scripts has made it possible for local historians to build powerful on-line free format databases. The databases are fully searchable and can be applied to a variety of data such as the Censuses, gazetteers, indexes to publications and simple museum catalogues. Once a group has built a system for one set of data then it is normally a simple matter to build systems for other data that the group might wish to treat in the same way. Where it is more convenient it is also possible to run the same database on a locally based computer rather than on the Internet. Feedback on these notes will be welcomed. Please send them to the Author at ray.wilson@coaley.net .
|