Chapter 3
Requirements and Methods of Investigation
Introduction
This chapter explains the research that was undertaken for the project, and the results. It describes the methods used for the research and why they were used. It gives the conclusions drawn from the research, resulting in a full list of requirements for the system. The chapter is divided into three sections:
3.2 Research
3.3 Requirements
3.1. Method of Investigation
The main source of information used while researching this project was the Internet. This is primarily because the project is to develop a web application, and much of the information about the world-wide-web is available on the web itself. Most blind charities also have web pages that were useful in obtaining information about the rights of the visually impaired, and the services that exist to support them.
During the process of researching, someone who works for a talking newspaper for the blind distributed on audio cassettes in the Luton area was consulted about the project. He agreed to put a message on the next edition of the newspaper asking for any visually impaired computer users who were interested in helping with the project to get into contact. One visually impaired person did so, and discussed his ideas about the project. He suggested information he would like to see included in the newspaper, as well as some ideas on how the system could work. He also gave valuable feedback on various ideas about the system’s operation. As many of his suggestions as possible have been used in the course of developing the system.
While developing the project itself books were used to aid in the design [CARROL 89] and implementation [PETZOLD 89] of the system. Notes from university lectures and sections of code developed as part of exercises set in lectures were also a valuable resource.
3.2. Research
During the course of researching the project a great deal of information about systems in place to help visually impaired people get access to news and information was found. Information about how computers are made accessible to visually impaired users through the use of screen readers, etc. was also investigated.
While investigating popular news web pages, the way such pages are designed was investigated. It was discovered that almost all pages on the Internet use tables to create complex page layouts, and that screen readers and talking browsers have trouble getting the information across in an intelligible manner. Other HTML tags that are commonly used in web pages, and how they are used to structure pages was also researched.
The official HTML 4.0 specification [HTML] has guidelines for “non-visual user agents” such as the one this project aims to produce. Unfortunately, many of them rely on the web pages being used taking certain guidelines into account, and most (if not all) of the pages examined during research failed to follow these guidelines.
3.3. Requirements
This section will outline the main requirements for the system to be developed, divided into sections representing different aspects of its operation. The requirements were obtained by investigation of existing systems, discussions with a potential user, and the author’s ideas about how the system should operate.
It was decided that the best way to present the information in a page to the user in a simple and understandable way was to remove all the parts of the page that are not necessary to the understanding of the content. This includes adverts, navigation menus, etc. This is much simpler than trying to confer meanings and importance from the different sections in a page (in most cases this is impossible).
A number of aspects of how the system could work became clear as research progressed. The system would have to be maintained from a central location because producing a completely automated system for finding information on the Internet would be extremely difficult. To do this, a database of pages that can be used in producing the newspaper must be used. It must not only contain the addresses of the pages, but also details of how they can be converted into text in a way that is understandable to a screen reader. Attempting to devise algorithms to extract relevant information from a web page automatically is beyond the scope of this project (if it is possible at all).
In order to allow the user to browse the newspaper as a sighted user would, it has to be divided into sections so that the user can choose to read certain sections and skip others. To achieve this the newspaper has to be structured as a hierarchical document.
In order to get the important information from a web site, more than just the main page is needed. Generally, the main page is a contents page containing links to each item or news story. It was decided that by getting the main page, and following certain links on the main page, enough information could be retrieved from the site. Some experiments were done with following links to a certain depth, i.e. following the links on the main page, and then the links on each of the linked pages etc. to a certain depth. This proved to be very complicated, and it was decided that the extra amount of information gained was negligible.
The name “Web News Speak” was chosen for the application, and it will be referred to as such in the remainder of this document.
A formal list of requirements is shown in figure 3.3a.
|
Page Database
|
|
|
User Interface
|
|
|
Downloading Pages
|
|
|
System Output (i.e. the newspaper
itself)
|
|
Figure 3.3a -
Requirements
Where the word “should” is used, a requirement is desired but not absolutely necessary. Where the word “must” is used the requirement is necessary for the system’s operation.