Figure 3.1 shows the generic architecture to be used for this prototype system.
The system can be thought of as a two layer system with one layer containing existing code and black box applications, which can be easily interfaced with the second layer. This paper shall be concerned with the second layer, since it is the prototype code needed in order to provide translation between speech input and animation output.
As the Figure 3.1 shows, the application shall consist of three main parts; the interface with ViaVoice, the natural language processing code and the animation module that carries out the actions specified by the user. These modules are discussed in greater detail in Section 3.3.
The assumption that the existing technologies interface with the hardware, such as the graphics card for output and the microphone for input, allows only the interactions between the modules shown in Figure 3.1 to be considered.