22C:116, Lecture Notes, Aug. 23, 1995

Douglas W. Jones
University of Iowa Department of Computer Science

REVIEW:
This lecture begins an intensive review of operating systems. Everyone in this class should have had a fair amount of this material, but I do not expect everyone to have prior exposure to the same subset of this material, and I hope to cover the material quickly enough that everyone is exposed to at least some new material in every lecture.
The hardware on which we run an operating system is itself a system; as such, it is composed of several subsystems; here is a crude diagram of part of such a system:
```
 |-------|
 |  CPU  | CPU   CPU     -- CPUs
 |   |   |   \   /
 |  MEM  |    MEM        -- memory units
 |   |   |     |
 |  IOP  |    IOP        -- I/O processors
 |   |\  |     |\
 |   | \_______|_\____   -- a network
 |   |         |
 | other |   other       -- other I/O devices
 |  I/O  |    I/O
 |-------|
Traditional
```
The focus of a traditional operating system course is on a system with one CPU and no network. In this course, as suggested by the above figure, we will concentrate on systems that may include multiple CPUs, and indeed, systems that may include many computers interconnected by a network.
The CPU or CPUs are complex subsystems in their own right. Although CPU structures vary wildly, we will generally simplify the model, treating the CPU as a control unit, arithmetic unit, and a set of registers, including a program counter:
```
Control      Registers
Unit         |  PC   | The program counter
	     |       |        and
ALU <------> |_______| Some operand registers
```
Within the CPU, we will be concerned with the set of registers visible to the programmer, and not such details as the presence or absence of pipeline stage registers, microprogram support registers, or other registers hidden inside the CPU.
The most important operation on the registers that will concerns us is that of context switching. This involves first saving all registers of one context and then restoring all registers to build a new context.
Context switching is accomplished in different ways on different processors. Some CPUs support single ``context switch'' or ``exchange jump'' instructions that save and restore the entire CPU state in a single instruction. Such instructions generally take, as operands, the address of a data structure in memory to be used for saving or restoring the CPU state.
Another approach to context switching is to first push all of the registers on the CPU stack, pushing the program counter first (usually as part of the execution of a procedure call or interrupt). Once all the registers are on the stack, changing the stack pointer to the stack of the new context, followed by popping the registers and then returning from interrupt or returning from the procedure will complete the context switch.
Within a typical operating system, there will be a data structure used to store the context of a process. At minimum, this contains a program counter and at least one other register. The details of this data structure depend on both the hardware resources of the machine and on how the context switching is done.
Context switching software is always quirky, with strong machine dependencies. It can sometimes be written in a type-unsafe higher level language such as C or some versions of Pascal, but when this is so, the code must make strong assumptions about how the compiler and underlying hardware operate. The context switching code on most operating systems is still written in assembly language. If the operating system is to be portable, this code is typically in the form of a procedure that takes, as parameters, pointers to two context description records, one in which the old context is to be stored and one from which the new context is to be loaded.
Exercise: Design a context data structure and a context switching routine for any machine for which you can get adequate documentation.
Memory is sometimes a very simple resource, but sometimes, this apparent simplicity is misleading. Compiler writers, for example, must be very sensitive to addressability. On some systems, memory is byte addressed. On others, it is word addressed, and on most, there is some combination. For example, on most CISC architectures of the 1970's (the IBM 370, DEC VAX, Motorola 680x0, Intel 80x86, byte addressing was allowed but with a significant performance penalty. On some modern RISC systems such as the DEC Alpha, hardware support for byte addressing has been sharply reduced.
Operating systems must frequently contend with a different source of complexity in memory management, virtual addressing.
```
Virtual Memory     Lens       Real Memory
 |         |       ----       |         |
 |         |       \  /       |         |
 |         |        )(        |         |
 |         |       /  \       |         |
 |_________|       ----       |_________|
    CPU's         Memory
  Viewpoint     Management
	           Unit
```
A Memory Management Unit (or MMU) is present in most larger computer systems, sometimes as a component of the CPU, and sometimes as a separate component. The Motorola 680x0 and Intel 80x86 (for x > 2) have quite complex memory management units.
This unit serves as the "lens" through which programs operating on the CPU observe the memory. As with any lens, we can talk about the real world and the image of the world as seen by an observer looking through the the lens, the virtual image.
The memory management unit contains registers that may or may not be viewed as part of the context of a running program:
If the MMU registers are saved and restored as part of the context switching operation, the operation takes longer and the operation changes the memory address space. If the switch does not save and restore MMU registers, it is fast, but both the old and new contexts must share the same address space.
This is why we see systems with two levels of software parallelism -- lightweight or easy to switch threads that share address spaces with each other, and heavyweight and difficult to switch processes each of which has its own address space. This allows multiple parallel threads of control acting on behalf of a single application to coexist at very low cost, while allowing separate address spaces to be allocated to each application.
Register input-output is the simplest form of input-output. In order to avoid the complexity of references to a particular machine language, we will describe this using high level language procedure call syntax to describe the instructions:
```
 * send( data, device )         -- output
 * data = get( device )         -- input
 * repeat until ready( device ) -- wait for transfer complete
```
In register I/O, data is tranferred is by the program, one byte or word at a time. I/O registers may be addressed as if they are memory locations, or there may be a distinct I/O address space. The device ready indication is typically communicated by one bit in a status register; other bits may be used to indicate error conditions or other aspects of the device state.
Code for directly manipulating I/O devices is always quirky and highly machine and device dependant. It is frequently written in assembly language, although on machines that use memory-mapped I/O, use of assembly language is not needed. Even when device access code is written in a high level lanugage, however, it is rarely portable to different architectures!
Exercise: Write a routine to read a character from the keyboard input port or from an asynchronous input port of some machine -- do this with no use of operating system routines.
Direct memory access input-output is more complex. Typically, but not always, there is a distinction between the I/O controller and the device, so, for example, you may find multiple disks on one disk controller, or multiple SCSI devices (disks, tapes, and other widgits) on SCSI controller, as illustrated:
```
   _____                ______
  |     | __MAIN BUS__ |      |
  | CPU |<_____   ____>|Memory|
  |_____|      | |     |______|
               | |
              __v__
             |     | _____SCSI BUS______
             |SCSI |<____   ______   ___>
             |_____|     | |      | |
                         | |      | |
                        __v__    __v__
                       |     |  |     |
                       | DISK|  | TAPE|
                       |_____|  |_____|
```
In such a system, the SCSI controller can transfer data between the disk or the tape and main memory, under the direction of the CPU. To do this, the CPU must inform the controller of the memory address to be used and the operation to be performed, and through the controller, it must inform the disk or the tape drive of the device address (for example, the sector, track and cylinder) to be referenced and the operation to be performed.
Thus, a single DMA I/O transfer involves the following steps:
1. select device
2. select address of data on device
3. select buffer in main memory
4. start transfer in or out
5. await transfer complete
When direct memory access I/O hardware is employed, data is transferred one block at a time, generally in parallel with program execution. I/O is initiated by a program running on the CPU, and then it takes place in parallel with program execution. When done, a signal is sent to the CPU indicating that the transfer is complete.
A USER PERSPECTIVE
Users typically rely on operating systems to provide the following:
- File Systems
- Window Managers
- Command Languages
- Resource Sharing
- Resource Protection
The word ``file'' is frequently used rather carelessly to refer to any of the following items, even though they are quite different things:
- The object of I/O operations.
  For example, consider read(f,buffer,len), where f is an open file.
- A named object in a directory structure.
  For example, look at ~dwjones/ftpdir/pdp8/README
- A region on a disk or similar device.
  For example, the space allocated to ~dwjones/ftpdir/pdp8/README is described by I-node 14897 in our departmental file server.
These three meanings of the word "file" are usually used without any clarifying comment, and you have to figure out, from context which meaning is intended.
Generally, operating systems support an operation, such as open(f,name), that takes a file name and interprets it in a directory structure to deliver an open file object. This object may refer to a file on disk, or it may refer to something else such as a terminal, a communications line, or a window.
It should be noted that on some systems, a file on disk need not have any name in any directory structure!
Command Languages come in a great variety, from limited function Window-Icon-Mouse-Pointer interfaces to general purpose programming languages.
Before UNIX, command language interpreters were generally integral parts of the operating system, and many people mistakenly asserted that, quite naturally, each command language operation was identical to a call to an operating system routine.
In UNIX and most later systems, the command language interpreter, called a shell on UNIX, is merely an application program. If you don't like it, you can write your own. Most commands in the command language are merely names of programs, and the user of the command language interprets the commands as if they were parameterized procedure calls, where the programs themselves are the procedures being called.
Most users view the command language interpreter as part of the system, but for most purposes, the system views the command language interpreter as if it was a user program.
As an aside, note that in later versions of the UNIX, the system has a special relationship to the Bourne shell (the original UNIX shell) through the system() call, but this is not formally a system call; rather, it is a library routine that passes its argument to the Bourne shell.
exercise: Write a C program under UNIX that executes the standard ls program (the program that runs when you type the ls command at a UNIX shell prompt.