Sysgroup Handbook

Printed 18 July 1994

This is Edition 1.0 of the PSU-CS Sysgroup Handbook, for the Computer Science Systems Staff. Last updated 29 September 1993. Printed 18 July 1994.

Published by Portland State University, P.O. Box 751, CMPS, Portland, OR 97207

Permission is granted to make and distribute verbatim copies of this manual provided the copyright notice and this permission notice are preserved on all copies.

Permission is granted to copy and distribute modified versions of this manual under the conditions for verbatim copying, provided that the entire resulting derived work is distributed under the terms of a permission notice identical to this one.

Introduction

The Computer Science Systems Group (Sysgroup) is dedicated to providing and maintaining computing resources for the CS faculty, staff and students. Sysgroup is responsible for:

Hardware (computers, terminals, printers, etc.).
Networks (hardware and software interdependencies).
Software (OS and local).
User services (in cooperation with the Tutoring staff).
Problem response and emergency handling.

Sysgroup's goal is to provide and maintain a wide range of hardware and software with the greatest quality possible. Unlike a Computer Center, we are responsible for being flexible toward unusual requests and fulfilling them to the best of our ability.

This document explains the procedures which have been unwritten for many years, and others which are newly created for the changing needs of the department. It is not intended to be an all-inclusive set of rules and regulations, but instead, a set of guidelines (and some rules) to help shape the culture within the Systems Staff.

The fewer rules the better: Rules reduce freedom and responsibility. Enforcement of rules is coercive and manipulative, which diminishes spontaneity and absorbs group energy.
The more coercive you are, the more resistant the group will become. Your manipulations will only breed evasions. Every law creates an outlaw. This is no way to run a group. (1)

Volunteers

Much of the work on the Systems is performed by students. These students generally start out as volunteers who work for their own edification and for the opportunity to become a paid member of the Sysgroup.

In some cases, a volunteer may be an underclassman or may not be attending PSU and, as such, not be entitled to an account on the CS machines. However, for this purpose, a volunteer may be granted a guest account. See section Guest Accounts, for more info.

In order to become a volunteer, a person must attend the Systems Workshop, pass a pre-test, and be interviewed by the Systems Manager. A file will be kept on each volunteer, which should, at minimum, contain a copy of their resume.

After a volunteer is accepted as a new member of Sysgroup, they will be put into the "sysgroup" mailing list.

Systems Workshop

In order to maintain a continuous supply of persons for doing systems work, a training program will be run. It will have approximately 1-2 hours of lecture per week, and will cover much of the necessary background material for Systems Administration (e.g. shell programming, filesystem structure, networking, troff, etc.) along with actual Systems Administration material.

Some early lectures have been video-taped and are available in the EE office. The materials for this workshop may be found in `/home/rigel/sys/group/workshop'.

SIGs

Since the Systems Workshop is intended to train volunteers at a lowest common denominator, there are some people who may want to pursue specialized topics. To encourage this, special interest groups will be formed to discuss, and work on, their chosen subject area in greater detail than would be possible in the Systems Workshops.

Each SIG will have a coordinator, who will be charged with arranging meeting times and leading the group. People interested in doing this should post a message to `psu.systems.group'.

These SIGs may also produce lectures for the general Sysgroup population, and put them on videotape.

The most crucial SIG is CORE (Computer Operations Research E?) which concentrates on hardcore C programming, i.e. kernel hacking, device drivers, and similar sorts of systems programming.

This SIG is not intended to teach the basics of C programming (although the participants may work on such a workshop). Some of the work will be done as part of job duties with the Systems Staff; other projects may be strictly voluntary.

Job Descriptions

This chapter contains the job descriptions for each member of The Systems Staff. Some general information that applies to every member of The Systems Staff follows.

The Systems Staff consists of the following:

The Systems Manager (a full-time research assistant).
The Systems Programmer (also a full-time research assistant).
Part-time student employees (typically 4-8 people). Some of these may be assigned to other departments.
Graduate students working on Systems projects. One Teaching Assistant is currently assigned to the Systems Staff.
Volunteer systems people (highly variable, usually 5-20 people)

Except for the Systems Manager and the Systems Programmer, all job descriptions list all possible duties applicable under that title. When a person is hired, certain duties will be assigned out of these lists. This list may be renegotiated by both parties as necessary. It is quite possible for a person to have more than one job title which applies to them if they have assigned duties from several job titles.

The reason for this approach is that student employees have a wide range of skills and available working hours, and also because of natural turnover. The approach used here avoids having to rewrite job descriptions continually.

General Info

This section details the responsibilities and other vital information applicable to all members of the Systems Staff.

All members of the Systems Staff are responsible for following the policies within this document. In particular, responding to e-mail and emergencies is of paramount importance. See section Problem Response, for more info.

All members of the Systems Staff are expected to attend the weekly Sysgroup meetings and to provide a summary of work done over the week.

Also, all Systems Staff members will use the task system for tracking work done and in progress (See section Task System).

Any member of the Systems Staff may be exposed to the following working conditions. If necessary, reasonable accommodations can be arranged to avoid certain working conditions, provided the specific job description does not list them as a necessary condition.

Machine room conditions: high noise level, temperatures below room temperature.
Stringing cable; involves crawling under/around/behind furniture.
Lifting and carrying various pieces of computer equipment and, possibly, furniture.
Driving: for moving various computer equipment to/from other locations.

Any member of the Systems Staff may have any number of volunteers entrusted to their supervision.

Systems Manager

The Systems Manager is a full-time research assistant who manages all hardware and software in the department (see section Introduction, for more details) and the Systems Staff. The Systems Manager is responsible for ensuring that the policies within this document are followed by the Systems Staff, including the Systems Manager.

The Systems Manager works under the Faculty Systems Committee and will meet with them periodically (every 3-4 weeks).

The Systems Manager's duties are:

Maintain all computer equipment: computers (PCs, workstations, etc.), printers, networks, disk drives, tape drives, etc.
Recommend hardware purchases, repairs, and service contracts (as necessary).
Evaluate and recommend purchases of software and service, install, announce and maintain software and documentation.
Manage the entire Systems Staff (as specified in section General Info), including: hiring and firing, assigning jobs, monitoring progress, and evaluating work. The Systems Manager will work with the Tutor Supervisor regarding any personnel working as tutors.
Coordinate (with the Systems Staff and user community) and schedule responses to systems problems and emergencies. The policies which the Systems Manager and Staff must follow are specified elsewhere in this document; see section Problem Response and section E-Mail Response in particular.
Hold weekly meetings at which students, faculty, and others may voice their needs and concerns. The list of current systems projects shall be presented and discussed.
Meet, individually, with faculty members periodically to discuss their needs and concerns. These meetings should occur a few weeks before the start of each term.

The Systems Manager will work with the Electrical Engineering Systems Manager and Hardware Technician to maintain shared equipment. The Systems Manager will also work cooperatively with the Systems Staffs of other departments in the University, and contact shall be maintained with Systems Staffs of other institutions in the area (e.g. OSU, OGI).

Systems Programmer

The Systems Programmer is a full-time research assistant who develops, installs and maintains a wide range of system and application software. The Systems Programmer also administrates and co-administrates many systems.

The Systems Programmer must be an experienced Systems Administrator, and must be a proficient programmer, particularly in the languages suited to systems work (C, Perl, Bourne Shell, awk, sed, etc...), not to mention being skilled in installing and maintaining software.

The Systems Programmer works directly for the Systems Manager.

The specific duties are as follows:

Evaluate and obtain appropriate software, either via purchasing or the Internet.
Compile, install, test, and debug locally installed software.
Maintain a database of the locally installed software. This database should include: the purpose of the software, on what machine(s) it is installed, what versions are installed, where this software was obtained and where future revisions may be obtained.
Make sure that the documentation of the aforementioned software is up-to-date and available to the user community.
Administrate the Appletalk network and all machines connected to it. This includes installing and upgrading software, diagnosing network problems, and assisting the users of those machines.
Manage the department's UUCP connections: this includes setting up new connections and ensuring that current connections are functioning.
Manage the department's modem pool and terminal server: Installing new modems, diagnosing dialup problems and fixing them (if possible). The terminal server attached to the modems must also be maintained, upgraded, and have problems diagnosed. Gather statistics on modem usage, and survey users about modem service (as necessary). (See section Modems, for more info.)
The Systems Programmer will serve as Systems Manager when the latter is not present (on vacation, etc.)

The Systems Programmer will commonly have contact with the EE Hardware Technician, the EE Systems Manager, Telecommunications service, and System Administrators of other organizations (for UUCP).

Hardware Technician

The Hardware Technician maintains and repairs the various hardware which the CS Department has. This position is part-time (approx 10-20 hours per week).

The Hardware Technician should be able to diagnose common problems and replace components (e.g. flyback transformers).

The Hardware Technician will have to be able to work in all the working conditions specified above (see section General Info, for more info).

The Hardware Technician has the following duties:

Install, troubleshoot, replace or repair computer components, including monitors, keyboards, mice, disk drives, CPUs, etc. This includes not only replacement of entire units but also replacement of components within those units.
If a component cannot be repaired by the Hardware Technician, then she/he will recommend to the Systems Manager that the unit either be sent out for repairs, cannibalized, or disposed of.
Keep track of the inventory of spare parts kept by the Systems Staff.
Keep the university inventory lists up to date and conduct the annual inventory verification.
Do periodic maintenance on the department's hardware, as specified in section Routine Preventive Maintenance.

Some of the common contacts outside of the Systems Staff for this position are: The EE Hardware Technician, Physical Plant Electrician(s), Property Control Specialist, CS and EE Office Coordinator and Secretaries.

X Administrator

The X Administrator is a part-time (5-10 hours/week) position responsible for the operation and administration of the X terminals and the server machine(s) for them.

The X Administrator should be a proficient user of X windows, and be familiar with the overall structure of the system. Also, this person should have some system administration skills.

The X Administrator works directly under the Systems Manager, although it is possible that there may be more than one X Administrator, in which case they will also coordinate work with each other.

The X Administrator performs the following duties:

Install new X terminals: configure their hardware, connect them to the network, boot them, ensure that they can access the appropriate configuration and font information, and see that xdm is running on them.
Periodically verify that all the X terminals are functioning as specified in the previous item.

This person will work closely with the X Programmer(s); see section X Programmer, for more information.

X Programmer

The X Programmer is a part-time position responsible for maintaining the X windows software on all the machines that run it.

The X Programmer should be an experienced C programmer, an experienced X windows user and, preferably, have some knowledge of X programming.

The X Programmer answers directly to the Systems Manager.

The duties for the position are:

Maintain the organization of the X source tree.
Maintain the consistency of the installed X software on the CS systems.
Obtain X windows upgrades and install them.
Compile, install, troubleshoot and, if possible, fix X windows software.
Investigate reported problems with X software.
Submit proper bug reports to MIT when necessary.

The X Programmer will work closely with the X Administrator; see section X Administrator, for more info.

Assistant Systems Administrator

The Assistant Systems Administrators are part-time student employees who administrate small systems or co-administrate the larger CS machines. They answer directly to the Systems Manager. As the amount of time varies due to the different duties which fall under this category, the amount of time each duty takes is noted in brackets after the duty.

People in these positions must be experienced UNIX users and have either some experience or training in System Administration (see section Systems Workshop, for more info). Shell programming skills are also important to these positions.

Diagnose and correct (or refer to someone else) systems problems, see section Problem Response, for more info. []
Manage the Account System: this includes filing user account request forms, creating user accounts, diagnosing and correcting user account problems and maintaining the account software. The first two duties may be performed by Office Staff under this person's supervision. []
Monitor disk space: generate summaries of disk usage, notify users utilizing excessive space, assist users in managing their disk space. The first two items may be done automatically, if care is taken to ensure accuracy. []
Monitor the security of the CS Systems: monitoring the systems for security violations or intrusions, using or activating software to track suspected intrusions, and maintaining software for monitoring security. See section Security, for more information. The person assigned this duty is expected to uphold the highest moral and ethical standards, since the execution of these duties may expose the individual to sensitive or confidential material. []
Maintain backups of departmental systems. This involves organizing the backup media, ensuring that the backup software functions properly and on-schedule, and fulfilling restoration requests. This job requires periodic night work (at least once per term). [About 10 hours/week].
Install and configure computer systems (including hardware and software).
Administrate a system in its daily operations. This may involve coordinating work with other Assistant Systems Administrators who have related or overlapping duties.

Assistant Systems Programmer

The Assistant Systems Programmer is a part-time employee who works on specific programming projects. This may involve developing new software, or maintaining existing software. This person answers directly to the Systems Manager.

The Assistant Systems Programmer will be a proficient programmer, particularly in languages relevant to Systems Work (e.g. C, C++, Perl, Bourne Shell, awk, sed, etc.)

Evaluate and recommend software for departmental use.
Compile, test, debug, and install software packages. If necessary, submit corrections to the author(s).
Adapt existing software to new situations (e.g. add new options, features, etc.).
Develop new software as requested by the Systems Manager, using a language and coding style approved by the Systems Manager.

Keys and Building Access

Any member of the Systems Staff who has special access to any of the department's resources will be expected to follow certain guidelines.

After-Hours Access

After-hours access is only available on a special-case basis. There is no after-hours access to the Mill Street Lab, only to the Systems Office.

Anybody with after-hours access is expected to act responsibly with this privilege; those who cannot will be relieved of the burden.

Keys

The only people who will be given keys are those on University payroll (Student employees, Teaching Assistants, etc.). The rooms for which keys will be given are (in order of preference): Terminal Room, Systems Office(s), Machine Room, and Computer Science Office. Other keys (such as master keys) will be given out if they are necessary for the performance of the job.

Office Policies

Anybody who has access to the Systems Office is assumed to be a trustworthy and responsible individual. Anybody who has a desk in the Systems Office is expected to follow the following guidelines:

Other people's desks should be treated as you would want your desk to be treated. Everybody in the office should be aware of how others want their desk treated, and respect their wishes. In general, it's good to assume that people want their desks to be private.
The door may be unlocked or open while someone is in the Systems Office, but it must be closed and locked if no one is there.
The deadbolt should be thrown whenever the office will be unattended for a long time (e.g. at the end of the day).
A computer on your desk is not yours exclusively. Do not lock your screen. If you leave yourself logged in, your login may be used by other members of the Systems Staff for various reasons (checking print queues, running top, etc.).
If you use someone else's desk, make sure to clean up after yourself. In general, you should leave it as you found it.

Checking out Materials

Most of the manuals in the Systems Office may be borrowed, provided that they are checked out by someone known to the Systems Staff, and that the materials are listed on the check-out sheet.

Schedules

Anybody working on important projects or any paid Systems person should keep a schedule in their `.plan' file, and posted on the door to the Systems Office. This schedule should specify when they are in PCAT, in person. Also, a schedule should be given to the Systems Manager.

During a person's specified office hours (if they have any) the in-out board outside of the systems office should be used.

Problem Response

The top priority of the Systems Staff is the timely response to problems. Anything else we do is utterly meaningless (to the user populace) unless we ensure that all problems get responded to, and that our demeanour always suggests that we are genuinely interested in solving their problems.

The following sections detail how and when problems should be responded to.

E-Mail Response

As a member of the Systems Staff, it is vitally important that E-mail is used effectively, not only from a mechanical perspective (i.e. how do I read mail?) but also from a stylistic perspective, which is what the remainder of this document covers.

Due to the varied nature of sysgroup mailings, generalities cannot be avoided in describing them. Hopefully, after being in sysgroup for a while, everything described herein will become clear.

Local Procedures

To maintain optimum relations with faculty, students and any others, the following conventions must be followed. If you are unable to follow these conventions, you will be put into a position where you will not need to.

When a message comes to sysgroup, there should be an initial response (ACK) which will either indicate that you will be working on the problem, or that you have found a solution. This initial response should also contain a time estimate as to when the work will be completed. This initial message must be sent within the following time frames. (See section Systems Manager, for more info.)

Faculty: 1 day
Students: 3 days
Others: 1 week
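
The time frames above could be encoded for use in a reminder script. The following is only a sketch: the function name ack_deadline_days and the class labels `faculty' and `student' are hypothetical, and nothing here is part of any existing Sysgroup tool.

```shell
# ack_deadline_days CLASS
# Print the maximum number of days allowed before the initial ACK,
# using the time frames listed above. The CLASS labels are assumptions
# made for this sketch.
ack_deadline_days() {
    case "$1" in
        faculty) echo 1 ;;
        student) echo 3 ;;
        *)       echo 7 ;;   # "Others: 1 week"
    esac
}
```

For example, `ack_deadline_days student' prints 3.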

When you send an initial response you are implicitly taking the responsibility for seeing that task through to its completion. This does not mean you have to resolve the problem yourself, but if you are unable to, you are responsible for finding someone who is able to resolve it, and for seeing that the problem is, ultimately, resolved.

Unless you don't know the answer to a person's problem, or you know that someone else will respond, you should never ignore a sysgroup message. It is our collective responsibility to maintain an optimum response time.

If the project takes a long time, status reports should be sent periodically. These status reports should be sent to the involved individuals and to sysgroup.

When the problem is corrected, a message should be sent informing the involved individuals that you think it is fixed, and asking them to make sure it is to their satisfaction.

Graphically, the process is like this:

     Mail -.----------------,-----------------,-> Final
            \              /                 /
             \            /                 /
              `----> ACK --------> Status -<
                               (            )
                                `----------'

General Etiquette

Most of the items in this section are taken directly from various USENET etiquette guidelines(2).

Keep your .sig short, less than 4 lines.
Keep your lines under 80 characters, and under 72 if possible. (Most editors have a fill or format mode that will do this for you automatically.)
Right justified text may look "prettier" in some sense, but it is almost always harder to read than leaving ragged right margins; don't justify your articles.
Most special control characters will not work for most readers. In fact, the space character is about the only one you can be sure will work consistently. Even tabs aren't always the same from machine to machine, and should be avoided. Many mail agents will strip or remap control characters.
Messages in a single case (all upper or all lower) are difficult to read. However, all upper is far worse than all lower.
Subtlety is not communicated well in written form - especially over a computer. The same applies to humor.
References need to be made. When you answer mail, you have the original message fresh in your mind. When I receive your answer, I don't. Most mail readers have a facility for copying blocks of the message you are responding to; please use it whenever necessary.
Make an effort to spell words correctly. Obvious misspellings are jarring and distract the reader.
Keep paragraphs short and sweet. Keep sentences shorter and sweeter. This means "concise," not cryptic.
White space is not wasted space -- it greatly improves clarity. A blank line only adds a byte to the article length, so don't be stingy if it will help make your meaning clearer.
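
The line-length guideline can be checked mechanically before a message is sent. The following is a minimal sketch, assuming the draft is saved in a file; the function name long_lines is hypothetical.

```shell
# long_lines FILE [MAX]
# Print "LINE_NUMBER: LENGTH" for each line of FILE longer than MAX
# characters. MAX defaults to 72, the preferred limit given above.
long_lines() {
    awk -v max="${2:-72}" 'length($0) > max { print NR ": " length($0) }' "$1"
}
```

Running `long_lines draft.txt 79' would report only lines that break the harder 80-character limit; no output means the draft passes.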

Sysgroup Conventions

These are some E-mail guidelines specific to anybody responding to messages on sysgroup:

Before replying to a sysgroup message, make sure that nobody else has yet responded, i.e. read all E-mail before replying to any messages, or read your E-mail backwards.
Make sure that any replies are sent to everyone who received the mail originally, so that everyone will know that someone responded.
Use the reply function of your mail reading program so that proper `Subject' and `In-reply-to' lines are generated. This is for automated software which monitors Sysgroup mail (see section Task System for more information).
Do not just say "it's fixed" (unless the solution was obvious). Give an explanation of what went wrong and how you fixed it.
But, after providing a detailed description of the problem, make sure to include a bottom line, i.e. summarize what they should/shouldn't do, etc.
Don't include a `.sig' (nothing beyond a single e-mail address) in mail to anyone on campus. Including it on off-campus messages is OK. Either way the 4 line rule still applies (q.v.).
Never insult anyone, directly or indirectly. Some examples: "check your facts before posting.", "Jane, you ignorant slut!" or "RTFM" are inappropriate. The second one should just be omitted (unless you are Dan Aykroyd); the other two can be replaced with references to specific manual pages.
If you include the message you are responding to, include only the relevant portions.
Do not include the sender's .sig in the included text, unless you are responding to part of it (i.e. pointing out an incorrect e-mail address).
Verify that your solution works before proclaiming the problem solved.
Do not guess. If you are not sure of a solution, either research it yourself or ask someone who knows.
If the best you can do is guess, make sure to state that you are guessing in your reply.

Task System

We are using a mail monitoring tool called GIPR (3) to handle task management. It intercepts any mail going to Sysgroup and stores and indexes each message. This way, when someone sends a problem to Sysgroup, GIPR initiates a 'task'. Each time someone sends a related message it is stored with the original 'task'. When a task is finished, a message is sent, replying to the person who sent the original problem, marked as a 'resolution', and then GIPR removes it from the list of tasks to be done.

The first person to respond to a message will be assigned responsibility for the task (although this can be reassigned later).

Since most operations can be done from a mail-reader and messages are coordinated by their `In-Reply-To:' fields, this means that you should always keep a message relevant to a task you are working on.

Keywords

Most operations on the task system should be doable from any mail-reader. Control information for GIPR is encoded into the subject line as keywords between square brackets.

`[IGNORE], [I], [NOTICE]': Cause GIPR to completely disregard the message. There are a number of other keywords for doing this, but they are to prevent NW-Net outage messages from becoming tasks.
`[RESOLV], [RESOLVE], [RESOLVED]': Resolve this task.
`[RESP]': Take responsibility for this task.
`[URGENT]': Notify the on-call person (via klaxon or pager) that there is a major problem.
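
To illustrate how a subject line maps onto these keywords, here is a small sketch. It is not GIPR's own code: the function name subject_action and the action names it prints are hypothetical, and it assumes keywords are upper case and are checked in the order listed above. How GIPR itself treats a subject carrying several keywords is not specified here.

```shell
# subject_action SUBJECT
# Print the action implied by a bracketed keyword in SUBJECT:
# ignore, resolve, responsible, urgent, or none.
# Quoting the brackets makes the shell match them literally.
subject_action() {
    case "$1" in
        *'[IGNORE]'*|*'[I]'*|*'[NOTICE]'*)         echo ignore ;;
        *'[RESOLV]'*|*'[RESOLVE]'*|*'[RESOLVED]'*) echo resolve ;;
        *'[RESP]'*)                                echo responsible ;;
        *'[URGENT]'*)                              echo urgent ;;
        *)                                         echo none ;;
    esac
}
```

For example, `subject_action "Re: printer down [RESOLVED]"' prints resolve.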

Task Priority

The specifications for the priority field were assigned by the Faculty Systems Committee, and, thus, it is very important that they be followed.

The priority field consists of a letter `A-G' and a digit. The letter signifies the broad priority category. The number specifies a relative priority within the priority category.

`A': Emergency work: keeping machines up and usable.
`B': Faculty-Committee requested work.
`C': Work requested by faculty/staff.
`D': Work requested by students.
`E': Systems work. These tasks are generally behind-the-scenes jobs which have no obvious benefits for the user population, but can prevent future crises.
`F': Other work (for guests and off-campus people).
`G': Ongoing work: tasks that are never finished, or have to be done periodically.

These priorities are set by putting a line like `Priority: pri' anywhere in your message.
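
A priority line can be checked before a message is sent. The sketch below is an illustration only, not part of GIPR: the function name message_priority is hypothetical, it assumes the `Priority:' text starts its line, and it accepts the letters A through G, which covers every category in the list above.

```shell
# message_priority FILE
# Print the value of the first "Priority:" line in FILE if it is one
# category letter (A-G) followed by one digit. Otherwise print nothing
# and return 1.
message_priority() {
    pri=$(sed -n 's/^Priority:[[:space:]]*\([^[:space:]]*\).*/\1/p' "$1" | head -n 1)
    case "$pri" in
        [ABCDEFG][0-9]) echo "$pri" ;;
        *)              return 1 ;;
    esac
}
```

The letters are spelled out rather than written as a range so that the match does not depend on the locale's collation order.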

GIPR Command

Many operations can be done on the task list by using GIPR at the command line. Ideally, few of these should ever be used.

Here is a summary of the more important command line options (in approximate order of importance):

User Option: -l [task numbers]

List active tasks.

User Option: -s [task numbers]

Show a given message.

User Option: -p [task numbers]

Purge a message which should never have been made a task. This should not be done lightly.

User Option: -r [task numbers]

Resolve the given task(s). This should not be used, in general.

User Option: -a [base task number] [task numbers]

Attach the given task(s) to the given base task. This is mainly used to aggregate related tasks together, and to make up for brain-damaged mailers.

User Option: -M [task number]

Return an `In-Reply-To:' line for a given task. This is used when the original message has been lost/erased/&c.

User Option: -E [task number] [estimate date]

Change a task's estimate date. This should not be used, in general.

User Option: -P [task number] [priority]

Change a task's priority. This should not be used, in general.

Emergency Handling

When there is an emergency to deal with, and the Systems Manager is not present to handle it, those present must determine who is in charge. The order of succession in such a situation is:

Systems Programmer or any other full-time Systems person (including EE).
Student Systems Administrators.
Student Systems Programmers.
Tutors
Systems Volunteers

In any case, a person with keys has preference over persons without.

If the person in charge cannot handle the situation (particularly if the person is in one of the latter two categories), their first responsibility is to find someone who can (via phone, pager, talk, etc.).

Pager Policy

When none of the previous methods for handling a problem work, it may be necessary to page someone. Currently we have only one pager; its number is 299-9490. Below are some notes for both the person with the pager and the one doing the paging.

Make sure this is a problem worthy of paging, and that other avenues of solving the problem are exhausted before continuing.
Make sure you are on a touch-tone phone, so that you can enter your phone number to be displayed on the pager.
Any page which does not supply a valid phone number will be ignored.
Make sure that the person can get in contact with you when they call, i.e. stay by the phone and keep the line open.
It may take a while for the on-call person with the pager to respond, depending on where they are. Be patient.
If multiple pages are received before the person can respond, they will assume that it is a dire emergency. Do not do this lightly.

Eventually, it would be nice to set up a priority/event code into the area code, so that the on-call person will have an idea of the importance and meaning of the page. If you have any ideas for this please let me know.

System Configuration

This is an outline of what a generic workstation in the CS department will be like. This configuration will ease administration of the increasing numbers of workstations, and make the computing environment more consistent between workstations.

Deviations from this are possible, but will have their costs (either in decreased service, or in local SW maintenance).

Generic Hardware

Some sort of Sun Sparc (IP[XC], ELC, 1+, 2, 10, etc.) with at least enough disk space to hold the OS (~400 megs).

Generic Filesystems

The filesystems `/', `/usr', `/var' and the like will be local.

One or more local filesystems for a faculty member's use may exist with the naming convention of `/home/hostname'/food name. All other filesystems will be taken care of via the automounter.

`/usr/local' (including `X11') will be mounted from rigel, sirius or xavier.

Generic Accounts

Accounts will be distributed via NIS (YP), and all home directories will be mounted from rigel.

Depending on the application, some or all users may be excluded from the workstation (although NIS will still run).

Generic Mail

Mail will not be handled locally, i.e. mail sent to a workstation will be forwarded to rigel.

Rigel's mail spool will be mounted such that mail can be read locally.

Generic Networking

routed will run, to lessen dependence on gateways.

Generic Checklist

Set the root passwd
Set up the local filesystems: Non-OS filesystems should be named with the convention `/home/hostname'/food name.
Set up NFS/automounter:
- Add the machine to the exports and netgroup files, as appropriate.
- Make the mountpoints. First generate a list of directories to make (using the GNU versions of these commands): `du -x /home | awk '{print $2}' | tac > /tmp/y'. Then copy that file to the new machine and do the following: `a=`cat /tmp/y`; mkdir $a'.
- Copy the automount files from `cs.pdx.edu' and uncomment the appropriate entries.
- Modify the automount command in `/etc/rc.local' to be the same as other sparcs.
- Add this workstation's filesystems to the automount file on `cs.pdx.edu'
- Add the local filesystems to `/etc/exports'
Set up the printcap files: copy the appropriate one from `/src/Admin/netadmin/printcap'. Then run `make dirs' in that directory.
Copy `/etc/resolv.conf' from a similar workstation.
Copy `/etc/sendmail.cf' from a similar workstation.
Put the right people into group wheel
Set up the sudo directory: `/usr/adm/sudo'
Set up our local version of man.
- Move `/usr/ucb/man', `/usr/ucb/whatis' and `/usr/ucb/apropos' aside.
- Make links to `/usr/local/bin/man'
- Move `/usr/etc/catman' aside.
- Make link to `/usr/local/etc/catman'
- Rerun `catman' on the system man directory
Copy the following files from `cs.pdx.edu': `/usr/lib/Mail.rc'
Install the encryption kit and any OS patches.
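
The mountpoint step in the checklist above can also be sketched in Python. This is only an illustration, assuming `du'-style input (a size, whitespace, then a directory path on each line). The names `paths_from_du' and `make_mountpoints' and the `root' parameter are hypothetical and not part of any Sysgroup tool.

```python
import os


def paths_from_du(du_output):
    """Extract directory paths from du-style output, parents first.

    du lists children before their parents, so the list is reversed
    (the job `tac' does in the shell pipeline).
    """
    paths = []
    for line in du_output.splitlines():
        parts = line.split(None, 1)  # size, then the rest of the line
        if len(parts) == 2:
            paths.append(parts[1])
    paths.reverse()
    return paths


def make_mountpoints(paths, root="/"):
    """Create each directory under root; existing directories are left alone."""
    made = []
    for path in paths:
        target = os.path.join(root, path.lstrip("/"))
        os.makedirs(target, exist_ok=True)
        made.append(target)
    return made
```

Passing a scratch directory as `root' allows a dry run before touching the new machine.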

Root Folklore

This section details various folklore and guidelines for people who have root access on the CS machines.

Root Etiquette

In an environment where many people have root access, it is very easy to step on other people's toes. There are two paradigms for avoiding this:

Set up regulations, guidelines and possibly programs to prevent this from happening. This document is an example of the former, and the program vipw is an example of the latter.
Watch where you are stepping so as to avoid stepping on others' toes.

The approach taken in the Sysgroup is a combination of the two preceding paradigms. Some guidelines will be detailed hereafter, which should give an idea as to what some of the common problem areas are.

Beyond these guidelines, anybody with root access is assumed to be conscientious and sensitive enough to avoid most problem areas.

Do not make any configuration changes to a machine you are not directly responsible for. If you must, make sure that the person responsible for the machine is informed (this is in addition to the standard logging).
Don't change the root password without approval from the primary rooter and the Systems Manager. Notify all rooters of the change.
Use sudo whenever possible, and use it for single commands when possible.
Never log in as `root', unless absolutely necessary.
Respect other users' (esp. professors') privacy. Do not look at others' files except as necessary for keeping the system running, and treat any sensitive information that is seen in this process appropriately. See section Account Security.

Root Resources

There is a variety of sources for information and assistance when confronted with system problems.

There is a great deal of information in `/home/rigel/sys/group'; the notable files are:

`form-letters/rhosts': The standard form letter to use when removing someone's `.rhosts' file.
`form-letters/background': The standard form letter to use when killing runaway jobs.
`doc/inet-access': Information for people who want Internet access but are not eligible for a CS account.
`name-themes/*': Various ideas for naming groups of machines. Contributions are welcome.

Also there are a number of local tools for diagnosing and fixing systems problems. They may be located in any of the following directories, but the first two are definitely preferred:

`/usr/local/etc'
`/home/rigel/sys/group/bin'
`~/trent/bin'
`~/trent/src/perl'

`gipr': See section Task System, for more information.
`chkaddr': Recursively resolve mail addresses.
`chkps': Find processes run by people who are not logged in.
`find-rhosts': Find and/or remove `.rhosts' files. (Uses the aforementioned form letter).
`whenwho': Determine who was logged in at a given time.
`mkhome': Make a home directory for a user (with startup files).
`lsof': List open files (to find errant processes, &c.)

Common Problems

This section details various common problems and fixes for them. This is not (and can never be) all-inclusive. Some related information can be found in section Modems and section Security.

Account Problems

There are a number of things that can go wrong with a user's account.

One of the most common is a forgotten password. There are two ways to handle this:

Copy their password from a machine they remember the password for.
Meet them in person and have them change their password after verifying their identity.

Another common problem is a person's startup files getting messed up. The files which can cause the greatest problems are `.cshrc', `.login', and `.xsession'.

Sometimes a user's home directory will have incorrect permissions (via `chmod' experimentation) or incorrect ownership (via SysAdmin oversight).

Another occasional problem is the password file on a YP server and its clients getting out of sync. There should be a program called `makeyp' which will push changes out; if not, run `make' in `/var/yp'.

Printer Problems

- printer queues

A common problem (especially after a reboot) is the printer daemon silently dying. This can happen on any machine running `lpd', although it is most common on Suns. `lpq' will indicate this with `no daemon present'. If there is no `lpd' running on the machine, start one up.

Another strategy for dealing with misbehaving printer daemons is to kill all the `lpd''s and restart one (`/usr/lib/lpd').

Sometimes the print queue jams, i.e. the active job has printed but it never gets removed from the queue. In this case use `lprm' to remove the active job.

Process Problems

You can use `chkps -e root' to find jobs belonging to people no longer logged in. If they are not `nice''d you may kill them, and notify the people that you did so. There is a form letter for this.
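
The check described above can be sketched as follows. This is not the source of `chkps' (its options are not documented here); it only illustrates the idea, assuming the process list and the set of logged-in users have already been collected, for instance from `ps' and `who'. The record layout and the names `Proc' and `orphaned_jobs' are hypothetical.

```python
from typing import NamedTuple


class Proc(NamedTuple):
    pid: int
    user: str
    nice: int  # a value above 0 means the job has been nice'd


def orphaned_jobs(procs, logged_in, exclude=("root",)):
    """Return (proc, may_kill) pairs for users who are not logged in.

    may_kill is True only when the job is not nice'd, following the
    guideline above; the owner should still be notified afterwards.
    """
    result = []
    for proc in procs:
        if proc.user in logged_in or proc.user in exclude:
            continue
        result.append((proc, proc.nice <= 0))
    return result
```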

Pty Problems

Sometimes a runaway process will cause a pty to misbehave. Use `chkps' as above to find and kill any offending programs. If that doesn't work, try to determine the next free pty and use `lsof' to find the misbehaving processes.

Mail Problems

The program `chkaddr' can help find mail loops (via `.forward' files).
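
How a forwarding loop can be detected is sketched below. It is a simplified illustration, assuming each address forwards to at most one other address and that the forwarding information has already been read into a dictionary. The function name `resolve_address' and the data layout are hypothetical; the real `chkaddr' may work quite differently.

```python
def resolve_address(addr, forwards):
    """Follow forwarding entries until a final address is reached.

    forwards maps an address to the address it forwards to.
    Raises ValueError when the chain revisits an address (a mail loop).
    """
    seen = [addr]
    while addr in forwards:
        addr = forwards[addr]
        if addr in seen:
            raise ValueError("mail loop: " + " -> ".join(seen + [addr]))
        seen.append(addr)
    return addr
```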

System Change Policy

This chapter details the guidelines for altering hardware and software which is in general use in the department.

Hardware Change Policy

All significant hardware changes should be preceded by a full backup to prevent any data loss due to the change. See section Backups. The following sections also apply to hardware modifications, as they typically require downtime. See section Planned Downtime and section Routine Preventive Maintenance, for more information.

Software Change Policy

Test the software in question before installation or install it on an isolated machine for testing. This is particularly important if the software will affect large numbers of people (e.g. changing a login shell).

Changes to heavily used pieces of software require that the entire user community be notified. For widely used software, post notification to msgs. Otherwise, send mail to the user(s) of the software.

After the software is installed it should always be tested again. If testing is difficult due to a lack of understanding of the software (e.g. specialized computer languages), the person who requested the software installation should be solicited for assistance.

After the software is installed, another message should be posted specifying the changes made, and the extent of the testing.

Either the person who made the changes, or someone familiar with them, should be on hand during the next day to assist with any problems that may arise.

The old versions of the software should be kept in case problems with the new version arise.

Record Keeping

Any time any modification is made to a machine, or any significant event occurs (e.g. a system crash), a record should be kept of it.

After much consideration, it seems as though the most difficult part of getting people to keep records is making the system convenient. The following system is the simplest I have been able to work out.

A set of mailboxes, located inside the Systems Office, is dedicated to specific system information. Any information (logs, notifications, packing slips, etc.) about a machine should be put in the appropriate box. Make sure that the information is dated and specifies who wrote it.

Downtime Policy

The following sections specify the policies for the various types of system downtime: Planned, Unplanned, and Emergency.

Planned Downtime

Any system downtime should be announced well in advance of the planned date. The amount of lead time should be proportional to the amount of (projected) downtime and to the importance of the machine. Any general use machine should get at least one week's notice.

Downtime announcements should specify the following:

What machines and services (e.g. printers, software, etc.) are affected, and suggested alternatives to them.
When the downtime will occur, and when the machines will be available again. (Be liberal with estimates, to allow for Murphy's Law.) The ideal period for downtime is between midnight and 8am.
Why the downtime is happening.

If you receive mail from a faculty member who needs the machine during this time, make the utmost effort to reschedule or arrange alternate resources for them.

Unused or single-user machines (diskless workstations, workstations which provide service to one person only) may be shut down without notification provided the machine is currently unused (and probably won't be used soon), or the user of the machine agrees to the downtime.

Two time slots are reserved for times when any system may be shut down with very short notice. These time slots are:

10:00pm Wed -- 1:00am Thu
6:00am Sun -- 10:00am Sun
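
Whether a given moment falls inside one of these slots can be checked mechanically. The following is a minimal sketch, assuming local time and treating the end of each slot as exclusive; the function name `in_short_notice_slot' is hypothetical.

```python
from datetime import datetime

# datetime.weekday(): Monday is 0, so Wednesday is 2, Thursday 3, Sunday 6.
WED, THU, SUN = 2, 3, 6


def in_short_notice_slot(when):
    """True if `when` lies in Wed 10pm to Thu 1am or Sun 6am to 10am."""
    day, hour = when.weekday(), when.hour
    if day == WED and hour >= 22:   # Wednesday 22:00 to midnight
        return True
    if day == THU and hour < 1:     # Thursday midnight to 01:00
        return True
    return day == SUN and 6 <= hour < 10
```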

Unplanned Downtime

The Tutors (x4023) should always be informed of any crashes, and given a time estimate of recovery. Since they are the first contact for the user population, this is important.

Also, there will be white boards in the terminal room and on the door to the Systems Office for systems messages (e.g. `Eecs down, hardware problems, running diagnostics, back up by 12:30 (hopefully)'). If necessary, another one may be located outside the CS office.

Emergency Downtime

There are situations when a machine may need to be rebooted either due to software failures or the entire OS locking up.

When a machine is in a state where it is no longer functioning properly for a majority of its users, and rebooting it is the only option, it should be rebooted. For example, if all the nfs daemons are dead on a diskless client server, which means none of its diskless nodes or YP clients will work, it is time to reboot.

If at all possible, attempt to perform a shutdown in an emergency situation; 5-10 minutes should be enough lead time. Make sure to mention (via shutdown) that this is an emergency reboot.

If the entire OS is locked up (and you have double-checked) do not hesitate to crash the system and reboot.

Routine Preventive Maintenance

Certain routine maintenance must be performed on the various computer hardware in the department.

As of yet, we have not determined all the hardware needing service and the frequency of such service.

Check supplies: toner cartridges, printer ribbons, line-printer paper, spare terminals, cables, etc.
Vacuum out all workstations, X-terminals, and PCs in the terminal room or in offices.
Clean out the line printer: vacuum out paper dust, make sure fans are clean.
Clean out the laser printers: vacuum out paper dust, make sure fans and filters are clean.
Lubricate the line printer.
Clean screens and keyboards.

Weekly

Bi-Weekly

Monthly

Quarterly

Annually

Security

Since we are on the Internet, it is vitally important to maintain security on all of our systems. There are crackers constantly wandering around looking for new playgrounds or bases of operations for themselves. This means that if we are broken into, we could be a launching point for attacks on other sites.

Security Checklist

This section details the security plan for the machines managed by Sysgroup. For any system to be considered for addition to the trusted host cluster (i.e. be in `/etc/hosts.equiv'), it must pass all of the following points.

Install the latest version of the OS and have all security patches installed.
NFS Filesystems should be exported with the minimum permissions required to function. `suid' or `root' options should only be used where absolutely necessary, and only to hosts in the trusted host cluster.
Enforce a password policy. Run `npasswd'. Run a password cracker periodically (`cops').
Do not permit group accounts or account sharing.
Accounts without passwords should have stripped, statically linked binaries as their shells.
Use shadow passwords where possible.
Expire unused accounts on a regular basis. See section Account Policies, for more info.
Checksum common utilities, suid programs, and dynamic libraries. Check for new suid programs (`cops' or `tripwire').
Check for world writable files in system directories (`cops').
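
The last item in the checklist can be illustrated with a short script. This is a sketch only and not a replacement for `cops'; the function name `world_writable' is hypothetical. It skips symbolic links and reports directories as well as files, so sticky directories such as `/tmp' would need to be filtered out by hand.

```python
import os
import stat


def world_writable(root):
    """Return a sorted list of paths under root that are world writable."""
    hits = []
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            path = os.path.join(dirpath, name)
            try:
                mode = os.lstat(path).st_mode
            except OSError:
                continue  # vanished or unreadable; skip it
            if stat.S_ISLNK(mode):
                continue  # permissions on a symlink itself are not meaningful
            if mode & stat.S_IWOTH:
                hits.append(path)
    return sorted(hits)
```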

Account Security

This section details what should be done when an account is left logged in(4). It is vitally important to follow these procedures because we all make mistakes and we hope that the person who comes upon our account is as nice as we were when we found someone else's account.

When confronted with an account that has been left logged in, one usually feels an incredible rush of power. You can now do anything you want to that account. You can tinker with the account's files. You can send all sorts of mail to other people from it. You can probably even crack into other systems from it. And, best of all, you can probably get away with it, too, if you're careful.

Or you may feel like teaching this person a lesson that will not be forgotten in a hurry. You can tinker with the settings, change their window setups, come up with what appears (at that time) to be cute changes to their tty driver, aliases, shell scripts, etc. In effect, humiliate the user so that they don't ever, ever leave themselves logged in again.

Before doing anything, consider the following:

When a systems person tampers with a user's settings, we are violating our promise to give users as reasonable a right to the privacy of their files as possible within the framework of our computer facility.
It also shows a tremendous lack of professionalism on our part. By indulging in juvenile pranks at the expense of our users, we are setting a horrible example to our user community and hence undercutting our efforts to come up with a legitimate computing facility.
A humiliated user will usually not forget this in a hurry. While that may be our intention, such a user may well become an extremely pissed-off user, who may seize the chance when one of US is left in a vulnerable position and exact some form of revenge which may not be so cute or funny.
This sort of behavior also tends to get out of hand as people with various axes to grind indirectly take out their conflicts with others by "nailing them good" when they leave themselves logged in. This is not the appropriate platform to take punitive action. If there are conflicts that need resolution, they should be dealt with directly through the Systems Manager.
A well-intentioned joke can (and has in the past) cause a great deal of inconvenience to the subject, as they may be trying to log in to finish something in a hurry and be delayed, sometimes critically.

We are in the business of running a computer center. We are trying to create a documented list of rules and regulations so that everyone knows what is expected of them and what happens if they break those rules. Any punitive action will be undertaken by the Systems Manager. Individual Sysgroup members may prevent someone from breaking a rule but may not render justice on the spot.

When faced with this situation, the following is the appropriate action to take:

Send mail to the user (preferably as that user), informing them that they left themselves logged in and listing some of the dangers of leaving an account logged in (we should probably have a boilerplate form letter in `/folklore' or someplace that can be read in).
MAKE SURE that you identify yourself in this letter. Cc the letter to the Systems Manager. No smart-ass comments. Just the facts in as neutral a manner as possible.
Then log the user out. End of incident.

If a user keeps doing this, the Systems Manager should be made aware of it so that the user may be dealt with personally. This is a chance to see if this is genuine ignorance on the user's part (which can be rectified) or sheer obstinacy. In the latter case, after repeated chats, their accounts should be de-activated and they will be referred to the Systems Committee for further action.

If the account in question belongs to a Faculty member, they should be notified, as usual, after which the Systems Manager should have a chat with the person. Any problems after this will be sent directly to the Systems Committee with no other action.

If the account in question belongs to a member of the Systems Staff, they will be dealt with much like Faculty, except that all action will take place internally. It is possible for a person to be dismissed from the Systems Staff for persisting.

Non-public Machines

Note that policies for Systems Offices are laid down elsewhere; see section Office Policies, for more info.

People's offices and desks are their own domain and they can do whatever they feel fit with them. Tampering with a person's login session in this situation will be considered equivalent to rifling through their desk. If you have a key to someone's office, it is assumed that you can treat the privilege responsibly and ethically.

The paragraph above is very important. There are many systems people who have keys to these offices and they are all considered trustworthy. If they prove themselves not to be, they will lose both their keys and their job.

Some people may give others permission to use their workstations. In this case, the rules for logging out are governed by that personal agreement. In the absence of such an agreement, no one touches a private workstation, whatever the reason. Allowing people to use a window to check on the load average or status of a system is a convenience that has to be personally authorized by the owner of a workstation. It is not an implied right.

Backups

The Backup Schedule is as follows:

Daily: Daily backups are done Sunday thru Friday; Level 9. A daily backup covers all changes since the last weekly backup. Daily backups are cycled weekly.
Weekly: Weekly backups are done on Saturday; Level 1. A weekly backup covers all changes since the last monthly backup. Weekly backups are cycled monthly.
Monthly: Level 0. A monthly backup covers everything. Monthly level 0 backups are done manually with machines on-line. Monthly backups are cycled monthly.
EOT: This is the end of term backup; Level 0. EOT level 0 backups are permanent backups on new tapes. Important machines, e.g. Rigel, are backed up in single-user mode. See section Planned Downtime, for more info. EOT tapes are never cycled.

All backups are recorded and logged for future reference. Vitally important backups should be stored off-site.

File restorations are done in the following order:

Restore from the latest EOT or Monthly tape.
Restore from weekly tapes since EOT or Monthly tape.
Restore from daily tapes since last weekly tape.
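
The restore order above can be sketched as a small function that picks the tapes needed for a given date. This is an illustration only, assuming each tape is described by a kind (`eot', `monthly', `weekly' or `daily') and the date it was written; the function name `restore_chain' and this layout are hypothetical.

```python
from datetime import date

FULL_KINDS = {"eot", "monthly"}  # the level 0 backups


def restore_chain(tapes, target):
    """Return the tapes to restore, in order, to recover state as of target.

    tapes is a list of (kind, date_written) tuples.
    """
    usable = sorted((t for t in tapes if t[1] <= target), key=lambda t: t[1])
    fulls = [t for t in usable if t[0] in FULL_KINDS]
    if not fulls:
        raise ValueError("no level 0 tape on or before the target date")
    base = fulls[-1]  # latest EOT or Monthly tape
    weeklies = [t for t in usable if t[0] == "weekly" and t[1] > base[1]]
    since = weeklies[-1][1] if weeklies else base[1]
    dailies = [t for t in usable if t[0] == "daily" and t[1] > since]
    return [base] + weeklies + dailies
```

Since each level covers all changes since the next lower level, the most recent weekly and daily tapes may well be sufficient on their own; the sketch follows the order as written.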

The following are some guidelines for using any of the exabyte tape drives:

Send mail to `exabyte' before and after using tape drive.
To prevent confusion, eject your tape when done; the program `mt' will do this.
Don't trust the tape to be in the proper position after a day or two; rewind and forward-space the tape for incrementals.
Write protect appropriate tapes.
Group `operator' will own the tape devices, and will have write permissions on them. Therefore to use the tape drive you will have to be in that group.

Software Policies

This chapter details the policies for the installation, storage, and organization of software within the CS department. Some other relevant information can be found in section Software Change Policy.

Software Support Policy

This section details the guidelines for installing software on CS systems. Before installing any software be familiar with, and be prepared to follow, our software change policy, see section Software Change Policy for more info.

Types of Software

There are two types of software on a computer system: the vendor-supplied OS and systems software, and locally installed software.

The former should be located in the directories /bin, /usr/bin, /usr/ucb, and possibly others. All of these directories must be in every user's path. Efforts will be made to make this set of software consistent between various machines, by augmenting or replacing programs.

The latter is what this section concerns itself with.

Local Software Categories

In order to maintain software in a controlled way, software will be categorized according to the amount of effort Sysgroup will put towards its installation and maintenance. The three categories are described below, both in terms of what kind of software is covered, and in terms of what support can be expected.

Supported
Commercial
Unsupported

Supported

Software in this category must be general purpose, receive widespread use, and be stable, and it must be possible to make it available on all supported architectures in a uniform manner. This software should also not be a redundant service. The terms used above are defined, for our purposes, as follows:

General purpose: It fulfills some generic function. This software should be appropriate to install on all systems. Some examples: editors (emacs), formatters (TeX), programming tools (RCS) and mail/newsreaders (ELM, RN).
Widespread use: This means that it gets used by a large percentage of users in the department.
Stable: We can expect it to be around for a while and it is of reasonably high quality. This criterion keeps us from installing fad software that dumps core frequently (or fails in other inelegant ways).
Uniformity: The same version of the software should be available on all general purpose CS systems.
Redundancy: To eliminate confusion and possible problems, there should be as little redundancy as possible (e.g. only one `standard' news reader).

Software in this category will be upgraded periodically, and when necessary for normal operation (see section Software Change Policy). When software in this category fails, every effort will be made to diagnose the problem and either work with the maintainers of the software to fix it, or fix it ourselves and supply the fix to the maintainers.

Commercial

This is software which is supplied by some commercial entity. It will, in general, be treated similarly to supported software; however, we do not have source and are thus at the mercy of the supplier's support organization. The reason for this distinction is to make it clear to the users that if the software breaks we may be unable to fix it and will have to wait for the supplier to fix it.

Unsupported

This is any software which does not meet the preceding criteria. This may also be software which is being evaluated for possible future supported classification.

Little effort will be given to such software, unless a Sysgroup member has some spare time to spend maintaining it. If the software fails, Sysgroup will not be obligated to do anything. Volunteers who wish to work on the software will be given necessary assistance (e.g. manuals, assistance in installing it, &c).

Software Installation

This section describes how and where local software should be installed.

Software Organization

Any local software should be installed under the `/usr/local' hierarchy. Small software packages (as measured by the number of binaries) which are supported will be installed in `/usr/local/bin', with their libraries in `/usr/local/lib' and man pages in `/usr/local/man'.

Unsupported software will be installed in a separate hierarchy (i.e. with its own `bin', `lib' and `man') so that its support status is obvious. This hierarchy will be located in `/usr/local/uns'.

In the case of X software which uses imake, rather than using xmkmf (which will try and put everything with the standard X distribution) you should use the following command:

imake -DUseInstalled -I/usr/local/X11R5/lib/X11/config-uns

This is essentially what xmkmf does, with `-uns' attached.

Packages

Large software packages will be put in a separate hierarchy of their own. There are several benefits to this scheme.

This keeps `/usr/local/bin' uncluttered, which will make searches in $PATH (by the shell) faster.
Duplicate command names can be managed.
Management of packages is simplified by isolating them (e.g. related files can be quickly located).
Changes to packages are isolated so that changes are less likely to affect other packages.

These hierarchies will contain `bin', `lib' and `man' directories (with the latter being an entire man hierarchy).

Note that in the case of large packages, there is no distinction between support levels made via location of said package. This distinction will have to be made elsewhere.

As a result of these packages, users will have to have an easy means for customizing their $PATH variable. All of the supported packages will be in everybody's default path. Unsupported programs can be easily added by setting a variable before the system default `.cshrc' is sourced.

Software Documentation

Any software installed on the CS Systems in one of the standard $PATH directories must have a man page installed in the appropriate `man' hierarchy.

Any other documentation (e.g. papers, tutorials, etc.) should be stored in the Systems Office. If appropriate, the tutors should also be given a copy of it.

Supported Software

See section Software Support Policy, section Software Change Policy, section Source Archive and section Software Organization for more info.

Unsupported Software

Unsupported software may be installed by anyone in the `uns' group. In order to become a member of this group you must sign an agreement form. Check with the Systems Manager for details.

The sources for programs should be kept in `/usr/local/uns/src'.

Everything should be installed under `/usr/local/uns/bin', i.e. no packages. See section Software Organization for more info.

Source Archive

Any software installed on CS Systems should have its source code available to all users (unless restricted by copyrights) under a uniform source tree. All sources should appear under the /src directory, whether physically in that filesystem or linked there via NFS.

Any programs in the source archive should be in a directory with a name which consists of the package name, a hyphen, and the version number.
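
The naming rule can be sketched as a pair of helper functions. This is an illustration only: the names `archive_dir' and `split_archive_dir' are hypothetical, and the sketch assumes version numbers contain no hyphen so that a directory name can be split unambiguously at its last hyphen.

```python
import posixpath

SRC_ROOT = "/src"


def archive_dir(package, version):
    """Return the source archive directory for a package and version."""
    if not package or not version:
        raise ValueError("package and version are both required")
    if "-" in version:
        raise ValueError("version must not contain a hyphen")
    return posixpath.join(SRC_ROOT, package + "-" + version)


def split_archive_dir(path):
    """Split a source archive directory name into (package, version)."""
    name = posixpath.basename(path.rstrip("/"))
    package, sep, version = name.rpartition("-")
    if not sep or not package or not version:
        raise ValueError("not of the form package-version: " + name)
    return package, version
```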

If any changes are made within the source archive, src-godz@cs.pdx.edu should be notified.

Software Manifests

Every machine in the CS department should have certain software installed to prevent confusion to the user population and us. Some software may not run on certain machines; this is acceptable, and a normal part of managing a heterogeneous network.

Some such software is:

`bash' (links in `/usr/local' and `/usr/local/gnu/bin')
`less'
`top'
`ofiles' or `lsof'

Account Policies

The account system currently in use was written for the Engineering Computer Network at Purdue University by the following people:

David A. Curry - SRI International (while at Purdue University)
Samual D. Kimery - Purdue University
Kent C. De La Croix - Purdue University
Jeffery R Schwab - Purdue University

Changes were made locally by:

John Jendro
Dennis Gilbert
Gary Moyer

The current version of acmaint is 2.0.

acmaint 3.0 is being written by the folks at Purdue at the moment.

It will include:

The use of tcl
The restructuring of the database records
The option to use a real database program
The use of TCP instead of UDP in some parts of the account system

It is hoped to put printer quotas in the database.

Layout of database

There are two types of records in the account database. The first is the user record which contains general information about the user. It contains the following fields:

uid: Numeric user id
login: Login name
sid: Student id number
fullname: Full name
office: Office room number
offphone: Office phone
homephone: Home phone
mailbox: Mailbox address
grouplist: Group memberships
affiliationlist: Departmental affiliations
: Creation time (Has no field name)
: Created by (Has no field name)
uninterp: Uninterpreted data

The second is the account record. There is one account record for every machine that the user has an account on. It contains the following fields:

machine: Host name
login: Login name
gid: Login group id
passwd: Password (encrypted)
classif: Classification
courselist: Courses
authdept: Authorizing department
authorizer: Authorizing person
expdate: Expiration date (mo/yr)
shell: Login shell
homedir: Home directory
: Creation time (Has no field name)
: Created by (Has no field name)
uninterp: Uninterpreted data

For each group there is a group record.

gid: Numeric Group id
gname: Group name
passwd: Password (encrypted)
authorizer: Authorizing person
: Number of members (Has no field name)
: Creation time (Has no field name)
: Created by (Has no field name)
uninterp: Uninterpreted data

Each user also has a host record which is a list of all hosts a user has an account on, and a student id record which allows a user's login name to be determined by student id number.
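
As an illustration of the user record layout, the sketch below parses one record from a colon-separated line. It assumes the fields appear in the order listed above, which this document does not actually guarantee, and that only the final uninterpreted field may itself contain colons. The names `UserRecord' and `parse_user_record' are hypothetical, as are the names `creation' and `createdby' given to the two unnamed fields.

```python
from typing import NamedTuple


class UserRecord(NamedTuple):
    uid: int
    login: str
    sid: str              # kept as text so leading zeros survive
    fullname: str
    office: str
    offphone: str
    homephone: str
    mailbox: str
    grouplist: str        # left unparsed; the list separator is not documented
    affiliationlist: str
    creation: str
    createdby: str
    uninterp: str


def parse_user_record(line):
    """Parse one colon-separated user record (assumed field order)."""
    fields = line.rstrip("\n").split(":", 12)
    if len(fields) != 13:
        raise ValueError("expected 13 fields, got %d" % len(fields))
    return UserRecord(int(fields[0]), *fields[1:])
```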

Daemons

The account system is composed of the following daemons.

acmaint_dbd: This daemon is responsible for making changes to the accounts database, as well as the queue for addme and aack.
acmaint_burstd: This daemon is responsible for handling packets to acmaint_wired.
acmaint_wired: This daemon is responsible for handling packets to acmaint_transd.
acmaint_transd: This daemon is responsible for changing the passwd file and group file. This daemon also makes/removes home directories.

Commands to `acmaint_dbd'

The following is a list of the valid commands to acmaint_dbd:

`add_group gname login': Add login to group gname.
`change_acct login@machine field value': Change field to value in the login@machine account record. If machine is equal to "*", change the account records for all machines. Possible fields are gid, passwd, classif, courses, authdept, authorizer, expir, shell, homedir, uninterp.
`change_group gname field value': Change field to value in the gname group record. Possible fields are gid, passwd, authorizer, uninterp.
`change_user login field value': Change field to value in the login user record. Possible fields are login, uid, sid, fullname, office, offphone, homephone, mailbox, affils, uninterp.
`create_acct login:machine:gid:passwd:classif:authdept:authorizer:expdate:shell:homedir:courselist:uninterp': Create an account record for login@machine using the supplied data. The user record for login must already exist.
`create_group gid/*:gname:passwd:authorizer:uninterp': Create a group record for group gname with group id gid. If gid is "*", generate an appropriate value. Return gname and gid.
`create_user uid/*:login/*:sid:fullname:office:offphone:homephone:mailbox:grouplist:affiliationlist:uninterp': Create a user record for login with user id uid. If uid and/or login is equal to "*", generate appropriate values. Return login and uid.
`delete_acct login@machine': Delete the account recored login@machine
`delete_group gname': Delete the group record gname
`delete_user login': Delete the user record login, plus all account records for login
`fetch_acct login@machine' [field]: Retrieve the account record for login@machine. Return it in colon-separated format unless field is given, in which case only that field is returned. Possible fields are login, gid, passwd, classif, courses, authdept, authorizer, expir, shell, homedir, creation, createdby, uninterp.
`fetch_group gname' [field]: Retrieve the group record gname. Return it in colon-separated format unless field is given, in which case only that field is returned. Possible fields are gid, name, passwd, authorizer, nmembers, creation, createdby, uninterp.
`fetch_hosts login': Return the host record for login.
`fetch_user login' field: Retrieve the user record for login. Return it in colon-separated format unless field is given, in which case only that field is returned. Possible fields are uid, login, sid, fullname, office, offphone, homephone, mailbox, groups, affils, creation, createdby, uninterp.
`remove_group gname' login: Remove login from group gname.
`create_hostname hostname:type:server_name': Create a host record for host. The ID is taken from the hostlist. The user has no choice for hostnumber. type may be one of server, client, or standalone. If type==client then the user must supply a server name.
`delete_hostname hostname': Delete the hostname record hostname
`change_hostname hostname' field value: Change the field to value in hostname hostname record. Possible fields are hosttype, servername, hostname
`fetch_hostname hostname' field: Retrieve the hostname record for hostname. Return it in colon-separated format unless field is given, in which case only that field is returned. Possible fields are hostnum, hosttype, hostname, servername.
`fetch_hostnum hostnum': Retrieve the hostname record for hostnum. Return it in colon-separated format.
`fetch_sid sid': Retrieve the sid record for sid. Return it in colon-separated format.
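
As an illustration of the colon-separated argument format, here is a small Python sketch that assembles a create_user command string. It only builds the text of the command; how the command is delivered to acmaint_dbd is not covered here. The function name is hypothetical, and since the separator used inside grouplist and affiliationlist is not documented above, those values are passed through untouched.

```python
# Field order as given in the create_user entry above.
CREATE_USER_FIELDS = ["uid", "login", "sid", "fullname", "office",
                      "offphone", "homephone", "mailbox", "grouplist",
                      "affiliationlist", "uninterp"]


def create_user_command(**fields):
    """Build the text of a create_user command from keyword arguments."""
    unknown = set(fields) - set(CREATE_USER_FIELDS)
    if unknown:
        raise ValueError("unknown fields: %s" % ", ".join(sorted(unknown)))
    values = []
    for name in CREATE_USER_FIELDS:
        # uid and login default to "*", which asks dbd to generate them.
        default = "*" if name in ("uid", "login") else ""
        value = str(fields.get(name, default))
        # A colon inside a value would shift every later field
        # (assumption: only the trailing uninterp data may contain one).
        if ":" in value and name != "uninterp":
            raise ValueError("field %s may not contain a colon" % name)
        values.append(value)
    return "create_user " + ":".join(values)
```

Leaving out uid and login is the common case, since dbd then picks both and returns them.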

Account Maintenance Programs

The following programs customize Purdue's original account system to work according to our local procedures. These were written at PSU.

addme

The user who wants an account logs in to one of the sparcs as the user addme; this causes the addme program to be run. Addme asks for all the information required to make an account, then contacts acmaint_dbd and tells it to put the information in the queue.

aack

This is run by the system administrator. Menu items 2 through 11 allow an administrator to use dbd commands. (Note: items 5 through 8 are not implemented yet.)

Menu item 1 looks at all of the entries in the queue and allows the administrator to change the queue record into a database record, and make the account.

Menu item 12 allows the administrator to disable an account with a seeme shell.

Menu item 13 allows the administrator to re-enable an account.

account_maint

Yet to be written (any volunteers?). Will allow the user to request changes to his account information, as well as requesting new groups and group additions.

Finger daemon

The finger daemon on the accounts machine has been modified to look up information in the accounts database. For example:

Login Name:        johnj
Name in real life: John Jendro
Groups:            wheel operator sources sworkers tip cmc games 
Mail address:      
This Person has Accounts on: jove.cs.pdx.edu cs.pdx.edu walt-suncs.cs.pdx.edu 

Hostname: jove.cs.pdx.edu
  Group id: 5                           Classif:                
 Auth Dept:                          Authorizer:                
Expiration: never                         Shell: /bin/csh       
  Home Dir: /home/rigel/sys/johnj    Created By: loadpwfile     
Extra Info: 

Hostname: cs.pdx.edu
  Group id: 5                           Classif:                
 Auth Dept:                          Authorizer:                
Expiration: never                         Shell: /bin/csh       
  Home Dir: /home/rigel/sys/johnj    Created By: loadpwfile     
Extra Info: 

Hostname: walt-suncs.cs.pdx.edu
  Group id: 100                         Classif:                
 Auth Dept:                          Authorizer:                
Expiration: never                         Shell: /bin/csh       
  Home Dir: /u/johnj                 Created By: loadpwfile     
Extra Info:

Account Creation

Account Paperwork

Guest Accounts

Currently, due to limited resources in the department, we are not offering any guest access. The only exceptions are:

A volunteer who will be doing systems work. See section Volunteers, for more info.
Someone who can get the Department Head's approval.

In the future, it would be nice to have a guest machine which would have its own phone lines, &c. With user fees, this system could be self-supporting. OSU has a system like this.

However, with the advent of metro area networks, there are an increasing number of public access internet sites available in the area. A handout listing these will be kept on hand in the CS office.

Modems

A modem is considered up when it is able to answer the phone, connect at the proper baud rate and allow the user to get the prompt from the terminal server (currently malach).

It is the intention of Sysgroup that modems be "up", as defined above, at all times. If more than one is not up, it is considered a serious malfunction of the system.

The program modemchk will be used to determine whether a modem is answering properly. If either it or the System Administrator(s) cannot log in, the modem will be considered down.

Factors beyond our control (e.g. line noise, telco problems) may cause the modems to be inaccessible. While we cannot be held responsible for these problems, we will do our best to narrow down the cause and correct it (if possible). Also, we cannot be responsible for problems at the user's end, e.g. improperly configured terminal emulators.

The remainder of this chapter details methods for working with the modem pool.

Diagnosing modem problems

First there are some facts about the modem that need to be gathered.

Which modem is it?
What kind of modem is it (e.g. Practical Peripheral 2400, Practical Peripheral 9600, Hayes 14.4 or Telebit)? Once you know which modem it is, you should be able to determine what kind of modem it is.
Is the modem answering? To find this out, dial the modem from any phone; it should answer with a carrier (a weird whining sound).

Is the modem answering at the correct speed? To find this out, run the command cisco malach "show line line#", where line# is the octal number of the line you are looking at. You must put quotes around the argument.

NOTE: This will not help with Telebits. If you are having a problem with a Telebit, contact the Modem Manager.

jove% cisco malach 'show line 77'
 Tty Typ    Tx/Rx    A Modem  Roty AccO AccI  Uses    Noise
* 77 TTY  2400/2400  F callin    -    2    3  1342   134446

Location: "Dialup (Ext. 3146 )", Type: "dialup"
Length: 24 lines, Width: 80 columns
Baud rate (TX/RX) is 2400/2400, no parity, 1 stopbits, 8 databits
The escape character is "^^", followed by "x"
The local hold character is disabled
No flowcontrol in effect.
Status: PSI Enabled, Ready, Active, Rcvd CR, No Exit Banner
Capabilities: Notification Set, Autobaud Full Range, Modem Callin
Idle EXEC timeout is 2 minutes.
Idle session timeout is 120 minutes.
Modem answer timeout is 90 seconds
Dispatch timeout is 50 milliseconds
Disconnect character is not set
Activation character is ^M (13)
No output characters are padded
No special data dispatching characters

Look at the second line of the output: the speed of the modem appears under the `Tx/Rx' heading.
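
If you check speeds often, the lookup can be scripted. The Python sketch below pulls the transmit and receive speeds out of saved `show line' output. It assumes the summary row always has the layout shown in the sample above (optional `*', line number, `TTY', then Tx/Rx); the function name is made up for illustration.

```python
import re


def line_speed(show_line_output):
    """Return (tx, rx) baud rates from 'show line' output, or None."""
    for line in show_line_output.splitlines():
        # The summary row looks like: "* 77 TTY  2400/2400  F callin ..."
        match = re.match(r"^\*?\s*\d+\s+TTY\s+(\d+)/(\d+)\b", line)
        if match:
            return int(match.group(1)), int(match.group(2))
    return None
```

A result of None means no summary row was recognized, which is worth checking by eye before blaming the modem.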

Fixing modem problems

If the modem is not answering, make sure that it is turned on and that all cables are connected firmly, try cycling the power on the modem, then try again. If this does not work, run cmc (see manual page).
If the modem is answering, but not at the right speed, run cmc (see manual page). If the modem still does not work, cycle power and try again.
Sometimes the modem will answer, but you will not get a `malach>' prompt. In that case, type in the command cisco-cmd conf malach.

`cmc`

Cmc is used to change the configuration of a modem (or to reset the modem's parameters). The format of the command is:

cmc sysname port answer|noanswer modem_type
 or
cmc -m phone answer|noanswer

sysname is the name of the cisco terminal server (`malach')

port is the port number of the modem, in decimal or octal (with a leading 0).

phone is the telephone number that the modem is on.

`answer' or `noanswer' indicates if the modem should answer incoming calls.

modem_type is the type of modem; currently `hayes', `pp', `vbis' and `tbit' are defined.

For example:

jove% cmc -m 54206 answer
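
Because cmc accepts a port in decimal or octal (with a leading 0) while `show line' wants the octal line number, it is easy to mix the two up. The Python sketch below converts between them. Both function names are hypothetical; only the leading-0 rule comes from the description above.

```python
def parse_port(text):
    """Interpret a cmc-style port argument: a leading 0 means octal."""
    if len(text) > 1 and text.startswith("0"):
        return int(text, 8)
    return int(text, 10)


def show_line_argument(port):
    """Format a port number as the octal digits 'show line' expects."""
    return format(port, "o")
```

For instance, the cmc port arguments 077 and 63 name the same port, and show_line_argument gives 77 for either once parsed.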

`modemchk`

Modemchk dials a list of phone numbers and verifies that a modem answers and a malach login prompt is present.

To run modemchk type: modemchk -a

Following is a sample of modemchk's output:

Modem Check-up run at: Tue Dec  3 12:50:19 1991

53145: LOGIN,IN USE
53146: LOGIN
54054: BUSY
54111: LOGIN
54112: NO LOGIN
2nd Try: NO LOGIN
...
55407: NO CARRIER
55408: OUTGOING MODEM FAILED TO RESPOND

A total of 22 numbers were called.

  18: calls connected to a login prompt.
   1: calls connected but could not login.
   1: calls failed with no carrier.
   1: calls failed with a busy signal.
   1: calls failed because the outgoing modem did not wake.

Completed at: Tue Dec  3 13:03:54 1991

Following is a more detailed explanation of the meaning of the messages above:

`CONNECTED TO A LOGIN PROMPT': modemchk was able to get the `malach>' prompt.
`CONNECTED BUT COULD NOT LOGIN': The modem responded but modemchk could not get the `malach>' prompt. This means that malach is misconfigured. Solution: type cisco-cmd conf malach, which will reconfigure malach.
`FAILED WITH NO CARRIER': The modem never responded with a carrier, i.e. the phone rang and rang without connecting. This means that the modem is misconfigured. Solution: use cmc to reset the modem.
`FAILED WITH A BUSY SIGNAL': All of the modems in the rotary are busy or, if the modem is not in a rotary, the modem that modemchk tried to call was busy.
`CALLS FAILED BECAUSE THE OUTGOING MODEM DID NOT WAKE': This means that modemchk was not able to talk with the outgoing modem. Solution: cycle power on the outgoing modem. See section Outgoing modem, for more info.
`COULD NOT ALLOCATE OUTGOING MODEM': The outgoing modem is being used. Modemchk will try to allocate the outgoing modem several more times. If modemchk cannot get the outgoing modem, then either someone else is using it or it is hung.
`IN USE': This modem is currently in use, so the rotary is forwarding you to the next free modem.

Outgoing modem

Sometimes the outgoing modem gets hosed. To fix this problem, run the command cmc -m 54210 answer. If the modem is still hosed, cycle the power on the modem. If the modem continues to be hosed, contact the modem manager.

Printer Policies

This section applies primarily to printers which have quotas installed (at the moment, only lw3). On any printer with quotas, copies must be either paid for or pre-authorized.

The policy is as follows:

Faculty and staff do not have to pay. (This includes EE faculty and staff.)
Student employees doing department work (e.g. sysgroup, tutors) do not have to pay.
Systems volunteers will receive 100 free pages, unless they fall into the previous category.
Grad students and others working for faculty should get a note from the professor saying the professor authorizes payment for a given number of copies.
All others will have to pay.

The procedure for getting/buying copies on a quota'ed printer is as follows:

Pay for the copies in the CS office. You will be given a note which indicates how many copies you paid for. Or:
Get a professor (q.v.) to fill out a note for you. Then:
Take this note to the tutors' room; they will change the quota file appropriately.
If you feel you are entitled to unlimited copies, you will have to contact either trent or johnj to give you such access.

There are two programs for manipulating printer quotas: lwquot and lwaddquot. These programs are rudimentary, and will be replaced with a more elaborate system.

The lwquot command will display a user's current allowance on a printer. Notice that it prints the totals; subtract the two numbers to find out how many more copies the user has. If no username is given, `$USER' is used.

% lwquot glatz
75 pages allowed
15 pages used

To update a quota, use the command lwaddquot. However, the only way to give unlimited access is direct editing of the `quotas' file. Note: this command must be run on the machine which has the printer physically attached.

% lwaddquot glatz 100

Important Files

This section describes several of the important files in the quota system, and for the printers in general. Note that these files are only relevant on the machine which actually has the printer connected to it.

`/var/spool/lw3/quotas'

Format of each line is: `login pages-allowed pages-used' Both of the numeric fields are cumulative. A value of `-1' in the second field means that that person has unlimited access.
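
Since both numbers are cumulative, the pages a user has left is the difference between them. The Python sketch below works this out for one line of the quotas file, treating `-1' in the pages-allowed field as unlimited. The function name is hypothetical, and the sketch assumes the three fields are separated by whitespace.

```python
def pages_remaining(quota_line):
    """Return (login, pages left) for one line of the quotas file.

    Pages left is None when the user has unlimited access.
    """
    login, allowed, used = quota_line.split()
    allowed, used = int(allowed), int(used)
    if allowed == -1:
        return login, None  # -1 pages allowed marks unlimited access
    return login, allowed - used
```

For the lwquot example shown earlier (75 pages allowed, 15 used), this gives 60 pages remaining.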

`/var/adm/lw3.acct'

This file logs the size and user of each file printed. The format of each line is: `pages-printed host:user'. For example:

1.00 jove:glennc
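
The accounting file is handy for seeing who prints the most. The Python sketch below totals pages per user from lines in the format shown above. The function name is made up for illustration, and lines that do not fit the format are skipped instead of reported, which is a choice you may want to change.

```python
from collections import defaultdict


def pages_by_user(acct_lines):
    """Sum pages printed per user from lw3.acct style lines."""
    totals = defaultdict(float)
    for line in acct_lines:
        parts = line.split()
        # Expect exactly "pages host:user"; skip anything else.
        if len(parts) != 2 or ":" not in parts[1]:
            continue
        pages, who = parts
        host, user = who.split(":", 1)
        totals[user] += float(pages)
    return dict(totals)
```

Typical use is to pass it an open file, e.g. pages_by_user(open("/var/adm/lw3.acct")), on the machine that has the printer attached.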

`/var/adm/lpd-errs'

This file logs all error messages from the printer. It is useful, if you want to monitor a printer, to use tail -f on this file.

Here is an example of a successful job:

psbanner: jove:glennc Job: x.c Date: Mon May 20 22:31:17 1991 
psif: jove:glennc lw3 start - Mon May 20 22:31:25 1991
psif: end - Mon May 20 22:32:06 1991

And, here is an example of a failed job:

psbanner: jove:glennc Job: stdin Date: Sun May 19 16:00:39 1991
psif: jove:glennc lw3 start - Sun May 19 16:00:44 1991
%%[ Error: undefined; OffendingCommand: we ]%%
%%[ Flushing: rest of job (to end-of-file) will be ignored ]%%
psif: end - Sun May 19 17:37:59 1991

Mailing Lists

In order to keep the mass of mail aliases understandable, the following standards should be followed.

`usenet': All USENET aliases should be on this machine, for example: `psu-msgs', `msgs', `usenet', `news'.
`eecs': All mailing lists which include people from both departments should be here; for example: `sysgroup', `banzai', `exabyte'.
`cs.pdx.edu': This machine should contain all mail aliases which are specific to the CS department, e.g. `cs-faculty'.
`ee.pdx.edu': This machine should contain all mail aliases which are specific to the EE department, e.g. `ee-faculty'.

All the aliases above should have forwarding aliases on all machines, e.g. `sysgroup: sysgroup@cs.pdx.edu'.

Some aliases should be host specific, e.g. `root'.

Majordomo

Most of the departmental mailing lists are under the control of `majordomo'.

Where possible, all new mailing lists should be put under the control of `majordomo'.

Sending the string help to `majordomo@cs.pdx.edu' will give you information on using it. Some other docs are in `/usr/local/majordomo/Description'.

Information Services

This chapter details some general policies regarding the various Internet-accessible information services which Sysgroup maintains. Each of these information services should have its own alias for the machine, to be used when publicizing the service (e.g. `gopher.cs.pdx.edu').

Anonymous FTP

The Computer Science Department has one anonymous FTP site on `ftp.cs.pdx.edu'.

The incoming directory should not be readable to the world, to prevent the directory from being used for illicit purposes (e.g. by crackers).

Each directory in `pub' should contain an archive with a single theme (e.g. `rfc', `gnu', &c.). Directories for individuals should be put in a single directory (e.g. `faculty' or `people').

Gopher

The departmental gopher server is located on `gopher.cs.pdx.edu'.

Standards

While the use of explicit and detailed standards serves only to hinder creativity and spontaneity, some general guidelines should be laid out to make cooperation among Sysgroup members easier.

Coding

In general, all coding styles serve to delineate methods for ensuring that source code is:

Portable
Readable
Well structured
Well documented
Compatible with the UNIX philosophy

Anyone doing programming as part of Sysgroup should be able to ensure that their code meets the aforementioned criteria, and determine methods for doing so. If you are doubtful about a style, ask.

With the use of tools such as indent, specific indentation styles are easily dealt with if they are difficult for the reader to comprehend.

Documentation

Dictating what text formatter or word processor an individual will use is an overbearing policy. As such, this section will specify some general requirements of any formatting system.

Any system used must be available to other members of Sysgroup, and preferably on a UNIX platform.

Any system must be able to produce a reasonable looking laser printed output and a reasonably accurate ascii approximation. This latter item should require a minimum of by-hand changes. Related to the former, any system should have the ability to produce PostScript files, so that the files can be printed by others at a later date.

The documentation systems preferred by the Systems Manager are troff and texinfo. Straight ascii files are quite appropriate when they will be changed frequently or be eliminated soon. In these cases the time investment in getting something formatted nicely may not be worth the effort.