SIMH Implementation Notes
                            =========================


The following notes pertain to the way certain features (or planned features)
are implemented in SIMH.


---------------------------------------
Address Width and Increment, Data Width
---------------------------------------

The DEVICE structure contains three fields that affect how device data is
examined and deposited.  They are:

  - awidth : address width in bits, 1-64
  - aincr  : increment between successive addresses, nominally 1
  - dwidth : data width in bits, 1-64

The address width determines the range of addresses, and the increment
determines the step between one data element and another, and the data width
determines the size of the data elements.

Here are a couple of example devices from the HP2100 simulator:

  Device  awidth  aincr  dwidth  Address Range
  ------  ------  -----  ------  -------------
   CPU      20      1     16     1 megaword
   DA       26      1     16     128 megabytes
   LPT      32      1      8     4 gigabytes
   PTR      31      1      8     2 gigabytes

These fields affect SCP in the following ways.

  awidth
  ------
    - validates address range in exdep_addr_loop
    - address display width in ex_addr
    - address display width in sim_brk_show

  aincr
  -----
    - decision to print words or bytes in fprint_capac
    - element count for UNIT_MUSTBUF memory allocation in attach_unit
    - element count for UNIT_MUSTBUF file write from buffer in detach_unit
    - address increment for examine loop in sim_save
    - address increment for deposit loop in sim_rest
    - address increment for sim_eval examine in fprint_stopped_gen
    - last valid address calculation for [ALL] in exdep_cmd
    - address increment for ex_addr call in exdep_addr_loop
    - addressing units consumed for ex_addr
    - address increment for examine loop in get_avail
    - address to byte offset calculation for fseek in get_aval
    - rounding number of sim_eval words for deposit loop in dep_addr
    - address to byte offset calculation for fseek in dep_addr
    - address increment for sim_eval print loop in eval_cmd

  dwidth
  ------
    - decision to print words or bytes in fprint_capac
    - data display width in fprint_stopped_gen
    - data display width in ex_addr
    - sim_eval data mask in get_aval
    - sim_eval data mask in dep_addr
    - sim_eval data mask in eval_cmd
    - data display width in eval_cmd
    - byte count for UNIT_MUSTBUF memory allocation in attach_unit
    - byte count for UNIT_MUSTBUF file read into buffer in attach_unit
    - byte count for UNIT_MUSTBUF file write from buffer in detach_unit
    - byte count for UNIT_MUSTBUF buffer copy in sim_save
    - byte count for UNIT_MUSTBUF buffer copy in sim_rest
    - byte count for file read in get_avail
    - byte count for file write in dep_addr

For the DC device, we want to:

  - write words in big-endian format for HPDrive compatibility
  - examine and deposit bytes and 16-bit words
  - examine and deposit machine instructions

Endianness is a problem.  Currently, we have aincr = 1 and dwidth = 8.  We
cannot specify dwidth = 16 because the default sim_fread in get_aval will assume
that the file is little-endian.  But with dwidth = 8, sim_eval gets only one
byte per element, and fprint_cpu will fail.  We cannot supply our own examine
routine that reads a pair of bytes into each element because get_aval masks each
element to the dwidth.

This could be handled in fprint_sym and parse_sym by repacking the sim_eval
array when the routines are called for a DC unit.  But that adds device-specific
tests to generic routines, which seems undesirable.  Also, we only get half the
number of words needed for the longest instruction.  We could get around that by
setting sim_emax to twice the number of words, although that would be
inefficient for the normal case of examining CPU memory.  (In fact, the VAX does
just this, but is specifies sim_emax as 60!)

Might it be possible to define aincr = 2 and dwidth = 16?  We could then write a
device-specific examine to read a pair of bytes into sim_eval in the correct
order.  fprint_sym should then "just work," although the return number of
"addressing units" would have to be doubled for aincr = 2 devices.

Addresses would still be specified in bytes, but they would increment by 2 (the
PDP-11 sppears to work this way).  Specifying an odd address for an aincr = 2
device could either be rejected or could be handled in fprint_sym by printing
the pair of bytes.

---

Actions:

  Command           Displays
  ----------------  ----------------------------------------
  EX DC0 0-3        bytes 0, 1, 2, 3 in octal
  EX -A DC0 0-3     bytes 0, 1, 2, 3 as characters
  EX -W DC0 0-3     words 0, 2 in octal
  EX -W -H DC0 0-3  words 0, 2 in hex
  EX -M DC0 0-3     words 0, 2 as machine instructions
  EX -C DC0 0-3     words 0, 2 as character pairs
  EX DC0 1          byte 1
  EX -W DC0 1       Command not allowed

Fallback display prints "dwidth" field as PV_RZRO, increments by aincr.  Custom
display increments by (1 - return).

Option 1 (aincr = 1, dwidth = 8):

  - sim_eval contains bytes
  - sim_emax must be 20 (10 instructions * 2 bytes/instruction)
  - no custom examine routine needed
  - fallback can be used to display bytes
  - word formats (-C, -M, -W) must pack bytes into words, increment 2

Option 2 (aincr = 2, dwidth = 16):

  - sim_eval contains words
  - sim_emax can be 10
  - custom examine routine reads big-endian word at even address into sim_eval
  - fallback can be used to display words
  - default format displays high/low byte, increments 1
  - word formats (-C, -M, -W) increment 2


---------------------------------
Serial Modem Control Line Support
---------------------------------

Support for modem control lines exists in the following devices:

  - 12587A Asynchronous Data Set Interface
  - 12920A 16-channel multiplexer
  - 12966A BACI
  - 30061A TCI

The lines supported are:

          ---- Control -----         ----- Status -----
          CD   CA   SBA  CH   SCA    CC   CF   SBB  CB   CE   SCF
   Card   DTR  RTS  STx  SRS  SRS    DSR  CD   SRx  CTS  RI   SCD
  ------  ---  ---  ---  ---  ---    ---  ---  ---  ---  ---  ---
  12587A   X    X    X    -    -      X    X    X    X    X    -
  12920A   X    X    X    X    -      X    X    X    X    -    -
  12966A   X    X    X    -    X      X    X    X    X    X    X
  30061A   X    X    X    X    -      X    X    X    X    -    -
  PC Ser   X    X    -    -    -      X    X    -    X    X    -

For any given communicaiton line, there are four possible situations:

  1. Telnet connection, no modem control
  2. Serial connection, no modem control
  3. Telnet connection, modem control
  4. Serial connection, modem control

Modem control is indicated by the simulator calling either "tmxr_modem_control"
or "tmxr_modem_status" for a given line.  Once a call has been made for a given
line, that line is considered to be under modem control from then on (though it
might be desirable to have a control call that resets modem control status).
Modem control is indicated by a flag in the EX_TMLN structure, which means that
the structure must be allocated by either a serial attach or by a modem control
call, and also that it cannot be freed, as modem control status must persist
through a detach/attach sequence.

Modem control and status calls for a serial connection affect the hardware
serial port.  Control calls for a Telnet connection hane no effect, except that
a DTR drop disconnects the Telnet session.  Status calls for a Telnet connection
are simulated; CD, CTS, and DSR are asserted if the line is connected and denied
if the line is disconnected.

The only decision based on modem control is whether to assert DTR and RTS when a
serial connection is made and deny them when the connection is broken.  This is
necessary when not under modem control to enable transmission on the port.  It
must not be done if the simulator is operating the modem lines explicitly to
avoid interference.


--------------------------------------
The User View of Terminal Multiplexers
--------------------------------------

The terminal multiplexer devices (BACI, MPX, and MUX for the 2100, ATCD for the
3000) attempt to present a logical picture of the multiplexer to the user when
interacting via the ATTACH, DETACH, SET, and SHOW commands.  This is complicated
by the requirement for a network listening port and associated attachable unit,
and the potential presence of additional units for controllers or timers.

For example, the 12792 8-channel multiplexer for the 1000 ideally would be
modeled as an 8-unit device, where units 0-7 correspond to multiplexer ports
0-7.  However, this device simulation (MPX) also requires a controller unit
(unit 8) and unit to hold the listening port (unit 9).  These units must be
hidden, so they won't appear in a SHOW MPX report.  Moreover, we want to
prohibit user access to the hidden units. But we must provide a mechanism to
allow attachment of the listening port.

To meet these goals, we want to allow these commands:

  - ATTACH MPX      to attach the listening port
  - ATTACH MPX0-7   to attach individual serial ports
  - DETACH MPX      to detach the listening port
  - DETACH MPX0-7   to detach individual serial ports

We disallow these commands:

  - ATTACH MPX8-9   to attach the controller and listening unit directly
  - DETACH MPX8-9   to detach the controller and listening unit directly

In addition, we must allow the indirect actions invoked by these commands:

  - RESTORE         to attach both listening and serial ports
  - DETACH ALL      to detach both listening and serial ports
  - EXIT            to detach both listening and serial ports

Complicating the model is the fact that RESTORE and DETACH ALL will call the MPX
attach and detach routines directly for the poll unit (unit 9), which we must
allow, and that EXIT will call the MPX detach routine for all unattachable units
(units 0-7 and unit 8), which we must ignore.

Further complications arise from wanting to be compatible between the three
possible front-ends (3.10 extended, 3.10 base, and 4.0).  For a 3.10xtd ATTACH
or DETACH command, the "sim_ref_type" variable is set to REF_DEV if a device is
specified or to REF_UNIT if a unit is specified.  For a 3.10xtd RESTORE, DETACH
ALL, or EXIT command, the variable is set to REF_NONE.  For 3.10 or 4.0,
"sim_ref_type" is a constant REF_DEV.  For all versions, RESTORE sets the
SIM_SW_REST switch, EXIT sets the SIM_SW_SHUT switch, and DETACH ALL sets no
switches.

What we want, then, are these actions:

  Command       3.10 extended   3.10 base       4.0
  -----------   -------------   -------------   -------------
  ATTACH MPX    attach net      attach net      attach net
  ATTACH MPX0   attach serial   SCPE_NOATT      SCPE_NOATT
  ATTACH MPX8   SCPE_UDIS       SCPE_UDIS       SCPE_UDIS
  ATTACH MPX9   SCPE_UDIS       SCPE_UDIS       SCPE_UDIS

  DETACH MPX    detach net      detach net      detach net
  DETACH MPX0   detach serial   SCPE_NOATT      SCPE_NOATT
  DETACH MPX8   SCPE_UDIS       SCPE_UDIS       SCPE_UDIS
  DETACH MPX9   SCPE_UDIS       SCPE_UDIS       SCPE_UDIS

  DETACH ALL    detach net      detach net      detach net
                detach serial   impossible      impossible

  RESTORE mpx   detach net      detach net      detach net
                attach net      attach net      attach net

  RESTORE mpx0  detach serial   impossible      impossible
                attach serial   impossible      impossible


These conditions pertain to the listed actions:

  Action        Unit    3.10 extended           3.10 base               4.0
  -----------   ----    ---------------------   --------------------    --------------------
  ATTACH MPX     0      REF_DEV                 REF_DEV                 REF_DEV
  ATTACH MPX0    0      REF_UNIT                REF_DEV                 REF_DEV
  ATTACH MPX8    8      REF_UNIT                REF_DEV                 REF_DEV
  ATTACH MPX9    9      REF_UNIT                REF_DEV                 REF_DEV

  RESTORE        0      REF_NONE, SIM_SW_REST   impossible              impossible
  RESTORE        8      impossible              impossible              impossible
  RESTORE        9      REF_NONE, SIM_SW_REST   REF_DEV, SIM_SW_REST    REF_DEV, SIM_SW_REST

  DETACH MPX     0      REF_DEV                 REF_DEV                 REF_DEV
  DETACH MPX0    0      REF_UNIT                REF_DEV                 REF_DEV
  DETACH MPX8    8      REF_UNIT                REF_DEV                 REF_DEV
  DETACH MPX9    9      REF_UNIT                REF_DEV                 REF_DEV

  DETACH ALL     0      REF_NONE                REF_DEV                 REF_DEV
                 1      REF_NONE                REF_DEV                 REF_DEV
                 8      impossible              impossible              impossible
                 9      REF_NONE                REF_DEV                 REF_DEV

  EXIT           0      REF_NONE, SIM_SW_SHUT   REF_DEV, SIM_SW_SHUT    REF_DEV, SIM_SW_SHUT
                 1      REF_NONE, SIM_SW_SHUT   REF_DEV, SIM_SW_SHUT    REF_DEV, SIM_SW_SHUT
                 8      REF_NONE, SIM_SW_SHUT   REF_DEV, SIM_SW_SHUT    REF_DEV, SIM_SW_SHUT
                 9      REF_NONE, SIM_SW_SHUT   REF_DEV, SIM_SW_SHUT    REF_DEV, SIM_SW_SHUT


What we want, then, are these actions for mpx_attach:

  Called for    Unit    3.10 extended   3.10 base       4.0
  -----------   ----    -------------   -------------   -------------
  ATTACH MPX     0      attach net      attach net      attach net
  ATTACH MPX0    0      attach serial   SCPE_NOATT      SCPE_NOATT
  ATTACH MPX8    8      SCPE_UDIS       SCPE_UDIS       SCPE_UDIS
  ATTACH MPX9    9      SCPE_UDIS       SCPE_UDIS       SCPE_UDIS

  RESTORE         0     attach serial   impossible      impossible
  RESTORE         8     impossible      impossible      impossible
  RESTORE         9     attach net      attach net      attach net


The 3.10 extended SCP will set sim_ref_type to REF_DEVICE for an explicit device
reference, to REF_UNIT for an explicit  unit reference, and to REF_NONE for
implicit references (i.e., by RESTORE, DETACH ALL, and EXIT).  So for 3.10
extended, this is all that is needed:

  Attach
  -----------------------------------------------
    if sim_ref_type = REF_DEVICE
        attach poll_unit
    else
        attach specified_unit

  Detach
  -----------------------------------------------
    if sim_ref_type = REF_DEVICE
        detach poll_unit
    else
        detach specified_unit

...because sim_ref_type can be only one of the three values.

To support 3.10 base as well, the above must be augmented as follows, assuming
that REF_UNIT has been redefined locally to an undefined value (i.e., one that
doesn't match what SCP returns for a unit reference):

  Attach
  -----------------------------------------------
    if sim_ref_type = REF_DEVICE
        attach poll_unit
    else if sim_ref_type = REF_UNIT or REF_NONE
        attach specified_unit
    else
        error

  Detach
  -----------------------------------------------
    if sim_ref_type = REF_DEVICE
        detach poll_unit
    else if sim_ref_type = REF_UNIT or REF_NONE
        detach specified_unit
    else
        error

In this case, we want to prevent all unit attaches and detaches, except in the
case of RESTORE or DETACH ALL, which will attach or detach the poll unit
directly.  If we redefine the REF_UNIT value above to something other than the
REF_UNIT Value set by the ATTACH/DETACH command processors, then any attempt to
attach or detach a unit via these commands will be rejected.

For 4.x support, the issue is complicated by the absence of the sim_ref_type
value.  The above code would work for 4.x if sim_ref_type was set to REF_DEVICE
for ATTACH and DETACH (except DETACH ALL) and to REF_NONE otherwise.  We might
do this by hooking ATTACH and DETACH for 4.x only.

Otherwise, the code above must be modified as follows, assuming that
sim_ref_type has been defined as a constant with value REF_DEVICE:

  Attach
  ----------------------------------------------------------------------------
    if sim_ref_type = REF_DEVICE and specified_unit = unit_0
        attach poll_unit
    else if sim_ref_type = REF_UNIT or REF_NONE or SIM_SW_REST in sim_switches
        attach specified_unit
    else
        error

  Detach
  ----------------------------------------------------------------------------
    if sim_ref_type = REF_DEVICE and specified_unit = unit_0 or poll_unit
        detach poll_unit
    else if sim_ref_type = REF_UNIT or REF_NONE or SIM_SW_SHUT in sim_switches
        detach specified_unit
    else
        error

While the above works "universally," it contains redundancies for 3.10 extended.
If the value is REF_DEVICE, the specified unit will always be be unit_0, and if
sim_switches is SIM_SW_REST or SIM_SW_SHUT, the value will always be REF_NONE,
Moreover, this extra code must be replicated in all multiplexer devices.
Support for 4.x is problematic and may have to be discontinued, depending on
future changes.  So it would be better to have all of the 4.x dependencies
centralized, ideally in a separate source module that could be dropped, but
otherwise at least in the hp_sys module, isolated by conditional compilation.

-----

Could this be simplified?  What about:

  status = tmxr_attach_unit (&mpx_desc, &mpx_poll, uptr, cptr);  // or mpx_poll_number?

...where the extended version does the above REF_DEVICE test, and the
non-extended version uses this substitution:

  if (uptr == mpx_desc.dptr->units                        // if unit 0
    || uptr == mpx_poll && sim_switches & SIM_SW_REST)    //   or poll unit and restoring
      status = tmxr_attach (&mpx_desc, &mpx_poll, cptr);
  else
      status = SCPE_NOATT;

...expressed as:

  #define tmxr_attach_unit(mptr,pptr,uptr,cptr) \
    ((uptr) == (mptr)->dptr->units || (uptr) == (pptr)) \
      ? tmxr_attach (mptr, pptr, cptr) \
      : SCPE_NOATT)

For detach, what about:

  status = tmxr_detach_unit (&mpx_desc, &mpx_poll, uptr);  // or mpx_poll_number?

...for the extended version, and:

  if (uptr == mpx_desc.dptr->units                        // if unit 0
    || uptr == mpx_poll)                                  //   or poll unit
      status = tmxr_detach (&mpx_desc, &mpx_poll);
  else
      status = SCPE_NOATT;

...expressed as:

  #define tmxr_detach_unit(mptr,pptr,uptr) \
    ((uptr) == (mptr)->dptr->units || (uptr) == (pptr)) \
      ? tmxr_detach (mptr, pptr) \
      : SCPE_NOATT)

If this works, then we won't need to simulate "sim_ref_type" nor trap the ATTACH
and DETACH commands in hp_sys.c, and we wouldn't need a special 4.x in
hp_defs.h.


---------------------------------
Calibrated Timers and Breakpoints
---------------------------------

Each time SCP stops execution for a breakpoint, in particular string
breakpoints, all calibrated timers are reinitialized.  This is due to the
"sim_rtcn_init_all" call just before the "sim_instr" call in "run_cmd".  This
resets the timers to their initial values, which are typically much faster than
the eventual calibrations, and restarts the calibration process.  In a command
file that has a large number of prompt/response pairs, the timers are
continually reinitialized, so calibrated operation never occurs.  This can be
seen in the HP 3000 "diag-online.sim" execution, where MPE reports six seconds
of CPU time, even though only about three seconds of wall-clock time has
elapsed.  Running the same command file with 4.x reports one second of CPU time.

SIMH 4.x provides for this by replacing the "sim_rtcn_init_all" call with a call
to a new "sim_start_timer_services" routine (sim_timer.c).  The comments for
that routine say:

  If we're quickly running again after being stopped for less than the time of
  one calibrated clock tick, then don't force a complete recalibration of any
  timers that may have been previously running.

Basically, the initialization call is skipped if the difference between the
current (i.e., restarting) time and the time at which simulation last stopped is
less than ten milliseconds.

We'd like to do the same thing in 3.x, but there currently isn't any way to do
that without replacing "run_cmd" in its entirety.  We already shim "run_cmd",
but we can't, e.g., save and restore the calibrated timer arrays around the
initialization call in "sim_timer.c" because they aren't global.  Attempting to
re-initialize each timer to its current value in the "sim_instr" postlude helps
somewhat, but it also restarts the one-second initial calibration period.
Typcially, the prompts and responses occur much faster than one second apart, so
the calibration period never completes.

It would be possible to hook the call in "run_cmd", but the mistiming only shows
up in situations where large numbers of prompt/response pairs are scriped, and
timer calibration is important.  In practice, whether the diagnostic reports the
correct CPU time is irrelevant, and most cases where the time is important,
scripting only occupies a small startup regime, e.g., to set the system clock.

At the moment, this issue is unresolved.


----------------------
Buffered Serial Output
----------------------

Buffering TMXR output speeds up Telnet nicely, but it slows down serial output.
Consider a 100-character write.  MPE writes 80 bytes plus an ENQ to the output
buffer, taking "n" milliseconds.  The ENQ forces a flush, and those 81 bytes are
sent to the terminal at 9600 baud.  Then a delay ensues until the terminal
returns ACK, and another delay (<= 10 milliseconds) ensues until the input poll
is performed.  Then the serial line is idle again while the output buffer is
filled. This only affects 3.x output, as 4.x serial output is not buffered.

Serial buffering is desirable in the case of the HP2100 MPX device.  When
processing input editing characters, MPX may need to output multiple characters
in a single event service call.  For example, it responds to receiving BS by
sending SPACE and BS, and to receiving DEL by sending BACKSLASH, CR, and LF.
The current code cannot handle SCPE_STALL, which would occur (and does occur in
4.x) if the serial line is unbuffered.  So MPX calls "tmxr_linemsg" which, for
4.x, detects the stall and sits in a 10-millisecond "sleep" loop while waiting
for the character to be accepted.  This isn't ideal, as it may cause the clock
to lose wall calibration.  The 3.x version of "tmxr_linemsg" just blindly
assumes that "tmxr_putc_ln" cannot fail and would lose characters if a stall
occurred.  Moreover, the buffer size is a constant in 3.x, so there's no easy
way of shortening the buffer length except by shimming "tmxr_putc_ln" and
calling "tmxr_poll_tx" after each character output.

Ideally, the first few characters would be flushed from the buffer so that the
serial port could begin output almost immediately after generation.  Then, while
those are being output (at a maximum rate of 9600 baud, or about one per
millisecond), the remainder of the buffer could be filled and then posted to the
serial port.  This would maximize throughput.

In practice, this isn't that much of a concern, for two reasons.  First, a full
line of 80 characters can be output to the buffer in FASTTIME mode in about 3
milliseconds.  So the delay only represents an increase of about 4%.  Second, an
unconditional buffer flush is performed at every poll service entry, i.e., every
10 milliseconds.  So that 80-character output string may fall partly across the
poll time, in which case the serial output will begin before the full string is
output.  In this case, the 4% delay represents a maximum delay; the average will
be less.

As noted above, it would be possible to shim "tmxr_putc_ln" and automatically
flush the buffer after, say, every 30 characters (the BACI does an ENQ/ACK
handshake every 33 characters, whereas the other multiplexers use 80 characters
as the threshold).  But it isn't necessary and probably wouldn't have a visible
gain for the added complexity.


------------------------------
VM-Specific Handler Interfaces
------------------------------

Optional VM-specific handlers are implemented as pointers that are statically
initialized to NULL.  A VM may assign one or more of these to point at routines
that will be called by the SCP.  For example:

   extern void (*sim_vm_post) (t_bool from_scp);

and then:

   sim_vm_post = &local_post;

Generally, this is done in the CPU power-on reset section when the initial
memory pointer is NULL, or in the optional one-time VM init.  The latter is done
by defining:

   void (*sim_vm_init) (void) = &local_init;

...which overrides the default NULL value that is established otherwise.


---------------------------
Simulation SAVE and RESTORE
---------------------------

The simulator SAVE command saves the values of the following unit structure
fields:

  filename -- the name of the attached file
  time     -- the unit activation time
  flags    -- the user flag values
  capac    -- the unit capacity
  u3       -- a user-specified value
  u4       -- a user-specified value
  u5       -- a user-specified value
  u6       -- a user-specified value

It does not save:

  pos        -- the current file offset
  buf        -- the I/O buffer word
  wait       -- the current service request time

...or any variable not referenced by a REGister element.


------------
Attach Modes
------------

The SCP "attach_unit" routine tries to open all files in read/write mode.  This
is to permit EXAMINE and DEPOSIT to work on all device files, regardless of the
underlying device.  So, for example, it is deemed desirable to DEPOSIT to a
paper tape reader file or EXAMINE a line printer output file.

This does not usually present any sort of problem, except in a few cases:

  * Read-only devices, such as paper tape readers, will create a new zero-length
    file if the specified file does not exist, unless the -E switch is
    specified.

  * A UNIT_SEQ device, such as a printer, cannot be attached to a pipe because
    the "fseek" resulting from that flag fails.

  * Output to a Unix pipe fails; opening in read/write mode appears to connect
    both ends of the pipe to the same program.

It is desirable to resolve these problems without altering the existing
semantics.

The first problem is easily solved by automatically adding the -E switch to all
read-only devices.  This will require adding an attach routine to the HP 2100
PTR device, which is the only device that relies exclusively on the default
attach behavior.

The second problem can be solved by removing UNIT_SEQ from write-only devices,
although this will eliminate the possibility of positioning the output medium by
setting POS.  Alternately, UNIT_SEQ can be added or removed dynamically in the
device attach routine, depending on a test for a pipe.

The third problem cannot be accommodated by the existing "attach_unit" action.
This is because all files are opened in read/write (i.e., update) mode, and this
interferes with pipe operation by opening both ends of the pipe.  An additional
mode (write-only) must be added for proper pipe operation.

The current "attach_unit" open behavior is influenced by the -E, -N, and -R
switches, and the UNIT_ROABLE flag, as follows:

  -R  -N  -E  RO  Mode  Action
  --  --  --  --  ----  ---------------------------------------------------
   Y   x   x   N   --   "Read only operation not allowed"
   Y   x   x   Y   rb   open for reading
   N   Y   x   x   wb+  truncate to zero length or create file for update
   N   N   x   x   rb+  open file for update

If rb+ fails, then if the error code is EROFS (file resides on a read-only file
system)  or EACCES (permission is denied):

   N   N   x   N   --   "Read only operation not allowed"
   N   N   x   Y   rb   open for reading

If the error code is something else:

   N   N   Y   x   --   "File open error"
   N   N   N   x   wb+  truncate to zero length or create file for update

Logically, the existing code does:

  if -R
    then if UNIT_ROABLE
      then rb (read)
      else SCPE_RO

  else if -N
    then wb+ (read/write new)

  else
    try rb+ (read/write existing)

    if write not allowed
      then if UNIT_ROABLE
        then rb (read)
        else SCPE_RO

    else if -E
      then SCPE_OPENERR
      else wb+ (read/write new)

One option is to add a new -W switch that would work in conjunction with the -N
switch as follows:

  if -W
    then if -N
      then wb (write new)
      else ab (append new)

This will work, but it has the drawback that append mode does not allow file
positioning ("Opening a file with append mode causes all subsequent writes to
the file to be forced to the then current end-of-file, regardless of intervening
calls to the fseek function").  This would prevent setting POS to reposition,
and so would not require the UNIT_SEQ flag, eliminating the fseek call prior to
execution resumption.

Ideally, for normal files we would want mode "wb+" for -N and "rb+" otherwise,
with an fseek to the EOF after attaching, and mode "wb" for a pipe file.  In
the latter case, positioning does not make sense.

The HP simulator devices use the following unit flags for devices that can be
attached to files (i.e., that call "attach_unit"):

  Device  SEQ  FIX  ROA
  ------  ---  ---  ---
  DS       -    Y    Y
  LP       Y    -    -

  DA       -    Y    Y
  DP       -    Y    Y
  DQ       -    Y    Y
  DR       -    Y    -
  DS       -    Y    Y
  LPS      Y    -    -
  LPT      Y    -    -
  PTR      Y    -    Y
  PTP      Y    -    -
  TTYPUN   Y    -    -

...and the following that call "sim_tape_attach":

  Device  SEQ  FIX  ROA
  ------  ---  ---  ---
  MS       -    -    Y

  MSC      -    -    Y
  MTC      -    -    Y

Pipes only make sense for UNIT_SEQ devices, because they cannot be positioned.
The decision as to which end of the pipe to open (read or write) can be made by
looking at the UNIT_ROABLE flag, which is not present on write-only devices.

The "stat" function can be called to get the file type of a specified filename
prior to opening.  The "st_mode" field will be S_IFIFO for a pipe and S_IFREG
for a normal file.  Note that Unix uses S_IFIFO, MSVC uses _S_IFIFO, Mingw uses
_S_IFIFO but defines S_IFIFO as an alias, and Cygwin uses S_IFIFO but defines
_S_IFIFO as an alias.  "stat" returns 0 if the file exists and -1 if it does not.

To handle pipes transparently, we could shim the "attach_unit" routine.  The
"ex_attach_unit" shim would operate as follows:

  is_pipe := stat () = 0 and then S_IFIFO in st_mode

  if is_pipe
    then
      if not UNIT_SEQ
        then error
      else if UNIT_ROABLE
        then add -R
        else add -W

  status := attach_unit ()

  if status = OK
    then
      if is_pipe
        then remove UNIT_SEQ
      else if not UNIT_ROABLE
        then
          sim_fseek (SEEK_END)
          if ferror
            then clearerr

Callers of sequential devices must set the UNIT_SEQ flag in the associated
device attach routine.  Otherwise, a "detach_unit" shim would have to be created
to restore UNIT_SEQ to the unit if the (open) stream refers to a pipe; this
would require an fstat call.

This would then require only this "attach_unit" addition in SCP:

  if -W
    then wb (write new)

...because -R would open "rb" (read-only).  This arrangement does not alter
normal file handling but allows pipes to be specified as output devices.


---------------------------
SIMH Tape Format Extensions
---------------------------

The SIMH tape format specification ("SIMH Magtape Representation and Handling")
says that a conforming tape image contains a series of objects representing
either "metadata markers," such as tape marks, or data records.  Each object is
introduced by a 32-bit control word.  Several four-byte markers are defined, as
is the format of data records, which begin and end with identical data length
control words that bracket the data payload.  All of the remaining control word
bit patterns are reserved.

Data record control words use bit 31 for an error indicator, reserved bits 30-24
"must be zero," and record length bits 23-0 "must be non-zero."  However, the
current SIMH tape library does not enforce this.  It strips bit 31 and uses bits
30-0 as the record length.

Enforcing the reserved bits restriction would limit individual data records to
16 MB each.  As the library only reads and writes full records, a simulator
capable of reading or writing a record of maximum size would require a 16 MB
buffer.  In normal use, this would be far larger than most records.  However,
some tape drives are capable of writing a single record that encompasses an
entire tape reel.  This cannot be accommodated with the current format.

At any given file position, interpretation of a tape image begins with a
four-byte control word.  The current specification defines two divisions of
control words: markers and record lengths.  The assignments are:

  Control Value Range  Assignment
  -------------------  --------------------------------------------
  00000000             Tape mark
  00000001 - 00FFFFFF  Good data record, 1 to 16,771,215 bytes long
  01000000 - 80000000  Reserved
  80000001 - 80FFFFFF  Bad data record, 1 to 16,771,215 bytes long
  81000000 - FFFEFFFE  Reserved
  FFFEFFFF             Erase gap (half-gap in forward reads)
  FFFF0000 - FFFF00FF  Erase gap (half-gap in reverse reads)
  FFFF0100 - FFFF7FFF  Reserved
  FFFF8000 - FFFF80FF  Erase gap (half-gap in reverse reads)
  FFFF8100 - FFFFFFFC  Reserved
  FFFFFFFE             Erase gap (primary value)
  FFFFFFFF             End of medium

Graphically, the primary control words are as follows:

   31  30  29  29  27  26  25  24  23  22  21   [...]   2   1   0
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
  | 0   0   0   0   0   0   0   0 | 0   0   0   [...]   0   0   0 | tape mark
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
  | 0 | 0   0   0   0   0   0   0 |          length > 0           | good data record
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
  | 1 | 0   0   0   0   0   0   0 |          length > 0           | bad data record
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
  | 1   1   1   1   1   1   1   1 | 1   1   1   [...]   1   1   0 | erase gap
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
  | 1   1   1   1   1   1   1   1 | 1   1   1   [...]   1   1   1 | end of medium
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+

A problem with the current implementation is that a conforming reader that
encounters a reserved control word in the image file does not know how to
recover.  This hobbles the introduction of new SIMH tape features, as a
simulator using an older tape library version will fail when it encounters a
tape image written by a newer library version.

Having no interpretation of the reserved control word, the reader can recover
only by advancing the file position to resynchronize with the known format.  The
problem is the unknown word may be a single four-byte marker or may introduce a
record of some undefined length.  Without knowing this, the reader can only
report a fatal error.

An additional limitation of the current format is that data records carry only
two identifications: "good" or "bad."  Especially when generating tape images
from physical magnetic tapes, it may be desirable to retain additional
information with the image.  For example, a "marginal data" indication (i.e.,
the data was recovered through error correction rather than a clean read) may be
pertinent.  It may be desirable to keep data associated with the physical tape
(e.g., data density, locations of parity errors within "bad" records, text from
the tape label) with the image.  It may even be desirable to retain the original
NRZI or PE flux changes along with the recovered data.

Also, specific tape drive simulators may wish to store private data with the
image.  For instance, the HP 9144A Cartridge Tape Drive can report to the user
information from the tape that indicates whether the cartridge was factory or
user certified.  This information is stored "outside" of the user-accessible
data area.  This drive also differentiates between a data record that has been
written and one that has been "formatted" but never written.  There is no way to
represent these requirements within the existing format.

Finally, some tape controllers allow a single data record spanning the entire
length of a 2400-foot reel to be written.  At 6250 bpi, this represents over 170
MB of data.  The existing 16 MB limit prevents such a record from being stored
in a SIMH tape image without artificially dividing it into smaller sections.

As a result of these limitations, three changes to the interpretation of the
existing format are proposed:

 1. All control words, including reserved values, are placed into either the
    marker division or the record division.

 2. These two divisions are further subdivided into classes "reserved for SIMH
    use" and classes "reserved for private use."

 3. The length allocation for the record division is extended from 24 to 28
    bits.

The SIMH tape library will be revised to ignore all unknown SIMH-reserved
objects present in an image file.  Private-reserved objects will also be ignored
unless private data support is explicitly enabled.  A simulator requesting such
"extended" support will be able to write private markers and data records, and
will obtain private objects when reading.  If extended support is disabled, the
library's record reading routines will advance past such objects until a known
marker or data record is encountered.  That is, they will be treated as though
they were erase gaps, taking up space in the file but otherwise "invisible" to
the caller.

The benefits of this proposal are:

 - Data records up to 256 MB are possible, ensuring that a single record
   spanning the entire tape reel can be represented.

 - Private data can be kept in the same file as SIMH-standard information.

 - A conforming reader will automatically ignore unrecognized objects in an
   image file.  In particular, the standard data part of a tape image containing
   private data can be successfully read by a reader that does not understand
   the extended format.

 - Existing simulators will not be affected either by private data or newer
   SIMH-standard formats.

To provide this support, the following control word interpretation is proposed:

   31  30  29  29  27  26  25  24  23  22  21   [...]   2   1   0
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
  | control class |             marker-specific value             | marker
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
  | control class |               data length value               | record
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+

Control words are written in little-endian format in the tape image file,
regardless of host platform orientation, maintaining compatibility with the
current SIMH format.

The following backward-compatible control class assignments are proposed:

  Class   Value   Assignment
  -----  -------  --------------------------------------
    0       0     Tape mark
    0      >0     "Good" data record
    1      any    \
       ...         | Private data records
    6      any    /
    7      any    Private single-word marker
    8       0     "Bad" data record, no data recovered
    8      >0     "Bad" data record, some data recovered
    9      any    \
       ...         | SIMH-reserved data records
    E      any    /
    F      any    SIMH-reserved single word marker

Currently, two Class F markers are defined: one indicating an erase gap, and the
other indicating the end-of-medium.  An erase gap appears in a file as a set of
four-byte erase gap markers.  The count of markers reflects the physical length
of the gap at the assumed density.  For example, a three-inch gap at 800 bpi
would occupy 2400 bytes on a physical tape.  In a tape image representing an 800
bpi tape, 600 four-byte erase markers would be written.

A tape file need not end with an EOM marker; the physical end-of-file serves the
same purpose.  However, if an EOM is present, the SIMH tape library will not
read or position past the marker.  This implies that an EOM will never be seen
when reading an image in the reverse direction.

Part of the Class F marker range must be reserved to recognize "half-erase-gap"
markers.  These arise because data records occupy a multiple of two bytes, while
markers occupy four bytes.  If a data record that overwrites a longer erase gap
occupies a multiple of four bytes, then it would overlay an integral number of
erase gap markers.  Interpretation in this case is straightforward, as the first
control word following the trailing data record length word is the defined erase
gap marker.

However, if the overwriting data record occupies only a multiple of two bytes,
then it will overlay half of the erase gap marker that follows the trailing
record length word.  Graphically, the problem is as follows:

    ... FE FF FF FF  FE FF FF FF   Original erase gap
  02 00 00 00   |      |     |     New trailing data record length word
    |     |     \      /     \
  02 00 00 00  FF FF FE FF  FF FF  Resulting tape image
  -----------  ------------
  Length Word   Erase Gap

A forward read of the image after the data record retrieves the Class F marker
value of FF FF FE FF because it reads half of the overwritten marker and half of
the following full marker.  This special value is recognized as a "half-gap"
marker, and reading is realigned by backing up the file position by two bytes.
A forward read then continues with the first full erase gap marker of the
remaining gap.

When reading in reverse, the problem is more difficult.  Referring to the above
diagram, after reading full four-byte erase gap markers as the file position
retreats toward the start of the remaining erase gap, the four-byte value
preceding the last full marker of the original gap consists of half of the
overwritten gap marker (FF FF) and half of the upper two bytes of the preceding
data record length word (00 00).  In this case, the Class F marker will have the
value FF FF 00 00.  This too is recognized as a half-gap marker, and realignment
is done by backing up the file position by two bytes to point at the first byte
of the erase gap.  A reverse read then continues with the data record by
retrieving the complete four-byte trailing data record length word.

The difficulty is that "the half-gap marker" is actually a range of Class F
marker values.  They all start with FF FF (the truncated half of the first erase
gap marker), but the following two bytes from the upper part of the length word
may assume almost the full range of 16-bit values: from 00 00 through FF FD.  We
cannot allow the values FF FE or FF FF, because then the marker would have the
same value as a full erase gap or EOM marker.

Graphically, then, the range of values that must be interpreted as half-gap
markers is shown below:

        FE FF FF FF  Erase gap (primary value)
  01 00 00 00   |    Data class 0, lowest count
  FF FF FD FF   |    Marker class F, highest value
          |     |
        ----- -----
        00 00 FF FF  Half-gap read in reverse direction (lowest)
        FD FF FF FF  Half-gap read in reverse direction (highest)

As the lower two bytes of the half-gap marker comes from the upper two bytes of
the preceding control word, Class F markers with values above FF FD FF FF must
be reserved for half-gap interpretation and cannot be assigned as valid markers.
Any Class F marker from the start of the range to the above value can be
designated as a future marker value.  Therefore, the Class F range assignments
are as follows:

  F0000000 - FFFDFFFF  Reserved for future use (available)
  FFFE0000 - FFFFFFFD  Reserved for erase gap interpretation
  FFFFFFFE             Erase gap (primary value)
  FFFFFFFF             End of medium

Values within the reserved erase-gap interpretation subrange are as follows:

  FFFE0000 - FFFEFFFE  Illegal (would be seen as full gap in reverse reads)
  FFFEFFFF             Interpret as half-gap in forward reads
  FFFF0000 - FFFFFFFD  Interpret as half-gap in reverse reads

A conforming writer will never write the illegal marker values, and a conforming
reader will recognize the half-gap marker values and resynchronize as described
above.


Library Implementation
~~~~~~~~~~~~~~~~~~~~~~

A simulator indicates that it wants to use the extended SIMH tape format for a
given unit by including a new MT_F_STDEX symbol in the static initialization of
the unit's flags.  It is defined as:

  #define MTUF_F_STDEX     5
  #define MT_F_STDEX       (MTUF_F_STDEX << MTUF_V_FMT)

The macro defines the unit flag bits that specify the extended SIMH format.  A
simulator can include this symbol in the static UNIT initialization, and it uses
a unit flag area that is already reserved, so the user unit flags are not
affected.

A magnetic tape simulator that wants to support multiple tape formats including
extended SIMH format must declare extended support initially and allow the user
to change formats with calls to "sim_tape_attach" that specifies the -F switch,
or to "sim_tape_set_fmt".  When these routines are called with extended format
currently enabled, a flag is set in the unit's dynamic flags field that allows
future calls to return to extended format.  An attempt to change to extended
format when the flag is not set will be rejected with an Invalid argument error.
This ensures that a tape simulator that does not understand the extended format
will not permit the user to select the format.

When the format is set to MT_F_STDEX, the read routines will return private data
records and markers, rather then skipping over them.  A simulator not prepared
to receive private objects must not permit the user to select the extended SIMH
format.

To ensure this, the "sim_tape_set_fmt" routine is modified to check the current
format on entry.  If it is MT_F_STDEX, then a "dynamic unit flag" is set to
indicate that the "SIMHEX" format is allowed.  A simulator that supports private
objects will initialize the unit format to MT_F_STDEX, while existing simulators
will default the format to MT_F_STD.  The initial entry format establishes
whether the routine will allow the extended format for current and future calls.

A simulator supporting extended format may see one new status code:

  #define MTSE_RESERVED   12

...as described below.  This code is only returned if the unit is configured for
extended SIMH format.

Once the extended format is enabled, private objects may be written or read by
tape library routines.  This is accomplished by adding a new routine to write
private markers:

 - t_stat sim_tape_wrmrk (UNIT *uptr, t_mtrlnt mk);

   The "mk" parameter specifies the private marker class in the upper four bits
   and the marker-specific value in the lower 28 bits.  If the format is not
   MT_F_STDEX, an MTSE_FMT error is returned.  If the class is not the private
   marker class, an MTSE_RESERVED error is returned.

To accommodate private data records, the existing write routine is overloaded as
follows:

 - t_stat sim_tape_wrrecf (UNIT *uptr, uint8 *buf, t_mtrlnt cc);

   The "cc" parameter specifies a data record class in the upper four bits and a
   data record length in the lower 28 bits.  If the format is not MT_F_STDEX,
   then the class can be either the standard "good" or "bad" class, i.e., upper
   four bits are zero or eight; if another class is specified, MTSE_FMT is
   returned.  If a "good" record is specified with a zero length, then routine
   returns MTSE_OK with nothing done.  If the format is MT_F_STDEX, then a
   private data record class may also be specified.  If the specified class is a
   SIMH-reserved class or the private marker class, MTSE_RESERVED is returned.
   If a "good" record is specified with a zero length, MTSE_INVRL is returned.

Private markers and data records are read with the standard read routines,
overloaded as follows:

 - t_stat sim_tape_rdrecf (UNIT *uptr, uint8 *buf, t_mtrlnt *cc, t_mtrlnt max);
 - t_stat sim_tape_rdrecr (UNIT *uptr, uint8 *buf, t_mtrlnt *cc, t_mtrlnt max);

   When the standard format is enabled, the current semantics are unchanged.
   The "cc" parameter returns just the length portion of the data record marker,
   and the return status is MTSE_OK for a "good" record, MTSE_RECE for a "bad"
   record, or a status code corresponding to a standard marker, such as
   MTSE_TMK.  All other objects present in the tape image are ignored.

   When the extended format is enabled, the variable addressed by the "cc"
   parameter must be set before calling the routine to a bitmap of the object
   classes to return.  Each of the classes is represented by its corresponding
   bit, i.e., bit 0 represents class 0, bit 1 for class 1, etc.  The routine
   will return only objects from the selected classes.  Unselected class objects
   will be ignored by skipping over them until the first selected class object
   is seen.  This allows a simulator to declare those classes it understands
   (e.g., standard classes 0 and 8, plus private classes 2 and 7) and those
   classes it wishes to ignore.

   Markers and data records in the SIMH-reserved classes are read and
   interpreted by the tape library, regardless of whether or not the
   corresponding class bits are set.  The bits only affect whether the objects
   are returned to the caller.  Setting the bitmap to zero (no classes selected)
   will cause the routine to return only when it encounters a tape mark,
   end-of-medium, or the physical end of file -- an action identical to that of
   the "space record forward" routine.

   On return, the variable addressed by the "cc" parameter contains either the
   marker class in the upper four bits and the marker-specific value in the
   lower 28 bits, or a data record class in the upper four bits and a data
   record length in the lower 28 bits.  The new MTR_C macro may be used to
   extract the class, and the MTR_L macro may be used to extract the data
   length.

   Standard markers are indicated by the appropriate MTSE return status values.
   If the SIMH-reserved marker class is selected, the marker will be returned in
   addition to being interpreted by the tape library.

   If a "bad" class data record is selected and read, MTSE_RECE will be
   returned, in addition to the class and length in the "cc" parameter variable.
   Reads of the "good" data class and all private data and marker classes return
   MTSE_OK.  If SIMH-reserved classes are selected and read, MTSE_RESERVED is
   returned if the object is not recognized by the tape library; otherwise,
   MTSE_OK is returned for data records, or the MTSE value appropriate for the
   standard marker is returned.

   Reads of marker class objects do not use "buf" and "max" parameters.  When a
   standard or private data record is read, a new MTR_C macro may be used to
   extract the class, and the MTR_L macro may be used to extract the data
   length.

If a simulator supports multiple tape formats, the extended format is selected
by specifying the name "SIMHEX" to a SET FORMAT or ATTACH -F command.  These
commands call "sim_tape_set_fmt" or "sim_tape_attach", respectively (the
latter calls "sim_tape_set_fmt" internally).  This routine is modified to add a
new "SIMHEX" format name, corresponding to the MT_F_STDEX value, to its table of
formats.


Rejected Alternates
~~~~~~~~~~~~~~~~~~~

Alternate ways of enabling the extended SIMH tape format:

 - A new MTUF_STDX unit flag.

   Including this symbol in the static UNIT initialization enables the use of
   the extended SIMH format when the standard SIMH format is selected, i.e.,
   when MT_F_STD is used.  If the unit also accepts the ATTACH -F command to set
   the tape format, then the flag is ignored when one of the other formats is
   enabled.  A drawback is that this method reduces the available user unit
   flags for tape devices, which may impact existing simulators.


 - A "sim_tape_set_fmt" routine call.

   This requires a call from the unit's power-on reset routine that specifies a
   new format string identifier (e.g., "SIMHEX").  A drawback is that if the
   tape device supports multiple formats, and the user has selected a different
   format (e.g., TPC), this will reset it if RESET -P is entered.


 - A new "sim_tape_extend" routine call.

   This would establish the extended SIMH format.  It requires adding a new
   library routine that must be called from the unit's power-on reset routine.
   This has the same drawback as above, i.e., resetting the preference if the
   user enters the RESET -P command.


For the second option, private objects are written with these two new tape
library routines:

 - t_stat sim_tape_wprecf (UNIT *uptr, uint8 *buf, t_mtrlnt cc);

   The "cc" parameter specifies the private record class in the upper four bits
   and the data record length in the lower 28 bits.  The class must be one of
   the private record classes, or MTSE_RSRVD is returned.  If the format is not
   STDX, MTSE_FMT is returned.


 - t_stat sim_tape_wrmrk (UNIT *uptr, t_mtrlnt mk);

   (Operation is as described earlier.)


Private objects are read with these new tape library routines, which also read
SIMH-provided objects:

 - t_stat sim_tape_rprecf (UNIT *uptr, uint8 *buf, t_mtrlnt *cc, t_mtrlnt max);
 - t_stat sim_tape_rprecr (UNIT *uptr, uint8 *buf, t_mtrlnt *cc, t_mtrlnt max);

   The "cc" parameter returns the class in the upper four bits and the record
   length in the lower 28 bits.  The MTR_C macro may be used to return the
   class, and the MTR_L macro may be used to return the data length.

[...except that sim_tape_rprecX will read either a regular data record, a
private data record, or a private marker.  Regular markers are indicated by the
MTSE code.  This is identical to option 1...]


--------------------
Expanded MTAB Access
--------------------

The modifier structure (MTAB) used by the device- and unit-specific SET and SHOW
commands provides a simple and elegant method of manipulating the 16 bits of the
user portion of the UNIT flags field -- the regular MTAB.  For other modifier
targets or to modify numeric value fields, the extended MTAB mechanism offers
complete flexibility, albeit at the cost of the simplicity of regular MTABs.

The main cost is that extended MTABs require the use of validation and print
routines.  Often those routines are trivial, doing nothing more than setting or
clearing a flag in the DEVICE flags field, or printing a word if the flag bit is
set.  If the extended SET takes a numeric value parameter, then the validation
routine must parse the parameter, validate against any minimum and maximum
value restrictions, mask the target, and insert the new value.  If several
fields are to be set, then several validation and print routines must be
written, each of which is only slightly different than the others.

The extra work to use extended MTABs to manipulate DEVICE flags creates the
temptation to use a UNIT flag for an option that is logically part of the device
(e.g., a strap on the controller card) rather than part of the connected
peripheral (e.g., a strap on the drive).  For devices with single units,
distinguishing between the device and the unit is usually not needed.  For
devices with multiple units, though, this blurs the distinction between them,
potentially creating confusion for the user when issuing SET or SHOW commands
("Do I specify the device name or the unit name?").

It would be helpful to have a mechanism that combines the ease of use of regular
MTABs with the flexibility of extended MTABs.

This is a proposal for such a mechanism -- the expanded MTAB.  The goal is to
eliminate the need for validation and print routines in the majority of cases,
or, if elimination is not possible, then to reduce the code in such routines to
just the unique processing required.

Expanded MTABs work with SET/SHOW <device> and SET/SHOW <unit> commands to
modify:

  - The user portion of the "flags" field of the DEVICE or UNIT structure.

  - Any portion of a user field (e.g., "u4") of the UNIT structure (or of the
    DEVICE structure, should one be added in the future).  For units, the
    field designated in unit 0 identifies the field that will be modified when
    another unit is specified (e.g., a command specifying unit 3 modifies the
    indicated field in the fourth UNIT in the units array).

  - Any portion of a global "uint32" scalar variable for a device command or a
    global "uint32" array element corresponding to the unit number for a unit
    command.

The modifications may apply to:

  - A flag or set of flags indicated by a "SET <device/unit> <option>" command.

  - A numeric value field indicated by a "SET <device/unit> <option>=<value>"
    command.

None of these operations require the use of validation or print routines,
although they may be employed if desired.


Overview
~~~~~~~~

A flag modifier operates just like a regular MTAB, except the target is not
limited to the user portion of the "flags" field.  It processes commands of the
form:

  SET <device> <option>
  SET <unit> <option>

...where <option> matches the "mstring" field.

If defined, the validation and display routines are passed the "match" value to
decide whether the value should be stored or printed.  The routines return
SCPE_OK to confirm the action or an SCPE error code to deny the action and print
the associated error message.  The routines may also return -1 to deny the
action but return success.  This special return is used when the routine wants
to store or print the value itself.  As an option, the "mask" field rather than
the "match" field may be passed to the routines.  This is useful if the routine
is validating or printing a flag that is being cleared, as in this case the
"match" field is zero.

A value modifier operates by parsing the supplied numeric value and modifying
the target as directed.  It processes commands of the form:

  SET <device> <option>=<value>
  SET <unit> <option>=<value>

...where <option> matches the "mstring" field and <value> is an unsigned number.
Typically, this accepts numbers in the range 0-N and places the result in the
target field.  Parsing and modification may be altered in these optional ways:

  - The parsing radix may be specified.

  - An upper numeric bound may be enforced.

  - A lower numeric bound may be enforced.

  - The stored value may be biased to zero by subtracting the lower bound (so,
    e.g., the range 9-12 may be stored in a two-bit field as 0-3 and then
    reconstituted as 9-12 when it is displayed).

  - The stored value may be represented as a bit in a set of bits, positioned by
    the supplied value, with the bits numbered from right-to-left or optionally
    from left-to-right, and with the bit numbers optionally biased by the lower
    bound.

  - The stored value may be printed only if named explicitly in the command.

  - The stored value may be designated read-only, disallowing the SET command.

These options are activated by specifying expanded modifier flags in the MTAB
entry.


Background
~~~~~~~~~~

The current MTAB structure looks like this:

  - mask    = bit mask for testing the unit.flags field
  - match   = value to be stored (SET) or compared (SHOW)
  - pstring = pointer to character string printed on a match (SHOW), or NULL
  - mstring = pointer to character string to be matched (SET), or NULL
  - valid   = address of validation routine (SET), or NULL
  - disp    = address of display routine (SHOW), or NULL
  - help    = 4.0 dummy

The "mask" field has two interpretations.  If bit 31 = 0, then the entry is a
regular modifier for the UNIT "flags" field, and bits 30-0 provide a mask for
that field (implying that only 15 of the 16 "user flag bits" are available).  If
bit 31 = 1 (MTAB_XTD), then the entry is an extended modifier, and bits 5-0
provide additional flags that configure the entry.


Implementation
~~~~~~~~~~~~~~

The existing MTAB structure is augmented by adding a new "uint32 expander" field
to the end of the current definition.  If this field is zero, which it will be
for all existing simulators, then the entry is not an expanded MTAB, and the
existing interpretation as a regular or extended MTAB is used, depending on bit
31 of the "mask" field.

Bits within the "expander" field control the expanded operation, as follows:

   31  30  29  28  27  26  25  24  23  22  21  20  19  18  17  16
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
  | user reserved | - | - | - | N | R | L | P | B | M | G | U | D |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
  |             minimum value             |         radix         |
  +---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+---+
   15  14  13  12  11  10   9   8   7   6   5   4   3   2   1   0

Where:

  D = the modifier applies to devices
  U = the modifier applies to units
  G = the modifier applies to a global variable
  M = the mask instead of the match field is supplied to the validation routine
  B = the user-supplied value is biased by the minimum
  T = the user-supplied value is converted to a bit position
  L = the bit position number increases from left to right
  R = target is read-only; only SHOW is allowed
  N = SHOW only if the option is named in the command

An entry is expanded if either the D or U bit is set.  Unused bits are
reserved for future use.

The value in the "radix" field determines whether the MTAB operation is a flag
modifier or a value modifier.  If the radix is zero, then the operation modifies
a flag and uses the "SET <dev/unit> <option>" command form with the existing
regular MTAB "mask" and "match" semantics.  If the radix is greater than zero,
the operation modifies a value and uses the "SET <dev/unit> <option>=<value>"
command form.

In the following descriptions, the "target" indicates the destination of the
modified value or the source of the displayed value.  The target may be located
in the DEVICE or UNIT flags field, in a specified field within a DEVICE or UNIT
structure, or within a uint32 global variable (for SET/SHOW <device>) or a
uint32 element in a global array (for SET/SHOW <unit>) corresponding to the unit
number.

The G, D, and U bits, plus the "desc" field, determine the target, as follows:

  Bits Set   desc    Target
  --------  -------  -------------------------------------------------------
     D      NULL     The user flags part of the DEVICE "flags" field
     U      NULL     The user flags part of the UNIT "flags" field
     D      pointer  The DEVICE field addressed by "desc"
     U      pointer  The UNIT field addressed by "desc" in unit 0
   D + G    pointer  The global scalar addressed by "desc"
   U + G    pointer  The global array element addressed by "desc" and unit #

Setting the G bit indicates that a global variable is modified or displayed,
rather than a field in the DEVICE or UNIT structure.

Setting the D bit requires a device name to be specified in the SET or SHOW
command.  If the "desc" field is NULL, then the "flags" field of the DEVICE
structure is the target.  If the "desc" field is not NULL, then it points at the
target field in the DEVICE structure if the G bit is clear, or it points at a
uint32 scalar variable if the G bit is set.

Setting the U bit requires a unit name to be specified in the SET or SHOW
command.  If the "desc" field is NULL, then the "flags" field of the UNIT
structure is the target.  If the "desc" field is not NULL, then it points at the
target field in the UNIT structure of unit 0 if the G bit is clear, or it points
at a uint32 array variable if the G bit is set.  In these cases, the target is
adjusted to the UNIT field or array element corresponding to the specified unit
number.

If the D and U bits are both specified, or if the G bit is specified and the
"desc" field is NULL, then an "Internal error" failure occurs.

Setting the M bit supplies the mask field instead of the match field as the
"value" parameter to the optional validation routine.  This is required when
validating that a flag may be cleared, as the match field is zero for such
entries.

The B, T, L, and N bits pertain only to value modifiers.  If the radix field is
zero, i.e., the entry is a flag modifier, these bits are ignored.

The "radix" field is used to parse the value parameter, which is an unsigned
integer, and the "mask" field specifies a set of contiguous bits that identifies
the field within the indicated item to modify or display.  For a SET command, if
the "minimum value" field is non-zero, then the supplied value is rejected if it
is less than the specified minimum.  If the MTAB "match" field is non-zero, then
it is interpreted as a maximum value, and the supplied value is rejected if it
is greater than the specified maximum.  If the value passes the constraints, it
is stored in the target.  A SHOW command will extract the value from the target
and display it in the specified radix.

Setting the B bit biases the stored value by the specified minimum value, which
shifts the stored range from <min> to <max> to the range 0 to <max> minus <min>.
The bias is automatically removed when the value is displayed.  This allows a
field to be wide enough to contain the value's range without requiring the width
needed to hold the maximum value.  For example, a modifier specifying the B bit
with a range of 9-12 would require only two bits in the target field.  Without
the B bit, four bits would be required.  This also allows, e.g., the values A-D
to be stored in two bits by specifying the radix as 16 and the minimum value as
10.

Setting the T bit converts the numeric value into a bit whose position
corresponds to the value, counting from right to left, where the LSB is bit 0.
The bit position is converted back into an integer when the value is displayed.

Setting the L bit in addition to the T bit converts the numeric value into a bit
whose position corresponds to the value, counting from left to right, where the
LSB is the bit corresponding to the maximum value.

If the B bit is also set, then the value is biased as above before converting
into bit.  If the L bit is set and the T bit is clear, then the L bit is
ignored.

Setting the R bit rejects SET commands.  Only SHOW commands are allowed.

Setting the N bit suppresses the display of the target value unless the option
is explicitly given in the SHOW command.  This corresponds to the similar
operation provided for extended MTABs.

For numeric and bit-positional modifiers, the value is left-shifted to the
location of the least-significant bit of the mask value, masked, and deposited
into the target location.

The four "user reserved" bits are reserved for user interpretation.  Example
uses might be to indicate how the "desc" field should be adjusted for multiple
instances, or to indicate that special additional actions must be taken by the
validation or display routines.

For SET, an expanded MTAB entry is interpreted as follows:

 1. Test to see if the mstring entry exists.

 2. Test to see that the SET option matches the mstring.

 3. Test to see if SET is allowed for the entry (not read-only).

 4. Test to see if the entry is valid for the type of SET being done (device or
    unit).

 5. Determine the target location for the value.

 6. If a radix is defined, parse the value specified after the option name and
    verify that it is within the range allowed; convert to a positioned bit if
    requested.  If a radix is not defined, then the match field (or the mask
    field if the M bit is set) supplies the value.

 7. Call the validation routine, if any.

 8. If a radix is defined and the entry specifies biasing, subtract the
    specified minimum from the numeric value or right-shift the positioned bit.
    Then left-shift the value to align with the mask field.

 9. Apply the mask value to the target location and then merge the value into
    the location.

For SHOW, an expanded MTAB entry is interpreted as follows:

 1. Test to see if the pstring and mstring entries exist.

 2. If an option is specified, test to see that it matches the mstring and that
    a radix is defined, as named commands are allowed only for numeric values.
    If an option is not specified, and the entry requires a named command,
    return without displaying anything.

 3. Test to see if the entry is valid for the type of SET being done (device or
    unit).

 4. Determine the source location of the value.

 5. Apply the mask value to the source value.

 6. If a radix is defined, right-shift the value to right-align it and remove
    any applied bias.  If a radix is not defined, test to see that the masked
    value equals the match value.

 7. Call the display routine, if any.

 8. If a radix is defined, convert from a positioned bit back to a bit number if
    specified and then print the pstring and the value in the specified radix.
    If a radix is not defined, print the pstring by itself.


Validation and Display Routines
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The validation routine, declared as:

  t_stat validation_routine (UNIT *uptr, int32 value, char *cptr, void *desc)

...confirms that the target location can be set to "value".  The display
routine, declared as:

  t_stat display_routine (FILE *st, UNIT *uptr, int value, void *desc)

...confirms that the source value should be displayed.  For these routines, the
parameters they receive for the various expanded modifiers, are determined by
the D, U, and G bits, and the content of the "radix" and "desc" fields, as
follows:

  ----- Modifier Entry -----  ---- Routine Parameters ----
  D  U  G  rdx  desc field    uptr    value  cptr  desc     Destination or Source
  -  -  -  ---  ------------  ------  -----  ----  -------  ---------------------
  0  1  0   0   NULL          unit N  match  NULL  device   UNIT flags field
  0  1  0   0   UNIT field    unit N  match  NULL  ufield   UNIT user field
  0  1  1   0   array of N    unit N  match  NULL  element  array element

  1  0  0   0   NULL          unit 0  match  NULL  device   DEVICE flags field
  1  0  0   0   DEVICE field  unit 0  match  NULL  dfield   DEVICE user field
  1  0  1   0   uint32        unit 0  match  NULL  scalar   scalar variable

  0  1  0   n   NULL          unit N  num    parm  device   UNIT flags field
  0  1  0   n   UNIT field    unit N  num    parm  ufield   UNIT user field
  0  1  1   n   array of N    unit N  num    parm  element  array element

  1  0  0   n   NULL          unit 0  num    parm  device   DEVICE flags field
  1  0  0   n   DEVICE field  unit 0  num    parm  dfield   DEVICE user field
  1  0  1   n   uint32        unit 0  num    parm  scalar   scalar variable

On entry to the display routine, the "stream" parameter is always the open
output stream, and the "value" parameter is always the source value.  The "uptr"
and "desc" parameters are as above.  On entry to the validation routine for a
flag modifier, the "mask" field is substituted for the "match" field if the M
bit is set.  On entry to the validation routine for a value modifier, "cptr"
points at the value parameter string, and "value" will be the parsed value or -1
if parsing fails.

The return value from the routines is SCPE_OK if the value may be stored or
displayed, or another SCPE error code if the value must not be stored or
displayed due to an error, or -1 to indicate success but inhibit storing or
displaying the value because the routine itself has already done it.

An example of a validation routine that sets the value itself is one used to
validate a channel switch value that ranges from 0-7 but also has an OFF
position.  If the user had entered a SET MT CHANNEL=OFF command, then the
routine would be entered with the "value" parameter set to -1 and the "cptr"
parameter pointing at the "OFF" string.  The routine would then take the
appropriate action for the OFF setting and return -1; this is treated as
success, except that the value will not be set after returning.


Summary
~~~~~~~

Expanded MTAB entries won't address all of a simulator's modification and
display issues, but it should reduce the complexity substantially for those
actions that essentially duplicate existing SCP field modification actions.


Extensions
~~~~~~~~~~

It might be useful to add "userflags" fields to the DEVICE and UNIT structures.
The current arrangement has both system flags and user flags contending for the
same 32-bit space, and a few device simulators use all or almost all of the
available user flags.  If in the future the system set must expand, it runs the
risk of reducing the user field beyond the requirements of those devices.  Also,
the user flags field is partly occupied by the magnetic tape library flags, and
if the latter must expand, the former may overflow.  If expansion forces a
device to move some user flags to one of the user fields (i.e., u3 through u6),
and those fields are already committed, a substantial rewrite of the device
simulation may be required.

Appending a "userflags" field to the end of the existing DEVICE and UNIT
structures would not require any changes to existing simulators, but it would
allow devices with extensive option settings to maintain them in single
locations that would be unaffected by any system flags expansion.  Manipulation
of such new fields could be easily accomplished by adding an MTAB_EUF modifier
to expanded MTAB processing to provide for a "userflags" target.


-------------------------
Multiple Device Instances
-------------------------

Computer systems often employ multiple interface cards of a particular type.
Common examples are terminal interfaces and multiplexers, line printer
interfaces, and disc and tape drive interfaces.  In some extreme cases, e.g.,
the classic HP 3000 machines, there are only two interfaces: one for terminals
and one for HP-IB parallel devices.  Simulators for these machines must, of
necessity, provide multiple instances of the devices.  Currently, these must be
implemented at the VM level, as SCP provides no facilities to duplicate devices.

This is a proposal to add SCP assistance to allow a VM to designate "clonable"
devices and to allow the user to specify the number of instances of each of
those devices.  That is, if a given device permits it, the user may increase or
decrease the current number of instances of the device.  Such devices would
start out with one instance (the original device), and the user may increase (or
subsequently decrease) the number of "clones" of that device to fit the system
configuration requirements.

The SCP alterations to support multiple device instances are constructed to
require no changes to any simulator that does not support cloning.  So all
existing simulators will work "as-is" without changes.


Overview
~~~~~~~~

The SCP additions needed to support multiple device instances are:

 - One new user command to change the number of device instances.

 - One new entry for the command in "set_dev_tab".

 - One new command handler function.

 - One new DEVICE flag to indicate that a device is a copy.

 - One new DEVICE structure field to point at a VM support routine.

The existing "sim_devices" array is enlarged to allow for additional device
instances, but the structure (i.e., a series of device pointers ending with a
NULL pointer) is unchanged, so all of the SCP and VM routines that scan this
array are not affected.

A device simulator that supports multiple instances must be constructed in a
specific way:

 - The global "sim_devices" array defined by the VM must be larger than the
   number of initialized elements.  The additional NULL elements provide the
   space for the creation of additional device instances.  The total number of
   dynamically created clones for all devices is limited by the number of
   additional array elements provided by the static "sim_devices" array.  This
   limit should be set to reflect the expansion limitations of the simulated
   hardware, where the number of device select codes, channel numbers, interrupt
   priorities, etc. determines maximum the number of additional I/O cards
   that may be added to the system.  Clones for a given device will be inserted
   into the array immediately after the original device, so a device and its
   clones will always be contiguous.

 - All state variables pertaining to a given instance must be collected in a
   structure that can be easily duplicated.  Typically, a single structure is
   used with possible sub-structures, although several independent structures
   could be used at the expense of more complex duplication.  This structure is
   statically allocated for the initial ("original") instance.  It will be
   dynamically allocated by SCP for additional ("cloned") instances.

 - All references to the state structure must be through pointers.  Each
   instance will have its own pointer that points to its own state structure.

 - The device must maintain an instance identifier that is unique for each
   instance.  A typical instance ID is simply the copy number, with zero
   identifying the original instance.  If the device defines a Device
   Information Block (DIB) as the device context, the instance ID may be stored
   as a field of the DIB.  Another possible location might be within the user
   flags in the DEVICE structure.  The instance ID is used by the device's
   various internal routines to obtain the pointer to the current instance's
   state structure.

 - The device must provide an instance adjustment function and place a pointer
   to the function in the new "adjust" field of the DEVICE structure.  When this
   field is non-NULL, SCP considers the device to be clonable.  The adjustment
   routine is responsible for adjusting the copy of the operating state created
   by SCP to fit the requirements of the cloned device.  The responsibilities of
   this routine are detailed below.


Implementation
~~~~~~~~~~~~~~

The required SCP changes follow.


1. The new SCP global command is:

   SET <dev> COUNT=<n>

It sets the total number of devices to the specified value, so if count = 3,
there will two copies created plus the original device.  A subsequent "SET <dev>
COUNT=4" would add one more copy, while "SET <dev> COUNT=1" would remove all
copies, leaving only the original device.

Neither SCP nor any existing VM uses the COUNT modifier, so it should be safe to
use it here.


2. The new entry in "set_dev_tab" is:

   { "COUNT", &set_dev_count, 0 },

This provides the linkage from the COUNT keyword to the command handler.


3. The new command handler declaration is:

    t_stat set_dev_count (DEVICE *dptr, UNIT *uptr, int32 flag, char *cptr);

The routine is called when the user enters the new command, with "dptr" pointing
at the DEVICE structure to clone, "uptr" pointing at the device's UNIT array,
"flag" set to 0, and "cptr" pointing at the character string denoting the new
device count.

After validating that cloning of the device is permissible, the routine
allocates new DEVICE structures and inserts the device pointers into the
"sim_devices" array after the original device and any preexisting clones.  The
original DEVICE field values are copied into each new device.  A new name array
is allocated for each device and is initialized to a permutation of the original
device name using one of two specified algorithms (the VM adjustment routine can
change the names if it desires).

Each device is also allocated a new UNIT array and a new REG array; these are
initialized to the values in the corresponding arrays in the original device.
The MTAB array is not duplicated, as local copies are not needed unless one or
more of the "desc" fields point at local state variables.  The VM adjustment
routine handles this if needed.  All devices also have their "ctxt" fields set
to NULL; the VM routine is responsible for allocating new device contexts if
needed.

If clones are to be deleted, the VM routine is called to free any allocations it
made.  Then the command handler frees the device names, UNIT and REG arrays, and
the DEVICE structures.  It then removes the corresponding "sim_devices" entries.


4. The new DEVICE field appended to the existing structure is:

   uint32 (*adjust) (struct sim_device **dvptr, int32 delta);

The field is initialized by the VM to point to a per-device support routine that
adjusts the newly allocated DEVICE, UNIT, and REG structures, relocates any
state-specific pointers in these structures, and performs whatever other work is
needed to coordinate the new devices with their dedicated state variables.  The
field defaults to NULL for existing VMs that do not explicitly support dynamic
allocation.

The adjust routine is called in one of two ways.

If, on entry, the "dvptr" parameter is NULL, then the routine returns the number
of elements defined for the "sim_devices" array in bits 15-0, the maximum count
of instances allowed for the device in bits 23-16, and an indicator of the
device naming algorithm (described below) to use in bits 31-24.  The COUNT
command handler uses the count values to ensure that additional copies of the
device do not overflow the static "sim_devices" array and do not exceed the
limit of devices of the specified type.

If, on entry, the "dvptr" parameter is not NULL, then it points at the first of
potentially several device pointer array entries that have been inserted or will
be removed.  The "delta" parameter indicates the number of device pointers that
have been added (if positive) or that will be deleted (if negative).

The routine is responsible for adjusting the initialization of newly cloned
DEVICE structures, including additional allocations for subsidiary structures as
needed, and for deallocating any structures from clones to be deleted that were
added during a prior call.  It is called by the command handler after the device
structures have been allocated or before the device structures are to be freed.
The routine returns SCPE_OK if all allocations or deallocations succeed.  If the
routine returns an SCP error code, any newly allocated devices are freed before
the error message is printed.

Not all devices will be clonable, e.g., the CPU or the system clock.  If the VM
does not set this new DEVICE field, SCP will reject clone commands for the
device with a "Command not allowed" error.  As existing simulators will default
this field to NULL, none of their devices will be clonable.

Each clonable device will have some upper limit on the number of possible
instances.  For example, an I/O card that has jumpers to select a channel number
from 0-15 cannot have more than 16 instances (the original plus fifteen clones).
Another example is a bus controller that can accommodate no more than four
hardware line printers due to bus loading.  Individual instance count limits can
be imposed, and the number of "extra" elements in the "sim_devices" array
determined by summing the various instance counts.


5. The new device flag is:

   #define DEV_V_CLONE   7
   #define DEV_CLONE     (1 << DEV_V_CLONE)

It is set by SCP on new instances of DEVICE structures.  When scanning the
device list to add or reduce the number of copies of a given device, SCP uses
this flag to determine how many copies of the original device currently exist.

Only devices with DEV_CLONE set will be freed if the number of copies is
reduced.  Because pointers to clones appear in the device pointer array
immediately following the original device pointer, a scan forward from that
point counting DEV_CLONE flags yields the number of clones without having to
keep a separate count of each device's clones.  The flag also prevents the user
from cloning a clone.


State Variables and Registers
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The current state of a simulated device may reside in four places: in a DEVICE
structure, in a device context structure, in one or more UNIT structures, and in
a collection of state variables.  During device cloning, the DEVICE and UNIT
structures are duplicated by SCP.  The device's "adjust" routine is responsible
for duplicating the device context and the set of state variables.  Commonly,
the latter are implemented as global scalars in a device simulator, but
collecting them into a single STATE structure allows for easier duplication.

A related issue is the initialization of the duplicate REG arrays.  Registers
reference many if not all of the state variables, and these too must be
duplicated and accessed per-device.  The register collection typically will
reference fields in more than one structure, e.g., the state held in the DEVICE,
state held within device UNITs, state within the device context such as I/O
address and interrupt priority number, and state within the STATE structure.

When the original REG array is duplicated, the copy still refers to the field
locations in the original state structure.  These must be changed to refer to
the same fields in the duplicate of that structure.  The "loc" field of each
register element is an address that is some fixed offset from the origin of the
state structure itself.  Subtracting the two addresses yields the offset.
Adding that offset to the start of the duplicated state structure yields a
reference into the same field of the duplicate.  If fields from several
structures are present in the REG array, then this process must be repeated for
each duplicate structure.  Expanded MTAB "user reserved" field values might be
used to indicate the base structure used by each register to the "adjust"
routine.  Or the "adjust" routine could be aware directly of which register
"loc" values apply to which structures.

The various routines present in a device simulator must ensure that they use the
per-device copies of each of these structures.  The DEVICE structure contains a
pointer to the UNITs, so gaining access to the former grants access to the
latter.  Access to the local STATE structure is also necessary, so some means of
associating a DEVICE with its dynamically allocated state is required.

Several mapping arrangements are possible.  The simplest is to define a static
array of structures, each consisting of a DEVICE pointer and a STATE pointer,
and sized to hold the maximum number of instances of the specific device that is
reported by the "adjust" routine.  This array is indexed by an instance number
from 0 to N, where 0 is the original instance.  When clones are created, the
"adjust" routine sets a field in each new device's context to the instance
number.  This number is then used to uniquely identify a DEVICE and its
associated state.  The first entry in the mapping array is statically
initialized to point at the static DEVICE and STATE structures of the original
device.


Indirect Access to State Variables
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

A complication arises when a state structure must be accessed via an indirect
reference, e.g., in a routine that is passed a DEVICE or UNIT pointer or a
channel address, rather than the instance number.  In these cases, the passed
parameter must allow a lookup either of the state pointer directly, or of the
DEVICE pointer that can be used to obtain the state pointer.

For DEVICE pointer translation, the device context can be examined to obtain the
instance number that was set by the "adjust" routine, and the instance number is
used to index into the mapping array to get the state pointer.

For UNIT pointer translation, a straightforward solution is to call
"find_dev_from_unit" to get the DEVICE pointer, use that to get the DIB pointer,
and use that to get the state pointer.  However, that routine does a linear
search of the "sim_devices" array each time, so it's poorly suited if the
service routine is called frequently, e.g., to transfer each data byte of a fast
device.

A better solution is to store the instance number in the UNIT structure, e.g.,
in one of the user fields (e.g., "u3") or as a field in the user portion of the
"flags" field.  This can be done during the initial power-on reset that occurs
at simulator startup or when a new clone is "adjusted."

For channel address (e.g.) translation, a separate static mapping table from
channel numbers to instance number can be used.  The table can be populated by
the same mechanism as above for setting the UNIT fields.


Additional Issues
~~~~~~~~~~~~~~~~~

SAVE and RESTORE will need some sort of accommodation to handle cloned devices,
as the saved pointers to dynamic allocations will not be valid upon restoration.
I have not looked into this, however, as I am unfamiliar with the operation of
the commands or ramifications of the changes.


Summary
~~~~~~~

By providing support within SCP for device cloning, most of the work of
creating, initializing, and destroying cloned devices is done on behalf of the
device simulators, with relatively little left to the "adjust" routine.  Adding
clone support to existing devices is mostly a matter of collecting all state
variables into a single structure to make copying easy and modifying internal
utility routines to access those variables through a state pointer obtained from
a mapping array that is indexed by the instance number.  An existing simulator's
static DEVICE, UNIT array, and DIB are all retained.

As the "sim_devices" array retains its current structure, all routines in SCP
and the various simulators that access it remain unchanged.


Extensions
~~~~~~~~~~

Once dynamic DEVICE allocation is implemented, it would be relatively simple to
extend this to dynamic UNIT allocation.  A cloned device is allocated a UNIT
array whose size is determined by the "numunits" field of the DEVICE structure.
It would be a simple matter to change the unit count by dynamically allocating a
larger or smaller UNIT array and adjusting "numunits" appropriately.

Restrictions on the maximum number of units would be necessary, as typically a
device controller supports only a limited number of attached peripherals.


Rejected Ideas
~~~~~~~~~~~~~~

One idea was that the static "sim_devices" array would be used as a template for
a new dynamically allocated array that added or removed pointers to allocated
copies of the specified DEVICE and subsidiary structures.  SCP would be changed
to reference the "sim_devices" array through a global pointer that is changed
whenever the device count is revised and a new array is needed.

This would work and would require less memory than an intentionally oversized
"sim_devices" array.  However, it required changes to SCP and any VM that
accesses its own "sim_devices" array to work through the pointer.  The
complexity isn't warranted, given that "extra" entries only cost four bytes per
potential clone.  Also, the number of possible clones is strictly limited, e.g.,
by the number of device select codes, channel numbers, interrupt priorities,
etc.  Even allowing for the HP 1000 limit of 64 I/O select codes only means an
array that "wastes" 256 bytes.

Bob Supnik's original idea was to supplement the "sim_devices" array with a
linked list of cloned devices.  That also required changes to affected VMs to
scan the linked list to find a given device.

-----

Other arrangements of mapping devices to state structures are possible, for
example:

 1. A global pointer that points at the original DEVICE pointer element in the
    "sim_devices" array, and a global pointer that points at an allocated array
    of STATE structures.

 2. A global pointer that points at the original DEVICE pointer element in the
    "sim_devices" array, and a local pointer in each device context (i.e., DIB)
    structure that points at an allocated STATE structure.

 3. A global pointer that points at a dynamically allocated array of structures,
    each element of which contains a DEVICE pointer and an associated STATE
    pointer.

The arrangement chosen affects how the REG "loc" fields are adjusted.  For the
following discussion, it is assumed that the original device is defined by a
static DEVICE structure and a static STATE structure.

Arrangement 1 places all duplicates of the state structure in an array whose
initial element is a copy of the original state structure.  Each time the copy
count is increased, space for a larger array must be allocated and filled in.
Use of the "realloc" routine will help here, as all prior entries are preserved
in the new array.  However, this may alter the base address of the allocation,
so in addition to the global pointer, ALL of the REG copies must be updated, not
just the new ones added.  Updating is relatively simple, as each duplicated
field is offset exactly one structure-size from the prior one.

Access to both the DEVICE and STATE structures are accomplished by indexing into
the array by the copy number (e.g., "device [copy]->field", where "device" is
the global DEVICE array pointer, and "state [copy].field", where "state" is the
global STATE array pointer).  Reduction in count is trivial -- just a "realloc"
to a smaller state array, plus a check to see if the location changed; if it did
not, then REG adjustment is not needed.

Arrangement 2 allocates space for discrete copies of the device state, with
pointers to the separate copies entered into a dedicated field of the device
context (DIB) structure.  This arrangement has the advantages that expansion and
contraction of the state array is not needed and that previously allocated REG
arrays do not need to be adjusted whenever a new device state is added; only the
new REG arrays do.  However, updating is a bit more complex, as each REG array
must be offset separately from its corresponding state structure address, as the
various state structure allocations bear no relationship to each other.

DEVICE access is identical to Arrangement 1, but STATE access is more
cumbersome, as it is via the device array indexed by the copy number to get the
context pointer, then indirect through that pointer to get the pointer to the
state, and then indirect through that to get the desired state field.
Logically, access is "device [copy]->ctxt->sptr->field", where "device" is the
global pointer to the initial "copy 0" device in the "sim_devices" array.  But
in practice, "ctxt" and "sptr" are void pointers and so must be cast to DIB and
STATE pointers before dereferencing, e.g., "(STATE *) (((DIB *) device
[copy]->ctxt)->sptr)->field".

Arrangement 1 and 2 have the disadvantage is that the "device" pointer can
change if device copy counts, including those of other unrelated devices, are
changed. So, e.g., changing the tape reader's copy count must notify all other
devices that support dynamic allocation that the locations of their devices
within "sim_devices" have changed.

Arrangement 3 allocates a dynamic array of structures, each containing a DEVICE
pointer and a STATE pointer, that is indexed by copy number and is expanded and
contracted as needed.  The DEVICE pointers are the device-specific subset of
entries in "sim_devices", and the STATE pointers point at the corresponding
state structures.

This subset would have to be reallocated whenever the device's "adjust" routine
is called to adjust the number of cloned devices, but only the global pointer
would need to be changed.  This would also eliminate the disadvantage regarding
notifications to all clonable devices.  DEVICE and STATE field access is via
"dsptrs [copy].dptr->field" and "dsptrs [copy].sptr->field".

A simplification of Arrangement 3 is to use a static array of structures
containing the DEVICE and STATE pointers.  The device's "adjust" routine must
return the number of allowable device instances, which may be constrained, e.g.,
by available channel identifiers, to a relatively small number.  Even if the
full complement of 255 devices instances is permitted, the static mapping array
would still be only 2K-4K bytes in size.  Using a static array has the strong
advantage that dynamic allocation is eliminated, along with having to keep track
of the changing pointer to the array.


------------------ old stuff:

SCP will handle allocation and copying of the clonable DEVICE structures, plus
some of the subsidiary structures, such as the UNIT array, that must be copied.
Then it will call a VM-provided routine that will carry out whatever additional
initialization or adjustment of the structures is needed.  Device copies might
need separate MTAB arrays, depending on whether or not the "desc" field points
at device-specific data.  Separate REG array copies will need to be adjusted to
reference per-device state variables.  The VM routine will also allocate and
fill device name strings (selected algorithmically) and VM-specific DIBs.
Copies likely won't need separate debug tables.

If initialization fails, e.g., because memory for the separate structures cannot
be allocated, the routine will return an SCP error, which will cause SCP to
abandon the operation and leave the device pointer array unchanged.  If the
routine returns success, SCP will free the prior device array pointer
allocation, if any, and then change its device array pointer to reference the
new array.

The same user command will be able to reduce the number of device copies by
specifying a smaller number (though not below one).  This will allow user
errors, such as specifying 44 copies instead of an intended 4 copies, to be
corrected without having to restart the simulator.  The VM routine will be
called, this time specifying a reduction, which the routine may accept or
reject.  After VM processing and concurrence, the indicated copies will be
discarded.

After appropriate deallocation, the space occupied by the device pointer array
elements to be removed will be "closed up", and the array allocation will be
shrunk with a call to "realloc".  Device reduction will be inhibited if any
associated units are in the event list or are attached.

The SCP alterations to support dynamic device allocation will be constructed to
require no changes to any simulator that does not support cloning.  So all
existing simulators will work "as-is" without changes.

The SCP changes are:

 - SET <dev> COUNT=<n>

   A new SCP global command.  It sets the total number of devices to the
   specified value, so if count = 3, there will two copies created plus the
   original device.  A subsequent "SET <dev> COUNT=4" would add one more copy,
   while a subsequent "SET <dev> COUNT=1" would remove all copies, leaving only
   the original device.

It might be preferable to create a new command, e.g., "ALLOCATE <dev> <count>",
rather than reserving a name that a device simulator could otherwise use as a
modifier.


 - DEVICE *sim_devptrs [] = sim_devices;

   A new SCP global variable.  Statically initialized to point at the VM's
   device pointer array but can be changed to point at a new dynamic array
   allocation.  SCP uses of "sim_devices [n]" are changed to "sim_devptrs [n]".
   If a VM supports dynamic device allocation, all uses of "sim_devices" must be
   changed there as well.  Otherwise, continued VM use of "sim_devices" is
   permitted.

Clone pointers will be inserted in a dynamically allocated copy of the device
pointer array immediately following the original device pointer.  This provides
a cleaner device display, as well as making it easy to determine the number of
existing clones without having to keep separate counts.  It also simplifies
SCP and VM access, which currently expects a simple array of pointers.  It makes
insertion and deletion a bit harder, but that is only done at cloning time.

It might be desirable to make "sim_devptrs" a local variable and provide a
function that returns its value to guard against any VM alterations.  We need
one or the other -- we cannot, e.g., pass the pointer to the VM during
reallocation because there are some devices that need to walk the device
structure but do not themselves permit cloning.  An example is the HP 3000 IOP
("I/O Processor") device.


 - t_stat (*reallocate) (DEVICE **changed_dptr, int32 changed_count)

   A new DEVICE field, initialized by default to NULL for VMs that do not
   support dynamic allocation.  A supporting VM initializes this field to point
   at a routine that receives notification of reallocations of the device
   pointer array that affect the specified device.  The "changed_dptr" parameter
   points at the first of potentially several device pointer array entries that
   have been inserted or will be removed.  The "changed_count" parameter
   indicates the number of device pointers that have been added (if positive) or
   that will be deleted (if negative).  If this parameter is zero, then the
   "changed_dptr" parameter points at the original device pointer array entry,
   and this is a notification that the device pointer array has moved due to
   reallocation.

   The routine is responsible for initializing newly cloned DEVICE structures,
   including additional allocations for subsidiary structures as needed, and for
   deallocating any structures from clones to be deleted that were added during
   the initialization call.  If the routine returns an SCP error code, the
   addition or deletion is rejected.

   Not all devices are clonable, e.g., the CPU or the system clock.  If the VM
   does not set this field, SCP will reject clone commands for the device with a
   "Command not allowed" error.  As existing simulators will default this field
   to NULL, none of their devices will be clonable.

One issue is how much of the cloning work the SCP routine should do before
calling the VM's device reallocator.  SCP will allocate new DEVICE structures,
initialize them from the original DEVICE, and insert pointers to the cloned
devices in the device pointer array following the original device pointer.  SCP
could also allocate, fill, and set references to clone copies of the UNITs and
REGs, as these will always be needed.  The VM routine is responsible for
altering the REG arrays to point at the correct per-device state variables.

SCP might decide whether to clone MTAB structures, based on whether all of the
original "desc" fields are NULL.  Unless one or more of these fields is set, the
modifiers are not device-specific, and the pointer to the original MTAB table
can be left in the copy of the DEVICE structure.  The VM could use this same
rule to determine whether to modify a cloned array.

SCP would not clone the DEBTAB array, as this would not normally be
instance-specific.  The VM routine could allocate and change a DEBTAB copy if
needed, as long as it also handled deallocation when called to reduce the number
of copies.

The VM routine would be responsible for giving the clones names and allocating
and assigning the name strings.  This seems better than having the user choose
the names during cloning, as (a) presuming a proper choice of naming
algorithms, there would be no need to check for duplicates, (b) the user manual
could identify the naming scheme to be used for clones and so remain directly
relevant, and (c) the routine could likewise derive and set any needed logical
names for the new device clones.


 - #define DEV_CLONE   [...]

   A new device flag that is set by SCP on allocated copies of DEVICE
   structures.  When scanning the device list to add or reduce the number of
   copies of a given device, SCP can use this flag to determine how many copies
   currently exist.

Only entries with DEV_CLONE set will be "freed" if the number of copies is
reduced.  Because pointers to clones appear in the device pointer array
immediately following the original device pointer, a scan forward from that
point counting DEV_CLONE flags would yield the number of clones without having
to keep a separate count of each device's clones.  The flag also prevents the
user from cloning a clone.


Example
~~~~~~~

As an example, the process for producing four instances of the "XYZ" device
would involve these steps:

 - The user enters the command "SET XYZ COUNT=4".

 - The set_cmd routine locates the COUNT keyword in the set_dev_tab and calls
   set_dev_count, passing a pointer to the DEVICE structure, a pointer to its
   first unit, a zero value, and a pointer to the "4" in the command line.

 - The command executor checks that "XYZ" is a clonable device, i.e., that the
   DEV_CLONE flag is not set and that the "adjust" field is non-NULL.  If it is
   not clonable, the routine returns "Command not allowed" status.

 - The executor calls dptr->adjust, passing NULL for the changed_dptr parameter.
   The adjuster returns the size of the device table (e.g., 50 entries) in the
   upper word and the maximum number of allowed "XYZ" devices (e.g., 16) in the
   lower word.

 - The executor parses the count from the cptr parameter.  If parsing fails, or
   if the value exceeds the device count limit, the routine returns "Invalid
   argument" status.

 - The executor scans the current device table to locate the device to be cloned
   and to count the current number of device clones and the total number of
   devices in the table.

 - As the COUNT value increases the device count, the needed table size (i.e.,
   current table size + 3) is checked against the reported table size.  If the
   table would overflow, the routine returns "Address space exceeded" status.
   Otherwise, the current table is "opened" by three elements to allow the new
   cloned device pointers to be inserted after the original device and any
   existing clones.  The routine then counts the number of elements in the
   device's REG array.

 - For each new device clone, the routine allocates a new DEVICE structure.
   Each structure is initialized by copying the original DEVICE structure and
   adding the DEV_CLONE flag.  The "lname" field of each copy is set to NULL to
   avoid having multiple devices pointing to the same logical name.  Then the
   routine allocates a new device name array, a new UNIT array, and a new REG
   array, setting the block pointers into their corresponding locations in the
   DEVICE structure.  Each of these allocations is then initialized by copying
   the values from the corresponding item in the original device.  The name is
   made unique by appending to the name a letter that is derived from the clone
   number (e.g., the three new "XYZ" clones are named "XYZA", "XYZB", and
   "XYZC").  Each of the units is "sanitized" by clearing the fields associated
   with event activation and attached files, clearing the UNIT_RO, UNIT_ATT, and
   UNIT_BUF flags, and clearing the UNIT_PIPE dynamic flag.

 - If any of the allocations fail, all prior allocations are freed, and the
   routine returns "Memory exhausted" status.  If all succeed, the routine calls
   dptr->adjust, passing a pointer to the first new device clone pointer entry
   in the sim_devices table and a positive value indicating the number of added
   clones (in this case, +3).

 - The "XYZ" adjust routine performs VM-specific adjustment of each of the newly
   cloned devices.  In this example, it allocates a new DIB structure, a new
   local state structure, and a new MTAB array -- the latter because some of
   their "desc" fields point into the local state structure.  The DIB and state
   structures are initialized as appropriate.  The MTAB array is first copied
   from the original device.  Then each element is examined, and if the "desc"
   field is not NULL, it is relocated from the original state structure field to
   the cloned device's state structure field.  The same relocation is performed
   on the "loc" field of the REG array elements.  If any of the allocations
   fail, all prior allocations are freed, and the routine returns "Memory
   exhausted" status.  If all succeed, the routine returns success status.

 - If the adjuster returned success, the command executor calls the device's
   reset routine (if defined) with the "P" (power-on) switch for each newly
   cloned device.  This allows the reset routine to perform whatever initial
   actions were taken with the original device at simulator startup.

 - The executor then exits, and the user is returned to the SCP prompt.
   Performing a SHOW DEVICES command will reveal the new clones in the device
   table.

If the user then wishes to delete two of the instances, these steps would be
involved:

 - The user enters the command "SET XYZ COUNT=2".

 - The second through sixth steps above are repeated.

 - As the COUNT value decreases the device count, the command executor calls
   dptr->adjust, passing a pointer to the first device clone pointer entry to be
   removed from the sim_devices table, and a negative value indicating the
   number of clones to be removed (in this case, -2).

 - The "XYZ" adjust routine frees the DIB structure, local state structure, and
   MTAB array for each of the indicated device clones.  It also performs
   whatever housekeeping it needs to account for the removed devices.  It then
   returns success status.

 - For each cloned device to be removed, the executor frees the logical name
   array, REG array, UNIT array, device name array, and finally the device
   structure itself.

 - After all device structures are freed, the executor "closes" the sim_devices
   table to remove the two deleted cloned device pointer elements.  It then
   exits, and the user is returned to the SCP prompt.

SAVE and RESTORE likely will need some sort of accommodation to handle cloned
devices, as the saved pointers to dynamic allocations will not be valid upon
restoration.  I have not looked into this, however, as I am unfamiliar with the
operation of the commands or ramifications of the changes.

--------------------- more old stuff:

I ask because my current HP 3000 work on the General I/O Channel (GIC) will
eventually lead to a new 3000 CPU simulator for a class of machines that have
only two I/O card types: GICs and 8-channel terminal muxes.  Each therefore
needs to be supplied with multiple copies of these two devices. The problem is
how many of each.

Given that a GIC can support eight devices (e.g., disc drives), and the t-muxes
can support eight terminals, I'd have thought that four of each would be more
than sufficient.  But in speaking with some old HP customers, I found that some
sites spread their discs one-per-GIC to take advantage of parallelism (each GIC
has a dedicated DMA controller, but it operates only one of the connected GIC
devices at a time).  So some sites used eight GICs or more.

Also, the upper end of the machines I will simulate could support 400 terminals,
so 50 t-muxes.  Now I don't think anyone in their right mind is actually going
to have 400 users on a simulator.  But it was pointed out to me that some folks
might want to load and run their old system dump tapes, and if all of the
configured t-muxes aren't present, the system won't come up.

I know the sim_device array wants to be contiguous, and I know that registers
will be a problem, so I don't think that duplicating "copy-ready" devices could
be handled entirely at the SCP end without a VM assist.  But I was wondering if
you had thought about this and uncovered some show-stopper that makes it all
infeasible.

I haven't given dynamic allocation more than cursory thought so far, but I was
hoping to avoid major changes to SCP.  To that end, I was thinking that SCP
could provide a new hook pointer to a device array that would default to point
at the "sim_devices" array contained in the VM.  A VM that is expansion-aware
could then change it to point at a dynamically allocated device array instance
during startup and any time it wanted to change the device count.  SCP would
then need only be changed to go indirectly through the pointer, rather than
directly through the external.

I was also thinking to keep the contiguous array, so that indexing in SCP or
VMs wouldn't need to change.  The array would be expanded or contracted by
inserting or removing copy entries adjacent to the "master" entry for a
device.

Some of these actions, including the UI, might be profitably implemented in
SCP, with a VM call to make any needed initializations as a result of the
additions or deletions.  I expect that a VM would have a mix of clonable and
non-clonable devices (e.g., we wouldn't want six system clocks).  This could
either be designated by a new device flag if handled by SCP, or determined by
the VM call, which would return failure if the device altered is not clonable.
A VM that did not change the device array hook at startup would not permit
cloning commands.

The devices I'm currently designing for multiple use keep all of their
state in arrays of structures (which was the primary driver for having a
register arrangement that could access structure fields).  HP 2100 and 3000
device context structures contain a "card index" that indicates the copy
number of the device (each device needs its own copy of the context to hold
card-specific things like I/O address).  Typically, devices have several such
arrays, each accessed via the card index.

These are all inchoate ideas at this point.  However, I would want whatever I
come up with to have the lightest possible touch on SCP.


Possible Approaches
~~~~~~~~~~~~~~~~~~~

To add copies of existing devices or to remove them, the currently static
sim_device array must be expanded or contracted dynamically in response to a
user command.  This can be done in several ways.  The existing sim_devices array
can be:

 1. Used to create a linked list of dynamically allocated device pointer
    entries, which is then used to add or remove devices.  SCP exports a pointer
    to the head of the list for use by the VM.

 2. Copied to a larger or smaller array in a block of dynamically allocated
    memory each time the number of devices changes.  SCP exports a pointer to
    the current block for use by the VM.

 3. Defined with a larger number of elements than initial devices, and then
    expanded or contracted in place as the number of devices change.  The static
    size of the array is returned by a call to a VM-defined routine.  SCP
    ensures that the user cannot expand the number of device beyond the table
    size.

All three approaches retain the existing static sim_devices array, so that VMs
that do not support cloning will continue to work as before.  For VMs that do
support cloning, approaches 1 and 2 require that they use the new list pointer
or block pointer to access the device list, as sim_devices is not updated to
reflect device additions or deletions.

Approach 1 requires that SCP and any supporting VM that currently uses the
device list will have to be rewritten to walk the list.  Approach 2 requires
that SCP and VM use the block pointer, but as the block contains an array, the
only change required is the name of the variable used to access the array of
device pointers.  So, e.g., a use such as "sim_devices [index]" is changed to
"sim_devptr [index]".  This works because an array name and a pointer to the
first element are equivalent.

A major advantage of approach 3 is that it requires no SCP or VM changes to use
the dynamic device list.  All references to sim_devices remain valid, even as
the content of the table changes.  Approach 3 also reduces the use of dynamic
allocation and is somewhat simpler to implement.

A drawback of approach 3 is that it imposes a fixed limit on the table size and
therefore on the number of device clones.  This may not, though, be a
significant hardship, as other conditions, e.g., I/O address space, available
channel numbers, etc., might also restrict the potential number of devices.
Also, each spare table element only requires a few bytes, so the memory overhead
of having a large table is not substantial.

Regardless of the approach, VM assistance will be required to ensure that all
device-specific data, such as units and registers, are duplicated, and that
memory references, such as register variable references, are adjusted to point
at the duplicates.


Implementation Overview
~~~~~~~~~~~~~~~~~~~~~~~

A VM must be written to support cloned devices explicitly, as VM assistance is
required to complete the cloned device configuration.  Cloning is initiated by a
user command, e.g., "SET <dev> COUNT=<n>" or possibly "ALLOCATE <dev> <n>".
After parsing and validating the command, a routine is called to add or delete
entries in the device list.  The routine allocates (or frees) DEVICE, UNIT, and
REG structures, and initializes them from the original device.  It then calls a
per-device VM-supplied routine to adjust the initializations as needed.  For
example, the REG structures will contain pointers to the original device, unit,
or other state variables.  These pointers will be copied to the newly created
devices but will need to be adjusted to point at the device-specific state.  The
MTAB array also must be duplicated and adjusted if it contains any references to
state variables.

The command parser, validator, and allocation routine may be located in one of
several locations:

 1. In SCP, driven by the SET command processor.

 2. In the SCP Extensions module, driven by an extension command that hooks the
    standard SET command processor.

 3. In the VM system module, driven by MTAB entries in each clonable device.

Location 1 provides cloning capability to all VMs but will require in addition
to the aforementioned implementation routines:

  * A new device system flag, SIM_CLONE, that marks cloned devices.

  * A new DEVICE field containing a pointer to the VM-supplied, device-specific
    adjustment routine.  Field initialization defaults to NULL, which indicates
    that the device is not clonable.

Location 2 requires no SCP or DEVICE changes but also restricts cloning services
to the HP simulators.  A new VM-wide device user flag marks cloned devices, and
a VM-defined adjust routine is called that then dispatches to a per-device
adjust routine via a pointer in the DIB (the latter is necessary because DIBs
are VM-specific).  An alternative is to repurpose one of the 4.0 compatibility
fields in the DEVICE structure, although this could cause problems if the field
is subsequently interpreted by SCP 3.x.  The exported pointer or size is
supplied by the extensions module.

Location 3 has the same restrictions as Location 2.  It uses the same device
user flag, but the DIB does not change, as the MTAB entry's validation routine
can pass a pointer to the device's adjust routine when calling the main cloning
routine.  However, it requires that the cloning routine be duplicated in each of
the HP simulators, although this could be implemented as a VM-callable routine
in the SCP Extensions module.  Also, each clonable device would need its own
MTAB entry and validation routine to handle clone commands.

Consequently, Location 1 or 2 is preferred.


----------------------
Internal Units Display
----------------------

Some devices use one or more SIMH units for internal purposes.  An example is
the HP3000 DS (disc) device that uses eight regular units to represent
individual disc drives and a ninth internal unit to represent the disc
controller.  To hide the controller unit from the user, it has the UNIT_DIS flag
set, which causes the SHOW DS command to display only units 0-7 as desired.

On occasion, a device needs no units.  An example is the HP3000 IOP (I/O
Processor) device.  However, SCP requires that each device has at least one
unit, so a dummy unit is provided.  If that unit has no unit flags set, the SHOW
IOP command displays "IOP".  However, if the dummy unit has the UNIT_DIS flag,
as logically it should have, then SHOW IOP displays "IOP, all units disabled".
This is confusing, as it suggests that the user can alter the situation with one
or more SET IOPn ENABLED commands.

A similar situation occurs if a single internal unit is used.  UNIT_DIS must not
be set if the "all units disabled" status is to be avoided.  But then a problem
occurs if a second internal unit is subsequently added.  An example is the
HP3000 CPP (Channel Program Processor) device.  This device originally had a
single internal unit to time a completion event.  With UNIT_DIS present, the
SHOW CPP display was:

  CPP, all units disabled

As there are no modifiers to affect the unit, UNIT_DIS was removed, resulting in
the display:

  CPP

A second internal unit was then added to handle a timeout event.  But then the
SHOW CPP display changed to:

  CPP, 2 units
    CPP0
    CPP1

To remedy the confusion, UNIT_DIS must be added to one, but not both, of the
internal units.  Any additional future internal units must also carry the
UNIT_DIS flag.

So to get the desirable display, one and only one internal unit must be enabled.
This requirement stems from these lines in the "show_display" routine in scp.c:

  if (ucnt == 0)
      fprintf (st, ", all units disabled\n");

Here, "ucnt" is the count of enabled units.  The routine also has "udbl", which
is the count of units disabled by the user.  If the above condition were changed
to:

  if (ucnt == 0)
      if (udbl > 0)
          fprintf (st, ", all units disabled\n");
      else
          fprintf (st, "\n");

...then the "all units disabled" message would appear only when the user has
disabled all of the units.  It would be suppressed if none of the device units
can be enabled by the user.


--------------------------------------
Internal Units Display with User Flags
--------------------------------------

A problem occurs if an internal unit has user flag settings.  An example is the
HP3000 CPU, which uses one internal unit to represent the CPU's process clock.
Logically, this unit should have the UNIT_DIS flag, as the user cannot alter any
aspect of the process clock.  However, because the unit has user options
reflected in its "flags" field (for programming convenience, rather than in the
device "flags" field where they belong), adding UNIT_DIS will cause them to be
omitted from the resulting display.  That is, SHOW CPU goes from:

  CPU, Series III, EIS, no CIS, auto-restart, 1024KW, calibrated timing

...to:

  CPU, Series III, EIS, no CIS, auto-restart, realistic timing

...which is not only missing the memory size but is also incorrect because the
CPU is not set for "realistic timing," which is a toggle with "calibrated
timing."

The situation worsens if a second internal unit is added.  Then the display
becomes:

  CPU, Series III, EIS, no CIS, auto-restart, 2 units
    CPU0, 1024KW, calibrated timing
    CPU1, realistic timing

...which is not only wrong but confusing as well.

The workaround here is to add UNIT_DIS to one, but not both, of the internal
units.  The chioce is important, because it must be applied to the unit without
the user flags settings.

The bottom line is: don't place user flag settings on internal units.  There is
no reason to do this once expanded MTABs are available to place them on the
device where they belong.