TITLE	UNARC   CP/M Archive File Extractor

IDENT	MACRO
	DB	'UNARC  1.6  27 Mar 87'
ENDM

; (Remember to update version/date here and maintain history log below)

SELF	MACRO			; Self-unpacking archive file name
	DB	'UNARC16'
ENDM

COPR	MACRO
	DB	'Copyright (C) 1986, 1987 by Robert A. Freed'
ENDM

	.COMMENT |

NOTICE:  This program is the copyrighted property of its author -- it
is NOT in the public domain.  HOWEVER...  Free use, distribution, and
modification of this program is permitted (and encouraged), subject to
the following conditions:

(1)  Such use or distribution must be for non-profit purposes only.
(2)  The author's copyright notice may not be altered or removed.
(3)  Modifications to this program may not be distributed without
     notification of and approval by the author.
(4)  The source program code may not be used, in whole or in part,
     in any other publicly-distributed or derivative work without
     similar notification and approval.

No fee is requested or expected for the use and distribution of this
program subject to the above conditions.  The author reserves the right
to modify these conditions for any future revisions of this program.
Questions, comments, suggestions, commercial inquiries, and bug reports
or fixes are welcomed by the author:

		Bob Freed
		62 Miller Rd.
		Newton Centre, MA  02159
		Telephone (617) 332-3533
|
	PAGE
	SUBTTL	Modification History

	.COMMENT |

1.6	27 Mar 87  (RAF)

- Murphy's Law strikes again:  Within hours after the release of version
  1.5, a bug was discovered.  Incorrect CRC error messages are generated
  during file extraction in some situations.  This was caused by failure
  to clear carry before a 16-bit subtract (SBC HL,DE), which we changed
  inadvertantly in 1.42.  (So much for Beta-testing!)  Such faulty error
  messages occur only for disk file extraction, not when the 'C' command
  option is used to check an archive.  Furthermore, the bug occurs only
  when (1) a file contains an odd number of 128-byte records and (2) the
  BDOS returns from the last write-record call with carry set.  [Note of
  interest:  The CP/M 2.2 BDOS returns with carry set only if the output
  drive is different than the current default drive.  This assumes, of
  course, that no RSX-type system extensions are in place to intercept
  BDOS calls:  We would have caught this bug, but for such a system
  extension which always clears carry before returning from BDOS calls.]
  Our thanks to Tom Brady for reporting this one.

- Zero-fills last record of .COM file.  (Not needed with Z80ASM and/or
  SLRNK, but provided so that M80/L80 will generate identical output to
  that produced by the SLR Systems' tools.)

1.5	24 Mar 87  (RAF)

- UNARC is now distributed as a self-unpacking archive, UNARC15.ARK.
  This requires: (1) the non-z80 version (UNARCA.COM) must be the FIRST
  file in the archive, (2) UNARCA.COM must be stored in UNPACKED form
  using compression version 1, (3) the header for UNARCA.COM must be
  preceded by the SINGLE byte, 0C3H (opcode for unconditional jump),
  and (4) the archive must be copied or renamed to UNARCxx.COM on the
  current disk drive (xx = current version, i.e. UNARC15.COM for this
  release).  Then, the file is executed with a single optional parameter
  specifying the disk drive to use for extracting all files (defaults to
  current drive).  For example, assuming UNARC15.ARK is on drive B:

  A>B:				; Set current drive for UNARC15.ARK
  B>REN UNARC15.COM=UNARC15.ARK	; Rename it to UNARC15.COM
  B>UNARC15 [d:]		; Extract files to current drive [or d:]

- Corrects non-Z80 version emulation of the Z80 16-bit add and subtract
  instructions (ADC_HL and SBC_HL macros), to properly set the Z(ero)
  condition flag.  Previously, Z reflected only the upper byte of the
  16-bit result and was incorrect for non-zero results less than 256.
  This caused a serious bug (in the non-Z80 version, UNARCA.COM, only):
  Failure to output the last 1-255 bytes of an extracted file in cases
  where the final output buffer size was less than 256 bytes.  (In
  particular, ALL files less than 256 bytes in length could not be
  extracted.)  Thanks to Barry Kaufman (Multipath, Inc., P.O. Box 395,
  Montville, NJ  07045) for bringing this to our attention.  [This
  tends to confirm our opinion regarding the prevalence of non-Z80
  systems, since this bug has been present but unreported since the
  release of UNARC 1.2.]

- Alters the interpretation of the USELUX definition in the UNARCOVL.ASM
  overlay file.  USELUX = YES now restricts file typeout buffering to
  one page (equivalent to TYPGS = 1) instead of altering the upper TPA
  limit (CCPSV value).  This eliminates the LUXSIZ definition (which
  specified the size of the LUX RSX-type resident code) and avoids the
  confusion introduced by recent multiple new versions of LUX from
  different authors.

- Corrects CP/M 2.2 tab alignment for the first displayed line of file
  typeout after continuing from a screen pause ([more] message).

- Adds explicit check for CTRL-S (suspend output) in CABORT, to handle
  cases where standard CP/M 2.2 BDOS misses these.  (Also masks console
  input characters to 7 bits, in case this is not done, as it should be,
  by BIOS.  This is an attempt to solve reports of failure to recognize
  CTRL-C and CTRL-S on some systems.)

- Allows 0-length "crunched" files (i.e. with no code size byte).  [The
  various MS-DOS ARC utilities differ in their handling of 0-length
  files.  SEA's ARC generates unpacked (version 2), which we feel is
  esthetically best, and ARCA generates packed (version 3).  But PKARC
  generates crunched (version 8), which was regurgitated by earlier
  UNARC versions due to the absence of the code size byte.]

- Minor code improvements for version 1.42 changes.

- Eliminates DS directives at end of file to avoid wasted space when
  linked with L80 (as opposed to SLRNK, which handles trailing
  uninitialized data intelligently).  This also permits overlaying of
  the self-unpacking initialization code by data in the non-Z80 version.

1.42	07 Jan 87  (RAF)

Interim Beta-test release:

- Supports 'squashed' files (compression version 9) generated by PKARC
  version 2.0, as defined by Phil Katz' document file SQSHINFO.DOC,
  dated 12/27/86.  (Katz is certainly doing his best to make life
  interesting for us.)  Note:  We've made an educated guess that Katz'
  handling of bypassed output codes after adaptive reset is identical to
  that of crunched (version 8) files.  (Since there is no requirement
  for ARC512 compatibility here, he could have handled this in a less
  brain-damaged manner.  However, on the basis of two very limited test
  examples, our assumption appears to be true.)  This compression method
  requires a minimum TPA size of 30K (Z80) or 31K (8080) for extraction
  (worst case yet).

- Lists total of CRC values (mod 64K), as per suggestion of Steven
  Greenberg.  This provides a simple single checksum value for comparing
  files created by different archive programs.  (Since the CRC is
  computed over the UNcompressed files, this value should be the same
  for all archives created from the same set of input files, independent
  of any particular variations in file order or compression methods.)

- Adds trailing command line option 'C' to check the validity of one or
  more (or all, via *.*) archive members (i.e. to extract them for
  purposes of CRC and length checking, without storing them as disk
  files).  This is a quick hack, in response to a suggestion by Keith
  Petersen.  This option is currently allowed only if the wheel byte is
  non-zero.  I.e., it is ignored in restricted RCP/M versions (although
  there is no reason why this could not be allowed, subject to a Sysop-
  definable patch byte).  Also, the limited command line syntax prevents
  the simultaneous use of the 'N' option for non-paged typeout (i.e.
  screen pauses will always occur).  Both of these limitations will be
  eliminated with addition of enhanced command line processing
  (including du: user area syntax) in a future release.

- Disallows use of 'P' option for printing files in restricted (RCP/M)
  versions.  (We inadvertantly failed to implement this as intended in
  the 1.41 release.  Hopefully, the recipients of that release will
  honor our limited-distribution request!)  Note that the statement
  accompanying the 1.41 release is slightly incorrect:  Both 'P' and 'C'
  options are processed ONLY if the wheel byte is non-zero (and in the
  absence of an output file drive, which always causes extraction to a
  disk file); a zero HODRV byte does not, in itself, inhibit these.

- Makes .ARK the preferred default archive filetype.  I.e., first open
  attempt uses .ARK; second attempt tries .ARC if first is unsuccessful.

- Expands help message usage examples a bit (now that 4K limit is not a
  concern).

Note:  The additions in 1.41 and 1.42 have pushed the size of the Z80
version UNARC.COM file above 4K (which means 6K or 8K disk space on most
systems).  Such is life (and progress):  We've resisted this for a long
time, but it now seems unavoidable.  The UNARCOVL.ASM overlay file
distributed with UNARC 1.4 remains applicable for these releases.

1.41	14 Dec 86  (RAF)

Special limited-distribution release:

- Adds trailing command line option 'P' to allow printing of an archive
  member file on CP/M list device.  This is a quick hack, in response to
  a user request (Craig Arno, Seattle), to allow direct printing of
  highly-compressed binary plot images (e.g. 1+ MB files which crunch
  to < 5% of their original size).  Accordingly, ALL data is passed to
  the printer in 8-bit form, with no filtering by UNARC (including ^Z).
  This option is allowed under the same conditions as disk extraction
  (non-zero HODRV and wheel byte), and the files which may be printed
  are subject to the filetype exclusion table for typeout.

- Defers initializing listing totals until after CHECK is called.
  (This moved in 1.4 to accomodate LPS, without realizing it might
  cause a problem, albeit with an insignificant probability.  LPS
  is now allocated in code and cleared by CHECK.)

1.4	21 Nov 86  (RAF)

We had hoped NOT to release another update of this program, but to
replace it entirely by three new programs with enhanced functionality
(UNARK, ADIR, and ATYPE), in conjunction with the upcoming release of
the CP/M archive file builder (NOAH).  However, (sigh).....

Corrects bug (exhibited with .ARC's created by version 1.1 or later of
Phil Katz' PKARC program for MS-DOS) which caused files to be extracted
incorrectly (with output file length and CRC warnings) due to string
table reset codes appearing early in crunched files (i.e. before the
output code length reaches 12 bits).  Thanks to Keith Petersen for
identifying and notifying us of this problem.

And, while we're at it.....

Adds paging of all displayed output, controlled by non-zero patch byte
specifying screen lines between pauses (TYLPS, default value = 23).
This is essentially the feature added by 'Larry Smith' (see version 1.3
below), but we've been able to do it (with enhancements) and still keep
the (Z80 version) .COM file below 4K (just!).  Causes '[more]' message
to appear at bottom of screen.  Space bar scrolls one more line, ^C
aborts, anything else scrolls one more screenful.  (LINE FEED may be
used to avoid overprinting the '[more]' line.)  May be defeated for
continuous typeout by trailing 'N' (after a blank) on command line.

Also:

- If archive filetype omitted, and the default .ARC filetype not found,
  tries .ARK as an alternate default.  (Anticipates NOAH, and compatible
  with Irv Hoff's KMD22.)

- Incorporates option to bypass BDOS function 31 call (Get DPB Address),
  for non-std CP/M clones such as Cromemco CDOS and CP/M-68K emulator
  for 8080 CP/M 2.2.  (Eliminates UNARC12 patch notice, UNARC-P1.NOT.)

- Allows program name to be patched (at start of USAGE message).
  Affects all help screen references and abort message.  (E.g., RCP/M
  sysops may prefer 'ADIR' to 'UNARC'.)

- Corrects count of bytes skipped due to invalid header when processing
  'self-unpacking' archives with more than 3 preliminary bytes.

- Enhances recovery processing for invalid archive headers, and merges
  'invalid format' and 'unexpected eof' errors.  This change tends to
  cause display of a garbage directory entry (before abort) for non-ARC
  files, but it does allow processing of certain new self-unpacking
  archives, such as Phil Katz' PKX32A11.COM.

- Changes the replacement for an invalid filename char from '_' to '$'
  (since underline is not allowed as a filename char by CP/M CCP, and
  '$' usually carries a 'temporary' significance in CP/M).

- Reduces directory listing width by one column (78 now), to allow
  one more char without extra blank line on terminals which autowrap
  after column 80 (e.g. allows leading semicolon generated by MDM7 and
  IMP during disk file capture of terminal output).

- Adds a few bells to warning and fatal messages, along with a patch
  byte to disable these (for those who prefer solitude).

- Allows ^K in addition to ^C for program abort requests.  (For certain
  ancient RCP/M systems which never pass ^C back to user programs.)

- Adds .ARK and .?Z? (CP/M CRUNCH or MS-DOS ZOO 'Z format' files) to
  list of excluded typeout extensions, and eliminates .CMD (since that
  might be a readable dBASE command file instead of CP/M-86 binary).

- Simplifies the Z80 CPU check and removes the 'Z80 Version' message
  in the help display, to save a few bytes in that version.  (Alternate
  version, UNARCA.COM, now displays '8080 Version'.)

- Adds 8080 version message recommending Z80 version, when run on a Z80.

1.3	--none--   (RAF)

This version bypassed due to appearance of several unauthorized updates
with the name UNARC13 (and not because of superstition).  Most notably,
these include Steve Sanders' unnecessary addition of ^S and ^C checking
during file typeout (because TurboDOS does not properly emulate CP/M's
handling of these in BDOS function 2 calls), and the addition of paged
typeout by 'Larry Smith' (whoever he is; a worthwhile enhancement, but
the release was deficient in several other respects).  WHY CAN'T THESE
'CONTRIBUTORS' SIMPLY CONTACT THE AUTHOR BEFORE RELEASING THEIR CHANGES
TO THE PUBLIC?!

1.2	24 Jun 86  (RAF)

Modified to allow assembly of a version which will execute on 8080/8085
CPU's.  (We resisted this initially but have been made to realize that
this is necessary to achieve true acceptance of UNARC by the full CP/M
user community.  Non-Z80 users, particularly RCP/M sysops, still exert
considerable influence in the world of public domain software.  This,
we believe, is out of proportion to their numbers, since almost all
CP/M systems sold in the last five years are Z80-based.  Nevertheless,
we've accommodated the needs of these users by extensive use of macros
which serve to emulate Z80 instructions on non-Z80 machines.)  However,
no attempt has been made to optimize for either size or speed in the
non-Z80 version (which is 1K larger and 50% or more slower than its
Z80-only counterpart).

Also:

- Implements a "wheel" byte to simplify use and installation on RCP/M's.
- Lines up file types in directory listing.
- Permits processing of "self-unpacking" archives such as the MS-DOS
  ARC51.COM file (anticipates a future scheme for distributing UNARC).
- Attempts to recover from bad archive headers by skipping extra bytes.
- Eliminates archaic "T:" syntax completely for file typeout.

1.1	24 May 86  (RAF)

Minor change to allow file typeout without the "T:" syntax (which
didn't work with almost ANY CCP replacement)...  File will be typed if
it:  (1) has no disk drive name, (2) is a single (UNambiguous) file,
and (3) is not an excluded filetype.  (Else, file will simply be listed
with no error message.)  This change was suggested by Irv Hoff's mod to
UNARC10, which he called ADIR.  (Previous "T:" method can still be
enabled, but it is now undocumented since we will probably drop it
altogether in future.)

Also shortened on-line help message, so that COM file size is now
reduced to 4K.  (For RCP/M systems, if HODRV = 0 and/or TYFLG = 0, the
help information relating to disk extraction and/or file typeout,
respectively, is automatically removed.)

1.0	03 May 86  (RAF)

First public release.  Supports file formats generated by all versions
of MS-DOS ARC through (at least) version 5.12 dated February 7, 1986.

0.0	01 Mar 86  (RAF)

I undertook writing this program to satisfy my curiosity about software
developments in the MS-DOS/PC-DOS world.  The ARC "freeware" program
(copyright by System Enhancement Associates) has been around for over a
year now and has achieved enormous popularity in the 16-bit community.
Unfortunately, the lack of a compatible equivalent for CP/M systems
renders a large amount of public domain software inaccessible to 8-bit
users such as myself.  (Note that 16-bit software can indeed be usable
on 8-bit systems, e.g. Pascal and C language programs.)  Also, an
increasing number of RCP/M systems are catering to both 8-bit and
16-bit users, and it is my hope that UNARC may find a welcome home on
such systems.

Note that I was not (initially) a fan of the sequential .ARC file
format, which is less flexible and slower to process (though certainly
more compact) than the random-access format which Novosielski .LBR
libraries have provided for years.  Therefore, I stopped short of
producing a complete ARC program equivalent which includes creation of
.ARC files.  The LZW "crunching" algorithm is impressive though (see my
editorial comments preceeding the UCR routine), and I now believe there
is a place for .ARC files in the CP/M world (particularly on RCP/M's,
where the name of the game is reducing upload/download time).  But
that's the domain of another program (i.e. my next project: NARC).

					- Bob Freed
Credits:

Primary credit is due to System Enhancement Associates' ARC author
Thom Henderson for his fine utility program (even if it's not for
CP/M).  Of course without ARC, UNARC would have no reason to exist.
But special thanks are due SEA for making publicly available the C
language source code, without which we could never have begun. |

	PAGE
	SUBTTL	Z80/8080 Version Definitions

	.Z80			; Sorry, if you're an Intel fan
	.COMMENT |

This source program uses Zilog mnemonics (author's preference) and may
be assembled with the M80 ((C) Microsoft) or Z80ASM ((C) SLR Systems)
macro assemblers.  (Relocatable code features have been avoided, so
conversion to other assembler formats should be straightforward but
may require manual expansion of the macros defined here.)

The following macro definitions enable conditional assembly of a
version which will execute on 8080/8085 CPU's.  Our intent is to
provide a non-Z80 version without imposing a limitation on any
Z80-specific capabilities in the source.  (I.e., in specific cases the
chosen emulation of Z80 opcodes does not necessarily produce the
optimal 8080/8085 implementation, in terms of either size or speed.
This approach allows us to offer a non-Z80 version without worrying too
much about its efficiency.) |

NO	EQU	0
YES	EQU	NOT NO

; For Z80ASM only, the following may be left undefined to allow
; interactive definition at assembly time.  For M80 (which does not
; support the .ACCEPT directive), the leading semicolon must be removed
; in order to generate the non-Z80 version.

;Z80	EQU	NO		; YES for Z80 version, NO for 8080/8085

IFNDEF Z80			; If not defined above (and pass 1),

N	EQU	NO		; (Allows short
Y	EQU	YES		;  responses)

	.ACCEPT	Z80		; Ask user for definition (Z80ASM only)

IFNDEF Z80			; If still not defined (must be M80),
Z80	EQU	YES		; Generate the Z80 version
ENDIF

ENDIF

	PAGE
IF Z80

; Macros for Z80 version (to simplify our effort for the 8080 version)

EX_AF	MACRO
	EX	AF,AF'
ENDM

LD_DE	MACRO	AA
	LD	DE,AA
	ENDM

STO_DE	MACRO	AA
	LD	(AA),DE
	ENDM

STO_BC	MACRO	AA
	LD	(AA),BC
	ENDM

ADC_HL	MACRO	AA
	ADC	HL,AA
	ENDM

SBC_HL	MACRO	AA
	SBC	HL,AA
	ENDM

LD_IX	MACRO	AA
	LD	IX,AA
	ENDM

STO_IX	MACRO	AA
	LD	(AA),IX
	ENDM

PUSH_IX	MACRO
	PUSH	IX
	ENDM

POP_IX	MACRO
	POP	IX
	ENDM

INC_IX	MACRO
	INC	IX
	ENDM

ADD_IX	MACRO	AA
	ADD	IX,AA
	ENDM

LD_A_IX	MACRO
	LD	A,(IX)
	ENDM

ENDIF				; Z80
IF NOT Z80

; Macros for 8080 version (to emulate Z80-only opcodes)

; Note:  Many of these emulations of Z80 instructions do not correctly
; implement the setting of the condition flags (e.g. DJNZ should not
; alter the Z flag).  In all such cases, we have been careful to ensure
; that an exact emulation is not required anywhere in the code, but
; extreme vigilance is needed when making future program changes.
; (Exact emulation is always possible if necessary, so avoid trying to
; code around the differences:  Our goal should be to always produce the
; best possible Z80 version!)

JR	MACRO	AA,BB
IF NUL BB
	JP	AA
ELSE
	JP	AA,BB
ENDIF
	ENDM

DJNZ	MACRO	AA		; Destroys SF, ZF
	DEC	B
	JP	NZ,AA
	ENDM

EX_AF	MACRO
	PUSH	HL
	PUSH	AF
	LD	HL,(AFSAV)
	EX	(SP),HL
	LD	(AFSAV),HL
	POP	AF
	POP	HL
	ENDM

EXX	MACRO			; Long enough to warrant subroutine
	CALL	EXX
	ENDM

LD_DE	MACRO	AA
	EX	DE,HL
	LD	HL,AA
	EX	DE,HL
	ENDM

STO_DE	MACRO	AA
	EX	DE,HL
	LD	(AA),HL
	EX	DE,HL
	ENDM

STO_BC	MACRO	AA
	PUSH	HL
	LD	H,B
	LD	L,C
	LD	(AA),HL
	POP	HL
	ENDM

ADC_HL	MACRO	AA
	ADSBHL	AA,ADC
	ENDM

SBC_HL	MACRO	AA
	ADSBHL	AA,SBC
	ENDM

ADSBHL	MACRO	AA,BB
	PUSH	AF
	LD	A,L
CC	DEFL	NO
	IRPC	DD,AA
IF CC
	BB	A,DD
ENDIF
CC	DEFL	YES
	ENDM
	LD	L,A
	LD	A,H
	IRPC	DD,AA
	BB	A,DD
	EXITM
	ENDM
	LD	H,A
	JP	NZ,$+5		;; Test both bytes for zero,
	INC	L		;; without disturbing carry
	DEC	L		;; (added in UNARC 1.5)
	EX	(SP),HL
	LD	A,H
	POP	HL
	ENDM

LD_IX	MACRO	AA
	PUSH	HL
	LD	HL,AA
	LD	(IXSAV),HL
	POP	HL
	ENDM

STO_IX	MACRO	AA
	PUSH	HL
	LD	HL,(IXSAV)
	LD	(AA),HL
	POP	HL
	ENDM

PUSH_IX	MACRO
	PUSH	HL
	LD	HL,(IXSAV)
	EX	(SP),HL
	ENDM

POP_IX	MACRO
	EX	(SP),HL
	LD	(IXSAV),HL
	POP	HL
	ENDM

INC_IX	MACRO
	PUSH	HL
	LD	HL,(IXSAV)
	INC	HL
	LD	(IXSAV),HL
	POP	HL
	ENDM

ADD_IX	MACRO	AA
	PUSH	HL
	LD	HL,(IXSAV)
IFIDN <AA>,<IX>
	ADD	HL,HL
ELSE
	ADD	HL,AA
ENDIF
	LD	(IXSAV),HL
	POP	HL
	ENDM

LD_A_IX	MACRO
	PUSH	HL
	LD	HL,(IXSAV)
	LD	A,(HL)
	POP	HL
	ENDM

LDI	MACRO			; Does not handle P/V
	PUSH	AF
	LD	A,(HL)
	LD	(DE),A
	INC	HL
	INC	DE
	DEC	BC
	POP	AF
	ENDM

LDIR	MACRO			; Destroys CF
	CALL	LDIR
	ENDM

CPIR	MACRO			; Destroys CF, does not handle P/V
	CALL	CPIR
	ENDM

RLD	MACRO			; Not a true RLD, but suffices for us
	LD	A,(HL)
	RLCA
	RLCA
	RLCA
	RLCA
	ENDM

SRL	MACRO	AA
	OR	A
	SHIFT	AA,RRA
	ENDM

SRA	MACRO	AA
	SHIFT	AA,<RLCA,RRCA,RRA>
	ENDM

RR	MACRO	AA
	SHIFT	AA,RRA
	ENDM

RRC	MACRO	AA
	SHIFT	AA,RRCA
	ENDM

SHIFT	MACRO	AA,BB
IFDIF <AA>,<A>
	PUSH	AF
	LD	A,AA
ENDIF
	IRP	CC,<BB>
	CC
	ENDM
	INC	A		;; Set flags without
	DEC	A		;;  changing carry
IFDIF <AA>,<A>
	LD	AA,A
	EX	(SP),HL
	LD	A,H
	POP	HL
ENDIF
	ENDM

BIT	MACRO	AA,BB		; Destroys CF, SF
	PUSH	AF
IFDIF <BB>,<A>
	LD	A,BB
ENDIF
	AND	1 SHL AA
BITMSK	DEFL	$-1		;; For squashed files (c.f. STRADD)
	EX	(SP),HL
	LD	A,H
	POP	HL
	ENDM

SET	MACRO	AA,BB
	SETRES	AA,BB,OR
	ENDM

RES	MACRO	AA,BB
	SETRES	AA,BB,<AND NOT>
	ENDM

SETRES	MACRO	AA,BB,CC	; Destroys flags if register A
IFDIF <BB>,<A>
	PUSH	AF
	LD	A,BB
ENDIF
	CC	(1 SHL AA)
IFDIF <BB>,<A>
	LD	BB,A
	POP	AF
ENDIF
	ENDM

ENDIF				; NOT Z80
	PAGE
	SUBTTL	Definitions

; ARC file parameters

ARCMARK	EQU	26		; Archive header marker byte

; Note:	The following three definitions should not be changed lightly.
;	These are hard-wired into the code at numerous places!

ARCVER	EQU	9		; Max. header vers. supported for output
CRBITS	EQU	12		; Max. bits in crunched file input codes
CQBITS	EQU	13		; Max. bits in squashed file input codes

; CP/M system equates

BOOT	EQU	0000H		; Base of system page / warm boot return
BDOS	EQU	BOOT+005H	; BDOS entry
MEMTOP	EQU	BDOS+1		; Contains base of BDOS / top of TPA
DFCB	EQU	BOOT+05CH	; Command line tail default FCB
SFCB	EQU	BOOT+06CH	; Command line tail secondary FCB
DBUF	EQU	BOOT+080H	; Default DMA buffer
TBASE	EQU	BOOT+100H	; Base of TPA

; BDOS function codes

$CONIN	EQU	1		; Console input
$CONOUT	EQU	2		; Console output
$LIST	EQU	5		; Listing output
$PRTSTR	EQU	9		; Print (console) string
$CONST	EQU	11		; Get console status
$VERSN	EQU	12		; Get CP/M version no.
$SELECT	EQU	14		; Select disk
$OPEN	EQU	15		; Open file
$CLOSE	EQU	16		; Close file
$FIND	EQU	17		; Find file
$DELETE	EQU	19		; Delete file
$READ	EQU	20		; Read sequential record
$WRITE	EQU	21		; Write sequential record
$MAKE	EQU	22		; Make file
$DISK	EQU	25		; Get current disk
$SETDMA	EQU	26		; Set DMA address
$GETDPB	EQU	31		; Get disk parameter block address
$READR	EQU	33		; Read random record
$RECORD	EQU	36		; Set random record no.

; FCB offsets

@DR	EQU	0		; Drive code
@FN	EQU	1		; File name
@FT	EQU	9		; File type
@CR	EQU	32		; Current record
@RN	EQU	33		; Random record no. (optional)
@FCBSZ	EQU	33		; FCB size for sequential I/O
@FCBSX	EQU	@FCBSZ+3	; Extended FCB size for random I/O

	PAGE
; ASCII control codes

CTLC	EQU	'C'-'@'		; Control-C (console abort)
CTLK	EQU	'K'-'@'		; Control-K (alternate abort)
BEL	EQU	'G'-'@'		; Bell
HT	EQU	'I'-'@'		; Horizontal tab
LF	EQU	'J'-'@'		; Line feed
CR	EQU	'M'-'@'		; Carriage return
CTLS	EQU	'S'-'@'		; Control-S (suspend output)
CTLZ	EQU	'Z'-'@'		; Control-Z (CP/M end-of-file)
DEL	EQU	7FH		; Delete/rubout
REP	EQU	'P'-'@'+80H	; Repeated byte flag (DLE with msb set)

	PAGE
	SUBTTL	Patchable Options

; Useful options here at start of file to simplify patching

	ASEG			; This simplifies page alignment at end
	ORG	TBASE		; .COM file starts here

	JP	BEGIN		; Skip over this stuff on program entry

; The default values of all of these options are suitable for standard
; CP/M 2.2 systems.  In each case an alternate setting is illustrated,
; but these are primarily of interest to RCP/M sysops or users with
; non-standard (or very small) systems.  Options followed by ";*" are
; automatically affected by the wheel byte setting (see below).

CCPSV:	DB	8		; No. high memory pages to save (8 = 2K)
;CCPSV:	DB	0		; This to clobber CCP and force reboot

;BLKSZ:	DB	1		; Default disk allocation block size (K)
BLKSZ:	DB	0		;*This to use default drive's block size
				;  for listing, when no output drive

HIDRV:	DB	'P'-'@'		; Highest input file drive (A=1,B=2,...)
;HIDRV:	DB	0		; This restricts input to default drive

HODRV:	DB	'P'-'@'		;*Highest output file drive no.
;HODRV:	DB	0		; RCP/M's use this for no disk output
				;  (if no wheel byte implemented)

; Note:	As of UNARC 1.2, the following byte serves only as a flag.
;	I.e., it no longer defines a pseudo typeout "drive".

TYFLG:	DB	0FFH		; This enables single file typeout
;TYFLG:	DB	0		;*RCP/M's use this for no file typeout

TYPGS:	DB	0		;*No. buffer pages for typeout (0=max)
;TYPGS:	DB	1		; This minimizes viewing waits, but may
				;  cause excess floppy motor stop/start

TYLIM:	DB	0		; No line limit for file typeout
;TYLIM:	DB	80		;*RCP/M's may prefer non-zero line limit

; Following added in UNARC 1.2 to simplify use by RCP/M sysops.  If byte
; addressed by WHEEL is zero, no file output allowed (as if HODRV = 0).
; Also BLKSZ and/or TYPGS are assumed = 1, if these are zero by default.
; If byte addressed by WHEEL is non-zero (indicates a privileged user),
; TYFLG and TYLIM are not enforced (unlimited typeout allowed).  The
; default wheel byte address defined here (HODRV) provides compatibility
; with previous releases of UNARC for systems which do not implement a
; wheel byte.  (ZCPR3 users should set this word to the address of their
; Z3WHL byte, as determined by running SHOW.COM.)

WHEEL:	DW	HODRV		; Address of "wheel" byte (this if none)
;WHEEL:	DW	BOOT+03EH	; E.g. if wheel byte stored in base page

	PAGE
; Following added in UNARC 1.4:

TYLPS:	DB	23		; No. lines between typeout pauses
;TYLPS:	DB	0		; Forces continuous typeout always

DBLSZ:	DB	0		; Use DPB for disk allocation block size
;DBLSZ:	DB	1		; Assumed block size (K) if BDOS 31 call
				;  not supported (e.g. CP/M-68K)

BELLS:	DB	0FFH		; Allow bells in warning/error messages
;BELLS:	DB	0		; This for solitude

; Table of file types which are disallowed for typeout

NOTYP:	DB	'COM'		; CP/M-80 or MS-DOS binary object
	DB	'CM','D'+80H	; CP/M-86 binary object (or dBASE file)
	DB	'EXE'		; MS-DOS executable
	DB	'OBJ'		; Renamed COM
	DB	'OV?'		; Binary overlay
	DB	'REL'		; Relocatable object
	DB	'?RL'		; Other relocatables (PRL, CRL, etc.)
	DB	'INT'		; Intermediate compiler code
	DB	'SYS'		; System file
	DB	'BAD'		; Bad disk block
	DB	'LBR'		; Library
	DB	'ARC'		; Archive (unlikely in an ARC)
	DB	'ARK'		; Alternate archive (ditto)
	DB	'?Q?'		; Any SQueezed file (ditto)
	DB	'?Z?'		; Any CRUNCHed (or ZOO'd) file (ditto)

; Note:	Additional types may be added below.  To remove one of the above
;	types without replacing it, simply set the msb in any byte (as
;	shown above for .CMD, since that can be a readable dBASE command
;	file).

	REPT	5		; Room for more types (20 total)
	DB	0,0,0
	ENDM

	DB	0		; End of table

	PAGE
	SUBTTL	Program Usage

; Following displays if no command line parameters
; (Also on attempts to type the .COM file)

; Note:	All program name output is obtained from the first chars of the
;	usage message below (up to and including the first blank), and
;	is generated by a byte value 1 in any typeout string.

USAGE:	IDENT			; Program version identification first

	DB	CR,LF
	DB	'CP/M Archive File Extractor'
IF NOT Z80
USEA:	DB	' (8080 Version)'
ENDIF
	DB	CR,LF,LF,'Usage:  ',1,'[d:]arcfile[.typ] '

USE1:	DB	'[d:]'
USE1L	EQU	$-USE1		; Above cleared if HODRV=0 or non-wheel

	DB	'[afn] [N'

USE4:	DB	'|P|C'
USE4L	EQU	$-USE4		; Above cleared if non-wheel

	DB	']',CR,LF,LF
	DB	'Examples:',CR,LF
	DB	'B>',1,'A:SAVE.ARK *.*  '
	DB	'; List all files in CP/M archive SAVE on drive A',CR,LF
	DB	'B>',1,'A:SAVE.ARC *.*  '
	DB	'; List all files in MS-DOS archive SAVE on drive A',CR,LF
	DB	'A>',1,'SAVE            '
	DB	'; Same as either of above',CR,LF
	DB	'A>',1,'SAVE *.* N      '
	DB	'; Same as above (no screen pauses)',CR,LF
	DB	'A>',1,'SAVE *.DOC      '
	DB	'; List just .DOC files',CR,LF

USE2:	DB	'A>',1,'SAVE READ.ME    '
	DB	'; Typeout the file READ.ME',CR,LF
	DB	'A>',1,'SAVE READ.ME N  '
	DB	'; Typeout the file READ.ME (no screen pauses)',CR,LF
USE2L	EQU	$-USE2		; Above cleared if TYFLG=0 and non-wheel

USE3:	DB	'A>',1,'SAVE A:         '
	DB	'; Extract all files to drive A',CR,LF
	DB	'A>',1,'SAVE B:*.DOC    '
	DB	'; Extract .DOC files to drive B',CR,LF
	DB	'A>',1,'SAVE C:READ.ME  '
	DB	'; Extract file READ.ME to drive C',CR,LF
USE3L	EQU	$-USE3		; Above cleared if HODRV=0 or non-wheel

USE5:	DB	'A>',1,'SAVE PRN.DAT P  '
	DB	'; Print the file PRN.DAT (no formatting)',CR,LF
	DB	'A>',1,'SAVE *.* C      '
	DB	'; Check validity of all files in archive'
USEC:	DB	CR,LF
USE5L	EQU	$-USE5		; Above cleared if non-wheel

	DB	LF
	COPR			; Copyright notice last

; (We'd like to be unobtrusive, but please don't remove or patch out)

USEB:	DB	0		; End of message marker
	DB	CTLZ		; Stop attempted .COM file typeout here

	PAGE
	SUBTTL	Beginnings and Endings

IF NOT Z80
; Special entry for self-unpacking archive (non-Z80 version only)

; Note:	This works because the initial file (UNARCA.COM) in a self-
;	unpacking archive is offset 26 bytes in memory (due to the
;	initial JP opcode plus 25-byte version 1 header).  I.e., the
;	first three bytes of such a file are 0C3H, 1AH, 01H = JP 11AH.
;	Location 11AH contains the instruction normally found at the
;	base address (100H) of UNARCA.COM, i.e. JP BEGIN.  But because
;	of the offset, that will jump here instead of to BEGIN.

	JP	SELFUP		; Go setup for self-unpacking
	REPT	5		; Pad out for 26-byte offset...
	DB	0,0,0,0
	ENDM
	JP	BOOT		; (Should never reach this!)
ENDIF

; Program begins

; Note:	The program is self-initializing.  Once loaded, it may be
;	re-executed multiple times (e.g. by a zero-length COM file,
;	or the ZCPR GO command).

BEGIN:
;;;	XOR	A		; \ This sets Z80 P/V = 0 (no overflow),
;;;	DEC	A		; /  or 8080/8085 P/V = 1 (even parity)
	SUB	A		; (More elegant, saves a byte: v1.4)
	LD	C,$PRTSTR	; Setup to print message by BDOS
IF Z80
	LD	DE,NOTZ80	; Must be a Z80, or forget all else
	JP	PE,BDOS		; If not, just print message and abort
	LD	(SPSAV),SP	; Save CCP stack (better be a Z80 now!)
ELSE
	LD	DE,USEZ80	; Should be an 8080/8085
	CALL	PO,BDOS		; If not, tell user about Z80 version

BEGIN1:	LD	HL,0		; Entry after self-unpacking relocation
	ADD	HL,SP		; Save CCP stack (8080 or Z80)
	LD	(SPSAV),HL
ENDIF
	CALL	CHECK		; Check if we can proceed
	LD	SP,STACK	; Now setup local stack
	LD	HL,TOTS		; Zero all listing totals
	LD	BC,TOTC*256+0
	CALL	FILL
	CALL	INIT		; Process command line, open ARC file
	CALL	OUTSET		; Check output drive, setup for output

; Find first archive header

; Note:	As of UNARC 1.2, up to three additional bytes are tolerated
;	before first header mark, with no error or warning messages
;	(for "self-unpacking" archives).

	LD	HL,3		; Assume will skip at least 3 bytes
	LD	B,L		; Setup count of allowed extra bytes

FIRST:	CALL	GET		; Get next byte
	CP	ARCMARK		; Is it header marker?
	JR	Z,NEXT		; Yes, skip
	DJNZ	FIRST		; Else loop for no. allowed extras

	PAGE
; File processing loop

LOOP:	CALL	GET		; Get next byte
	CP	ARCMARK		; Is it archive header marker?
	JR	NZ,BAD		; No, it's a bad header

; Process next file

NEXT:	CALL	GET		; Get header version
	OR	A		; If zero, that's logical end of file,
	JR	Z,DONE		;  and we're done

NEXT1:	CALL	GETHDR		; Read archive header
	CALL	GETNAM		; Does file name match test pattern?
	JR	NZ,SKIP		; No, skip this file

	CALL	LIST		; List file info
	CALL	OUTPUT		; Output the file (possibly)
	CALL	TAMBIG		; Ambiguous output file selection?
	JR	NZ,EXIT		; No, quit early

; Skip to next file

SKIP:	LD	HL,SIZE		; Get two-word remaining file size
	CALL	LGET		; (will be 0 if output was completed)
	CALL	SEEK		; Seek past it
	LD	HL,0		; Reinit count of bytes skipped
	JR	LOOP		; Loop for next file

; Done with all files

DONE:	LD	HL,(TFILES)	; Get no. files processed
	LD	A,H
	OR	A
	JR	NZ,DONE1	; Skip if many

	OR	L		; No files found?
	LD	DE,NOFILS	; Yes, setup error message
	JR	Z,PABORT	;  and abort

	DEC	A		; Test if just one file

DONE1:	CALL	NZ,LISTT	; If more than one, list totals

; Exit program

EXIT:	CALL	ICLOSE		; Close input and output files (if open)
	LD	A,(CCPSV)	; Possibly overlaid CCP?
	OR	A
	JP	Z,BOOT		; Yes, reboot CP/M

	LD	SP,0		; Restore CCP stack
SPSAV	EQU	$-2		; (Original stack ptr saved here)
	RET			; Return to CCP

	PAGE
; Bad archive file header

; Note:	This added in UNARC 1.2 (mostly compatible with MS-DOS ARC
;	5.12) and modified somewhat in UNARC 1.4.  It's a bit kludgy
;	now, but it does permit processing of Phil Katz' self-unpacking
;	archive, PKX32A11.COM (with a warning message), as well as
;	SEA's ARC51.COM (with no warning).  (Although success with
;	PKX32A11 hinges on the fact that no ARCMARK's are followed
;	by valid non-zero versions in that file, which is probably
;	coincidental.)

BAD:	CALL	BADCNT		; Count bad header byte
	CALL	GET		; Read byte (unless end of file abort)

BAD1:	CP	ARCMARK		; Found a header marker?
	JR	NZ,BAD		; No, repeat attempt to re-synchronize

	CALL	GET		; Ok, found another (possible) header
	PUSH	AF		; Save header version
	DEC	A		; But ignore archive eof here
	CP	ARCVER		; Is it a valid version?
	JR	NC,BAD2		; No, skip

	EX	DE,HL		; Get count of bytes skipped
	LD	HL,HDRSKP	; Store in message
	LD	BC,0
	CALL	WTOD
	LD	(HL),0
	LD	DE,HDRERR	; Print warning message
	CALL	PRINTX
	POP	AF		; Recover version
	JR	NEXT1		; Go process (assumed valid) next file

BAD2:	CALL	BADCNT		; Count bad header byte (1st of 2 seen)
	POP	AF		; Restore vesion
	JR	BAD1		; Go check if 2 consecutive header marks

	PAGE
; Preliminary checks

; Note:	Following is called before local stack is setup.  Primary
;	caution	here is that PRINT (called by PABORT and PEXIT) uses no
;	more than 5 stack levels.  (Assumes program called from CCP with
;	7 stack levels available, and that at most one of these must be
;	reserved for interrupts.)

CHECK:	XOR	A		; Clear flags in case early abort:
	LD	(IFLAG),A	;  Input file open flag
	LD	(OFLAG),A	;  Output file open flag
	LD	(LPS),A		; Prevent any screen pauses yet

	LD	C,$VERSN	; Must be CP/M 2.0 or above, since we
	CALL	BDOS		;  use random disk reads
	CP	20H
	LD	DE,CPMERR	; (With a bit of work, this limitation
	JR	C,EABORT	;  could be eliminated in future)

	LD	A,(MEMTOP+1)	; Get base page of BDOS
	LD	HL,CCPSV	; Subtract no. pages reserved for CCP
	SUB	(HL)		;  (if any)
	LD	(HIPAGE),A	; Save highest usable page (+1)
	LD	A,HIGH MINMEM	; Ensure enough memory to do anything

; Check for enough memory

CKMEM:	CP	0		; Page address to check in A
HIPAGE	EQU	$-1		; Must be lower than this
	RET	C		; Return if ok

	LD	DE,NOROOM	; Else, abort due to no room

; Early abort during preliminary checks

EABORT:	POP	HL		; Reclaim stack level for extra safety

; Print error message and abort

PABORT:	CALL	PRINT

; Abort program

ABORT:	LD	DE,ABOMSG	; Print general abort message

; Print message and exit

; Note:	We call PRINT+CRLF, instead of PRINTX, to save a stack level

PEXIT:	CALL	PRINT
	CALL	CRLF
	JR	EXIT

	PAGE
; Validate command line parameters and open input file

INIT:	LD	HL,DBUF		; Point to command line buffer
	LD	E,(HL)		; Fetch its length
	LD	D,0
	ADD	HL,DE		; Point to the last byte
	DEC	HL		; Point to second-to-last char
	LD	A,(HL)		; Is it a blank?
	CP	' '
	JR	NZ,INIT1	; No, skip (no option)

	INC	HL		; Point to option letter
	LD	A,(HL)		; Is it 'N' ?
	CP	'N'
	JR	Z,INIT2		; Yes, skip (no paging)

	CP	'P'		; Is it 'P' ?
	JR	NZ,INIT0
	LD	(PROUTF),A	; Yes, set printer output flag

INIT0:	CP	'C'		; Is it 'C' ?
	JR	NZ,INIT1	; No, go enstate paging limit
	LD	(CHECKF),A	; Yes, set check archive flag

INIT1:	LD	A,(TYLPS)	; Fetch default lines between pauses
	LD	(LPS),A		; Set lines per screen (enables pauses)
	LD	(LPSCT),A	; Init count of lines until next pause

INIT2:	LD	A,' '		; Setup blank for (several) tests
	LD	HL,SFCB		; Point to second parameter FCB
	LD	DE,OFCB		; Point to file output FCB
	LDI			; Save output drive, point to file name
	LD	DE,TNAME	; Set to save test pattern
	LD	BC,11		; Setup count for file name and type
	CP	(HL)		; Output file name specified?
	JR	NZ,INIT3	; Yes, go move it

	LD	H,D		; No, default to "*.*"
	LD	L,E
	LD	(HL),'?'	; (I.e. all "?" chars)
	INC	DE
	DEC	BC

INIT3:	LDIR			; Save test name pattern
	LD	HL,IFCB+@FT	; Point to ARC file type
	CP	(HL)		; Omitted?
	JR	NZ,INIT4	; Skip if not

	LD	(HL),'A'	; Yes, set default file type (.ARK)
	INC	HL
	LD	(HL),'R'
	INC	HL
	LD	(HL),'K'
	LD	(ARKFLG),A	; Set flag for alternate (.ARC) next

INIT4:	LD	HL,IFCB+@FN	; Any ARC file name?
	CP	(HL)
	JR	Z,HELP		; No, go show on-line help

	PUSH	HL		; Save name ptr for message generation
	CALL	FAMBIG		; Ambiguous ARC file name?
	LD	DE,NAMERR	; Yes, report error
INIT5:	JR	Z,PABORT	;  and abort

	POP	DE		; Recover ptr to FCB name
	LD	HL,ARCNAM	; Unparse name for message
	LD	C,' '		; (with no blanks)
	CALL	LNAME
	XOR	A		; Cleanup end of message string
	LD	(HL),A

	DEC	A		; Set to read a new record next
	LD	(GETPTR),A	; (initializes GET)

	LD	HL,IFCB		; Point to ARC file FCB
	LD	A,(HIDRV)	; Get highest allowed drive no.
	CP	(HL)		; Is ARC file drive in range?
	LD	DE,BADIDR	; No, report bad input drive
	JP	C,PABORT	;  and abort

; Open archive file

	EX	DE,HL		; Recover FCB address
	LD	C,$OPEN		; Open ARC file
	CALL	FDOS		; File found?
	JR	NZ,INIT6	; Yes, skip

	LD	HL,ARKFLG	; No, but can we retry with alternate
	OR	(HL)		;  default file type?
	LD	DE,OPNERR	; No, report error
	JR	Z,INIT5		;  and abort (via branch aid)

	LD	(HL),0		; Clear retry flag for next time
	LD	HL,IFCB+@FT+2	; Point to last char of file type
	LD	(HL),'C'	; Change from .ARK to .ARC
	JR	INIT4		; Go attempt open one more time

INIT6:	LD	(IFLAG),A	; Set input file open flag
	LD	DE,ARCMSG	; Show ARC file name
	CALL	PRINTX

	LD	A,(BLKSZ)	; Get default disk block size
	OR	A		; Explicit default?
	CALL	Z,WHLCK		; Or non-wheel if none? (i.e. forces 1K)
	JR	NZ,SAVBLS	; Yes, skip

; Get current disk's allocation block size for listing

GETBLS:	LD	A,(DBLSZ)	; Any default disk block size?
	OR	A		; (e.g. if $GETDPB not supported)
	JR	NZ,SAVBLS	; Yes, bypass the $GETDPB call

	LD	C,$GETDPB	; Get DPB address
	CALL	BDOS
	INC	HL		; Point to block mask
	INC	HL
	INC	HL
	LD	A,(HL)		; Fetch block mask
	INC	A		; Compute block size / 1K bytes
	RRCA
	RRCA
	RRCA

SAVBLS:	LD	(LBLKSZ),A	; Save block size for listing
	RET			; Return

; Display program usage help message

HELP:	CALL	WHLCK		; Check wheel byte
	PUSH	AF		; Save it
	DEC	A		; Privileged user?
	JR	Z,HELP1		; No, skip (extraction never allowed)

	LD	A,(HODRV)	; File extraction allowed?
	OR	A

HELP1:	LD	HL,USE1		; Setup to clear out usage examples
	LD	BC,256*USE1L+80H
	CALL	Z,FILL		; Do it if not allowed
	LD	HL,USE3
	LD	B,USE3L
	CALL	Z,FILL		; (Two places)
	POP	AF		; Was wheel byte set?
	JR	Z,HELP2		; Yes, skip (typeout etc always allowed)

	LD	HL,USE4		; Clear out print/check option examples
	LD	B,USE4L
	CALL	FILL
	LD	HL,USE5		; (Two places)
	LD	B,USE5L
	CALL	FILL

	LD	A,(TYFLG)	; File typeout allowed?
	OR	A
	LD	HL,USE2
	LD	B,USE2L
	CALL	Z,FILL		; No, clear out usage example

HELP2:	LD	DE,USAGE	; Just print usage message
	JP	PEXIT		;  and exit

; Check wheel byte

WHLCK:	PUSH	HL		; Save register
	LD	HL,(WHEEL)	; Get wheel byte address
	LD	A,(HL)		; Fetch wheel byte
	POP	HL		; Restore reg
	OR	A		; Check wheel byte
	JR	NZ,WHLCK1

	INC	A		; If zero, user is not privileged
	RET			; Return A=1 (NZ)

WHLCK1:	XOR	A		; If non-zero, he's a big wheel
	RET			; Return A=0 (Z)

	PAGE
; Close input and output files (called at program exit)

ICLOSE:	LD	DE,IFCB		; Setup ARC file FCB
	LD	A,0		; Get input open flag
IFLAG	EQU	$-1		; (stored here)
	CALL	CLOSE		; Close input file first (e.g. for MP/M)

; Close output file

OCLOSE:	LD	DE,OFCB		; Setup output file FCB
	LD	A,0		; Get output open flag
OFLAG	EQU	$-1		; (stored here)

; Close a file if open

CLOSE:	OR	A		; File is open?
	LD	C,$CLOSE	; Yes, close it
	CALL	NZ,BDOS
	INC	A		; Check return code
	RET			; Return to caller (Z set if error)

; BDOS file functions for output file

OFDOS:	LD	DE,OFCB		; Setup output file FCB

; BDOS file functions

FDOS:	CALL	BDOS		; Perform function
	INC	A		; Test directory code
	RET			; Return (Z set if file not found)

; Set DMA address for file input/output

SETDMA:	LD	C,$SETDMA	; DMA address in DE
	CALL	BDOS		; This is always a good place to...

; Check for CTRL-C abort (and/or read console char if any)

CABORT:	LD	C,$CONST	; Get console status
	CALL	BDOS
	OR	A		; Character ready?
	RET	Z		; Return (Z set) if not

	LD	C,$CONIN	; Input console char (echo if printable)
	CALL	BDOS

; Note:	Following added in UNARC 1.5 to handle any ^S input which is not
;	detected by CP/M 2.2 BDOS.

	AND	7FH		; Mask to 7 bits
	CP	CTLS		; Is it CTRL-S (suspend output)?
	LD	C,$CONIN
	CALL	Z,BDOS		; Yes, wait for another char
	AND	7FH		; Mask to 7 bits

	CP	CTLC		; Is it CTRL-C?
	JR	Z,GABORT	; Yes, go abort

	CP	CTLK		; Or is it CTRL-K (RCP/M alternate ^C)?
	RET	NZ		; No, return char (and NZ) to caller

GABORT:	JP	ABORT		; Go abort program

	PAGE
	SUBTTL	Archive File Input Routines

; Get counted byte from archive subfile (saves alternate register set)

; The alternate register set normally contains values for the low-level
; output routines (see PUTSET).  This entry to GETC saves these and
; returns with them enstated (for PUT, PUTUP, etc.).  Caller must issue
; EXX after call to return these to the alternate set, and must save and
; restore any needed values from the original register set.

; Note:	At first glance, all this might seem unnecessary, since BDOS
;	(might be called by GETREC) does not use the Z80 alternate
;	register set (at least with Digital Research CP/M).  But some
;	CBIOS implementations (e.g. Osborne's) assume these are fair
;	game, so we are extra cautious here.

GETCX:	EXX			; Swap in alt regs (GETC saves them)

; Get counted byte from component file of archive

; GETC returns with carry set (and a zero byte) upon reaching the
; logical end of the current subfile.  (This relies on the GET routine
; NOT returning with carry set.)

GETC:	PUSH	BC		; Save registers
	PUSH	DE
	PUSH	HL
	LD	HL,SIZE		; Point to remaining bytes in subfile
	LD	B,4		; Setup for long (4-byte) size

GETC1:	LD	A,(HL)		; Get size
	DEC	(HL)		; Count it down
	OR	A		; But was it zero? (clears carry)
	JR	NZ,GET1		; No, go get byte (must not set carry!)

	INC	HL		; Point to next byte of size
	DJNZ	GETC1		; Loop for multi-precision decrement

	LD	B,4		; Size was zero, now it's -1

GETC2:	DEC	HL		; Reset size to zero...
	LD	(HL),A		; (SIZE must contain valid bytes to skip
	DJNZ	GETC2		;  to get to next subfile in archive)

	SCF			; Set carry to indicate end of subfile
	JR	GET2		; Go restore registers and return zero

	PAGE
; Get next sequential byte from archive file

; Note:	GET and SEEK rely on the fact that the default DMA buffer
;	used for file input (DBUF) begins on a half-page boundary.
;	I.e. DBUF address = nn80H (nn = 00 for standard CP/M).

GET:	PUSH	BC		; Save registers
	PUSH	DE
	PUSH	HL

GET1:	LD	HL,(GETPTR)	; Point to last byte read
	INC	L		; At end of buffer?
	CALL	Z,GETNXT	; Yes, read next record and reset ptr
	LD	(GETPTR),HL	; Save new buffer ptr
	LD	A,(HL)		; Fetch byte from there

GET2:	POP	HL		; Restore registers
	POP	DE
	POP	BC
	RET			; Return

; Get next sequential record from archive file

GETNXT:	LD	C,$READ		; Setup read-sequential function code

; Get record (sequential or random) from archive file

GETREC:	LD	DE,DBUF		; Point to default buffer
	PUSH	DE		; Save ptr
	PUSH	BC		; Save read function code
	CALL	SETDMA		; Set DMA address
	LD	DE,IFCB		; Setup FCB address
	POP	BC		; Restore read function
	CALL	BDOS		; Do it
	POP	HL		; Restore buffer ptr
	OR	A		; End of file?
	RET	Z		; Return if not

; Unexpected end of file

EOF:	LD	DE,FMTERR	; Print bad format message and abort
	JP	PABORT		; (not much else we can do)

; Count bytes skipped while processing bad archive header

BADCNT:	INC	HL		; Bump bad byte count
	LD	A,H		; But 64K bytes is enough!
	OR	L
	RET	NZ		; Return if not reached limit

	JR	EOF		; Else, report bad format and abort

	PAGE
; Seek to new random position in file (relative to current position)
; (BCDE = 32-bit byte offset)

SEEK:	LD	A,B		; Most CP/M (2.2) can handle is 23 bits
	OR	A		; So highest bits of offset must be 0
	JR	NZ,EOF		; Else, that's certainly past eof!

	LD	A,E		; Get low bits of offset in A
	LD	L,D		; Get middle bits in HL
	LD	H,C
	ADD	A,A		; LSB of record offset -> carry
	ADC_HL	HL		; Record offset -> HL
	JR	C,EOF		; If too big, report unexpected eof

	RRA			; Get byte offset
	EX	DE,HL		; Save record offset
	LD	HL,GETPTR	; Point to offset (+80H) of last byte in
	ADD	A,(HL)		; Add byte offsets
	LD	(HL),A		; Update buffer ptr for new position
	INC	A		; But does it overflow current record?
	JP	P,SEEK1		; Yes, skip

	LD	A,D		; Check record offset
	OR	E
	RET	Z		; Return if none (still in same record)

	DEC	DE		; Get offset from next record
	JR	SEEK2		; Go compute new record no.

SEEK1:	ADD	A,7FH		; Get proper byte offset in DMA page
	LD	(HL),A		; Save new buffer pointer

SEEK2:	PUSH	DE		; Save record offset
	LD	DE,IFCB
	LD	C,$RECORD	; Compute current "random" record no.
	CALL	BDOS		; (I.e. next sequential record to read)
	LD	HL,(IFCB+@RN)	; Get result
	POP	DE		; Restore record offset
	ADD	HL,DE		; Compute new record no.
	JR	C,EOF		; If >64k, it's past largest (8 Mb) file

	LD	(IFCB+@RN),HL	; Save new record no.
	LD	C,$READR	; Read the random record
	CALL	GETREC
	LD	HL,IFCB+@CR	; Point to current record in extent
	INC	(HL)		; Bump for subsequent sequential read
	RET			; Return

	PAGE
; Get archive file header

GETHDR:	LD	DE,HDRBUF	; Set to fill header buffer
	LD	B,HDRSIZ	; Setup normal header size
	CP	1		; But test if version 1
	PUSH	AF		; Save test result
	JR	NZ,GETHD2	; Skip if not version 1

	LD	B,HDRSIZ-4	; Else, header is 4 bytes less
	JR	GETHD2		; Go to store loop

GETHD1:	CALL	GET		; Get header byte

GETHD2:	LD	(DE),A		; Store in buffer
	INC	DE
	DJNZ	GETHD1		; Loop for all bytes

	POP	AF		; Version 1?
	RET	NZ		; No, all done

	LD	HL,SIZE		; Yes, point to compressed size
	LD	C,4		; It's 4 bytes
	LDIR			; Move to uncompressed length
	RET			; Return

	PAGE
; Get, save, and test file name from archive header

GETNAM:	LD	DE,NAME		; Point to name in header
	LD	HL,OFCB+@FN	; Point to output file name
	LD_IX	TNAME		; Point to test pattern
	LD	B,11		; Set count for name and type

GETN1:	LD	A,(DE)		; Get next name char
	AND	7FH		; Ensure no flags, is it end of name?
	JR	Z,GETN4		; Yes, go store blank

	INC	DE		; Bump name ptr
	CP	' '+1		; Is it legal char for file name?
	JR	C,GETN2		; No, if blank or non-printing,
	CP	DEL		;  or this
	JR	NZ,GETN3	; Skip if ok

GETN2:	LD	A,'$'		; Else, change to something legal

GETN3:	CALL	UPCASE		; Ensure it's upper case
	CP	'.'		; But is it type separator?
	JR	NZ,GETN5	; No, go store name char

	LD	A,B		; Get count of chars left
	CP	4		; Reached type yet?
	JR	C,GETN1		; Yes, bypass the separator

	DEC	DE		; Backup to re-read separator

GETN4:	LD	A,' '		; Set to store a blank

GETN5:	LD	(HL),A		; Store char in output name
	LD_A_IX			; Get pattern char
	INC_IX			; Bump pattern ptr
	CP	'?'		; Pattern matches any char?
	JR	Z,GETN6		; Yes, skip

	CP	(HL)		; Matches this char?
	RET	NZ		; Return (NZ) if not

GETN6:	INC	HL		; Bump store ptr
	DJNZ	GETN1		; Loop until FCB name filled

	LD	BC,256*(@FCBSZ-@FN-11)+0
	JP	FILL		; Zero rest of FCB, return (Z still set)

	PAGE
	SUBTTL	File Output Routines

; Check output drive and setup for file output

OUTSET:	LD	A,(HODRV)	; Get highest allowed output drive
	LD	B,A		; Save for later test
	LD	HL,CHECKF	; Point to check-only flag
	CALL	WHLCK		; Check wheel byte
	DEC	A		; Is user privileged?
	JR	NZ,OUTS1	; Yes, skip

	LD	B,A		; Else, no output drive allowed
	LD	(HL),A		; No checking allowed
	LD	(PROUTF),A	; No printing allowed
	LD	A,(TYFLG)	; Fetch flag for typeout allowed

OUTS1:	LD	C,A		; Save typeout flag (always if wheel)
	LD	A,(OFCB)	; Any output drive?
	OR	A
	JR	NZ,OUTS2	; Yes, skip to check it

	OR	(HL)		; Just checking files?
	JR	Z,CKTYP		; No, go see if typeout permitted

	LD	DE,CHKMSG	; Yes, show 'Checking...' message
	CALL	PRINTL
	LD	A,0FEH		; Set dummy drive in output FCB
	LD	(OFCB),A
	JR	CRCINI		; Skip to init CRC computations

OUTS2:	DEC	A		; Get zero-relative drive no.
	CP	B		; In range of allowed drives?
	LD	DE,BADODR	; No, report bad output drive
	JP	NC,PABORT	;  and abort

	LD	E,A		; Save output drive
	PUSH	DE
	ADD	A,'A'		; Convert to ASCII
	LD	(OUTDRV),A	; Store drive letter for message
	LD	DE,OUTMSG	; Show output drive
	CALL	PRINTL

	LD	C,$DISK		; Get default drive
	CALL	BDOS
	POP	DE		; Recover output drive
	CP	E		; Test if same as default
	PUSH	AF		; Save default drive (and test result)
	LD	C,$SELECT	; Select output drive
	CALL	NZ,BDOS		;  (if different than default)
	CALL	GETBLS		; Get its block size for listing
	POP	AF		; Restore original default drive
	LD	E,A
	LD	C,$SELECT	; Reselect it
	CALL	NZ,BDOS		;  (if changed)

	PAGE
; Initialize lookup table for CRC generation

; Note:	For maximum speed, the CRC routines rely on the fact that the
;	lookup table (CRCTAB) is page-aligned.

X16	EQU	0		; x^16 (implied)
X15	EQU	1 SHL (15-15)	; x^15
X2	EQU	1 SHL (15-2)	; x^2
X0	EQU	1 SHL (15-0)	; x^0 = 1

POLY	EQU	X16+X15+X2+X0	; Polynomial (CRC-16)

CRCINI:	LD	HL,CRCTAB+256	; Point to 2nd page of lookup table
	LD	A,H		; Check enough memory to store it
	CALL	CKMEM
	LD	DE,POLY		; Setup polynomial

; Loop to compute CRC for each possible byte value from 0 to 255

CRCIN1:	LD	A,L		; Init low CRC byte to table index
	LD	BC,256*8	; Setup bit count, clear high CRC byte

; Loop to include each bit of byte in CRC

CRCIN2:	SRL	C		; Shift CRC right 1 bit (high byte)
	RRA			; (low byte)
	JR	NC,CRCIN3	; Skip if 0 shifted out

	EX_AF			; Save lower CRC byte
	LD	A,C		; Update upper CRC byte
	XOR	D		;  with upper polynomial byte
	LD	C,A
	EX_AF			; Recover lower CRC byte
	XOR	E		; Update with lower polynomial byte

CRCIN3:	DJNZ	CRCIN2		; Loop for 8 bits

	LD	(HL),C		; Store upper CRC byte (2nd table page)
	DEC	H
	LD	(HL),A		; Store lower CRC byte (1st table page)
	INC	H
	INC	L		; Bump table index
	JR	NZ,CRCIN1	; Loop for 256 table entries

	RET

	PAGE
; Check for valid file name for typeout (or printing)

CKTYP:	OR	C		; Typeout not allowed?
	CALL	NZ,TAMBIG	; Or ambiguous output file name?
	RET	Z		; Yes, return (will just list file)

	LD	DE,NOTYP	; Point to table of excluded types

CKTYP1:	LD	HL,TNAME+8	; Point to type of selected file
	LD	B,3		; Setup count for 3 chars

CKTYP2:	LD	A,(DE)		; Fetch next table char
	OR	A		; End of table?
	JR	Z,CKTYP5	; Yes, go set flag to allow typeout

	CP	'?'		; Matches any char?
	JR	Z,CKTYP3	; Yes, skip

	CP	(HL)		; Matches this char?

CKTYP3:	INC	DE		; Bump table ptr
	JR	Z,CKTYP4	; Matched?
	DJNZ	CKTYP3		; No, just advance to next table entry
	JR	CKTYP1		; Then loop to try again

CKTYP4:	INC	HL		; Char matched, point to next
	DJNZ	CKTYP2		; Loop for all chars in file type
	RET			; If all matched, return (no typeout)

CKTYP5:	DEC	A		; If no match, file name is valid
	LD	(OFCB),A	; Set dummy drive (0FFH) in output FCB
	RET			; Return

; Test for ambiguous output file selection

TAMBIG:	LD	HL,TNAME	; Point to test pattern

; Check for ambiguous file name (HL = ptr to FCB-type name)

FAMBIG:	LD	BC,11		; Setup count for file name and type
	LD	A,'?'		; Any "?" chars?
	CPIR			; Yes, return with Z set
	RET			; No, return NZ

	PAGE
; Extract file for disk or console output

OUTPUT:	LD	A,(OFCB)	; Any output drive (or typing files)?
	OR	A
	RET	Z		; No, there's nothing to do here

	LD	B,A		; Save output drive
	LD	A,(VER)		; Get header version
	CP	ARCVER+1	; Supported for output?
	LD	DE,BADVER	; No, report unknown version
	JP	NC,PABORT	;  and abort

	LD	L,A		; Copy version
	LD	H,0
	LD	DE,OBUFT-1	; Use to index table of starting
	ADD	HL,DE		;  output buffer pages
	LD	A,(HL)		; Get starting page of buffer
	CALL	CKMEM		; Ensure enough memory
	LD	HL,BUFPAG	; Point to buffer start page
	LD	(HL),A		; Save it
	LD	C,A		; (also for typeout buffer check)
	INC	HL		; Point to buffer limit (BUFLIM)
	LD	A,(HIPAGE)	; Get memory limit page
	LD	(HL),A		; Assume max possible output buffer
	INC	B		; Typing files?
	JR	NZ,OUTDSK	; No, go extract to disk

; Setup for console (or printer) output

	LD	A,(TYPGS)	; Get max. pages to buffer typeout
	OR	A		; No limit?
	CALL	Z,WHLCK		; And is this privileged user?
	JR	Z,OUTCON	; Yes, skip (use 1 page if no privilege)

	ADD	A,C		; Compute desired limit page
	JR	C,OUTCON	; But skip if exceeds (physical) memory
	CP	(HL)
	JR	NC,OUTCON	; Also if exceeds available memory

	LD	(HL),A		; If ok, set lower buffer limit

OUTCON:	LD	A,(PROUTF)	; Printing file?
	OR	A
	JR	NZ,OUTBEG	; Yes, skip the separator

	LD	HL,LINE		; Fill listing line with dashes
	LD	BC,256*LINLEN+'-'
	CALL	FILL
	CALL	LISTL		; Print separating line first
	JR	OUTBEG		; Go extract file for typeout

	PAGE
; Setup for disk file (or black hole) output

OUTDSK:	INC	B		; Just checking file?
	JR	Z,OUTBEG	; Yes, skip

	LD	DE,BUFF		; Set DMA address to a safe place
	CALL	SETDMA
	LD	C,$FIND		; Find file
	CALL	OFDOS		; Already exists?
	JR	Z,OUTD2		; No, skip

	LD	DE,EXISTS	; Inform user and ask:
	CALL	PRINTS		; Should we overwrite existing file?

OUTD1:	CALL	CABORT		; Wait for response (or CTRL-C abort)
	JR	Z,OUTD1

	LD	E,A		; Save response
	CALL	CRLF		; Start a new line after prompt
	LD	A,E		; Get response char
	CALL	UPCASE		; Upper and lower case are the same
	CP	'Y'		; Answer was yes?
	RET	NZ		; No, return (skip file output)

	LD	C,$DELETE	; Yes, delete existing file
	CALL	OFDOS

OUTD2:	LD	C,$MAKE		; Create a new file
	CALL	OFDOS		; But directory full?
	LD	DE,DIRFUL	; Yes, report error
	JP	Z,PABORT	;  and abort

	LD	(OFLAG),A	; Set flag for output file open

	PAGE
; All set to output file

OUTBEG:	LD	A,(VER)		; Check compression type
	CP	4
	JR	NC,USQ		; Skip if squeezed or crunched/squashed

	CALL	PUTSET		; Else (simple cases), setup output regs
	CP	3		; Packed?
	JR	Z,UPK		; Yes, skip

; Uncompressed file

UNC:	CALL	GETC		; Just copy input to output
	JR	C,OUTEND	;  until end of file
	CALL	PUT
	JR	UNC

; Packed file

UPK1:	CALL	PUTUP		; Output with repeated byte expansion

UPK:	CALL	GETC		; Get input byte
	JR	NC,UPK1		; Loop until end of file

; End of output file

OUTEND:	CALL	PUTBUF		; Flush final buffer (if any)
	LD	A,(OFCB)	; Typing (or printing) file?
	INC	A
	RET	Z		; Yes, all done (no CRC check)

; Note:	Following instruction added in UNARC 1.6, since the preceding
;	test (altered in 1.42) no longer clears carry.

	OR	A		; Clear carry for 16-bit subtract
	EX	DE,HL		; Save computed CRC
	LD	HL,(CRC)	; Get CRC recorded in archive header
	SBC_HL	DE		; Do they match?
	LD	DE,CRCERR	; If not,
	CALL	NZ,OWARN	;  print warning message

	LD	HL,LEN		; Point to remaining (output) length
	CALL	LGET		; Fetch length (it's 4 bytes)
	LD	A,B		; All should be zero...
	OR	C
	OR	D
	OR	E
	LD	DE,LENERR	; If not,
	CALL	NZ,OWARN	;  print incorrect length warning

	CALL	OCLOSE		; Close output file (if open)
	LD	HL,OFLAG	; Clear file open flag
	LD	(HL),0
	RET	NZ		; Return unless error closing file

	LD	DE,CLSERR	; Else, report close failure
	JP	PABORT		;  and abort

	PAGE
; Unsqueeze (Huffman-coded) file

	.COMMENT |

Note:  Although numerous assembly-language implementations of Richard
Greenlaw's pioneer USQ (C language) program have appeared, all of the
coding here is original.  At risk of being accused of "re-inventing
the wheel," we do this primarily for personal satisfaction (not to
mention protection of our copyright).

We were tempted to use the super-fast algorithm suggested by Steven
Greenberg's recent public contribution, UF (aka USQFST, nee FU).
(After all, we require a Z80, so why not take advantage of the latest
technology?)  However, some of the speed benefit of Greenberg's method
is necessarily lost, since we do not buffer the input file and must
count each input byte against the file size recorded in the archive
header.  (Input buffering is not advantageous, since we must have
random access to the archive file.)  Also, the occurence of squeezed
files in archives is relatively rare, since the "crunching" method
produces better compression in most cases.  Thus we use a more
classical approach, albeit at the expense of the ultimate in
performance, but with a substantial savings in code complexity and
memory requirements.

Note also that many authors go to elaborate pains to check the validity
of the binary decoding tree.  Such checks include:  (1) the node count
(can be at most 256, although some people mistakenly think it can be
greater -- c.f. Knuth, vol. 1, 2nd ed., sec. 2.3.4.5, pp. 399-405); (2)
all node links in the tree must be in the range specified by the node
count; (3) no infinite loops in the tree (this one's not so easy to
test); and (4) premature end-of-file in the tree or data.  Instead, we
take a KISS approach which assumes the tree is valid and relies upon
the final output file CRC and length checks to warn of any possible
errors:  (1) the tree is initially cleared (all links point to the root
node); (2) at most 256 nodes are stored; and (3) decoding terminates
upon detecting the special end-of-file code in the data (the normal
case), the physical end-of-file (as determined by the size recorded in
the archive header), or a tree link to the root node (which indicates a
diseased tree). |

	PAGE
; Start unsqueezing

USQ:	JR	NZ,UCR		; But skip if crunched/squashed file

; First clear the decoding tree

	LD	BC,TREESZ-1	; Setup bytes to clear - 1
	CALL	TREECL		; (Leaves DE pointing past end of tree)

; Read in the tree

; Note:	The end-of-file condition may be safely ignored while reading
;	the node count and tree, since GETC will repeatedly return
;	zero bytes in this case.

	CALL	GETC		; Get node count, low byte
	LD	C,A		; Save for loop
	CALL	GETC		; Get high byte (can be ignored)
	OR	C		; But is it zero nodes?
	JR	Z,USQ3		; Yes (very unlikely), it's empty file

USQ1:	LD	B,4		; Setup count for 4 bytes in node
	LD	A,D		; Each byte will be stored in a separate
	SUB	B		;  page (tree is page-aligned), so
	LD	D,A		;  point back to the first page

USQ2:	CALL	GETC		; Get next byte
	LD	(DE),A		; Store in tree
	INC	D		; Point to next page
	DJNZ	USQ2		; Loop for all bytes in node

	INC	E		; Bump tree index
	DEC	C		; Reduce node count
	JR	NZ,USQ1		; Loop for all nodes

USQ3:	CALL	PUTSET		; Done with tree, setup output regs
	PUSH	HL		; Reset current input byte (on stack)

; Start of decoding loop for next output byte

USQ4:	EXX			; Save output registers
	XOR	A		; Reset node index to root of tree

; Top of loop for next input bit

USQ5:	LD	L,A		; Setup index of next tree node
	POP	AF		; Get current input byte
	SRL	A		; Shift out next input bit
	JR	NZ,USQ6		; Skip unless need a new byte

	PAGE
; Read next input byte

	PUSH	HL		; Save tree index
	CALL	GETCX		; Get next input byte
	EXX			; Save output regs
	JR	C,USQEND	; But go stop if reached end of input

	POP	HL		; Restore tree index
	SCF			; Set flag for end-of-byte detection
	RRA			; Shift out first bit of new byte

; Process next input bit

USQ6:	PUSH	AF		; Save input byte
	LD	H,HIGH TREE	; Point to start of current node
	JR	NC,USQ7		; Skip if new bit is 0

	INC	H		; Bit is 1, point to 2nd word of node
	INC	H		; (3rd tree page)

USQ7:	LD	A,(HL)		; Get low byte of node word
	INC	H
	LD	B,(HL)		; Get high byte (from next tree page)
	INC	B
	JR	NZ,USQ8		; Skip if high byte not -1

	CPL			; We've got output byte (complemented)
	EXX			; Restore regs for output
	CALL	PUTUP		; Output with repeated byte expansion
	JR	USQ4		; Loop for next byte

USQ8:	DJNZ	USQEND		; If high byte not 0, it's special EOF
	OR	A		; If high byte was 0, its new node link
	JR	NZ,USQ5		; Loop for new node (but can't be root)

; End of squeezed file (physical, logical, or due to Dutch elm disease)

USQEND:	POP	HL		; Cleanup stack

; End of unsqueezed or uncrunched file output

UCREND:	EXX			; Restore output regs
	JP	OUTEND		; Go end output

; Clear squeezed file decoding tree (or crunched file string table)

TREECL:	LD	HL,TREE		; Point to tree (also string table)

STRTCL:				; (Entry for partial string table clear)
	LD	(HL),L		; Clear first byte (it's page-aligned)
	LD	D,H		; Copy pointer to first byte
	LD	E,L
	INC	DE		; Propogate it thru second byte, etc.
	LDIR			; (called with BC = byte count - 1)
	RET			; Return

	PAGE
; Uncrunch (LZW-coded) file

	.COMMENT |

The Lempel-Ziv-Welch (so-called "LZW") data compression algorithm is
the most impressive benefit of ARC files.  It performs better than
Huffman coding in many cases, often achieving 50% or better compression
of ASCII text files and 15%-40% compression of binary object files.
The algorithm is named after its inventors:  A. Lempel and J. Ziv
provided the original theoretical groundwork, while Terry A. Welch
published an elegant practical implementation of their procedure.  (The
definitive article is Welch's "A Technique for High-Performance Data
Compression", in the June 1984 issue of IEEE Computer magazine.)

The Huffman algorithm encoded each input byte by a variable-length bit
string (up to 16 bits in Greenlaw's implementation), with bit length
(approximately) inversely proportional to the frequency of occurrence
of the encoded byte.  This has the disadvantages of requiring (1) two
passes over the input file for encoding and (2) the inclusion of the
decoding information along with the output file (a binary tree of up to
1026 bytes in Greenlaw's implementation).  In comparison, LZW is a one-
pass procedure which encodes variable-length strings of bytes by a
fixed-length code (12 bits in this implementation), without additional
overhead in the output file.  In essence, the procedure adapts itself
dynamically to the redundancy present in the input data.  There is one
drawback:  LZW requires substantially more memory than the Huffman
algorithm for both encoding and decoding.  (A 12K-byte string table is
required in this program; the MS-DOS ARC program uses even more.  Of
course, 12K is not that much these days:  I don't think they're even
selling IBM-PC's or MAC's with less than 512K anymore.  But some of us
in the CP/M world are still concerned with efficiency of memory use.)

The MS-DOS ARC program by System Enhancement Associates has (to date)
employed four different variations on the LZW scheme, differentiated by
the version byte in the archive file header:

     Version 5:  LZW applied to original input file
     Version 6:  LZW applied to file after packing repeated bytes
     Version 7:  Same as version 6 with a new (faster) hash code
     Version 8:  Completely new (much improved) implementation

The MS-DOS program PKARC 2.0 introduced another variation ("squashing"):

     Version 9:  Same as version 8 with 13-bit codes and no pre-packing

Version 8 (and 9) varies the output code width from 9 to 12 (13) bits
as the string table grows (benefits small files), performs an adaptive
reset of the string table after it becomes full if the compression
ratio drops (benefits large files), and eliminates the need for hash
computations by the decoder (reduces decoding time and space; in this
program, an extra 8K-byte table is eliminated).  Although the latest
release of the ARC program uses only this last version for encoding,
we, like ARC (PKXARC), support all four (five) versions for
compatibility with files encoded by earlier releases. |

	PAGE
; Setup for uncrunching (or unsquashing)

; We've been able to isolate all of the differences between the five
; versions of LZW into just three routines -- input, output, and hash
; function.  These are disposed of first, by inserting appropriate
; vectors into common coding and initializing version-dependent data.

; Note:	Introduction of squashed files in UNARC 1.42 has added some
;	extra kludges here.

UCR:	LD	HL,STRBIT	; All but version 9 use 4K string table
	LD	(HL),BIT4H	;  entries, so setup STRADD bit test
	CP	8		; Version 8 or 9?
	JR	NC,UCR1		; Yes, skip

	LD	DE,OGETCR	; Old versions get fixed 12-bit codes
	LD	BC,STRSZ+HSHSZ-1;  and need extra table for hashing
	LD	HL,OHASH	; Assume old hash function
	CP	6		; Test version
	LD	A,55H		; Setup initial flags for OGETCR
	JR	Z,UCR6		; All set if version 6
	JR	C,UCR5		; Skip if version 5

	LD	HL,FHASH	; Version 7 uses faster hash function
	JR	UCR6		; (but we've never seen one of these!)

UCR1:	JR	Z,UCR2		; Skip if version 8
	LD	(HL),BIT5H	; Version 9 allows 13-bit codes
	LD	BC,STQSZ-1	;  and has larger string table
	LD	A,8192/256	;  with 8K entries (less buffer space)
	JR	UCR4		; Join common code for versions 8 and 9

; Note:	This is the only place that we reference the code size for
;	crunched files (CRBITS) symbolically.  Currently, a value of
;	12 bits is required and it is assumed throughout the program.

UCR2:	CALL	GETC		; Read code size used to crunch file
	JR	C,UCR3		; But skip if none (PKARC 0-length file)
	CP	CRBITS		; Same as what we expect?
	LD	DE,UCRERR	; No, report incompatible format
	JP	NZ,PABORT	;  and abort

UCR3:	LD	BC,STRSZ-1	; Version 8 provides more buffer space
	LD	A,4096/256	;  and only 4K string table entries

UCR4:	LD	(STRMAX),A	; Setup NHASH table-full test
	LD	HL,0		; Clear code residue and count to init
	LD	(CODES),HL	;  NGETCR input (BITSAV and CODES)
	LD	DE,NGETCR	; New version has variable-length codes
	LD	HL,NHASH	;  and has a very simple "hash"
	LD	A,9		; Setup initial code size for NGETCR
	JR	Z,UCR6		; Skip if version 8

UCR5:	LD_IX	PUT		; Versions 5 and 9 don't unpack
	JR	UCR7

UCR6:	LD_IX	PUTUP		; Versions 6-8 unpack repeated bytes

UCR7:	STO_IX	PUTCRP		; Save ptr to output routine
	LD	(HASHP),HL	; Save ptr to hash function
	STO_DE	GETCRP		; Save ptr to input routine
	LD	(BITS),A	; Initialize input routine
	LD	A,B		; Get string table pages to clear (-1)
	SUB	3		; Less 3 for atomic strings
	LD	(STRCSZ),A	; Setup for reset clear in NGETCR

	PAGE
; Start uncrunching
; (All version-dependent differences are handled now)

	CALL	TREECL		; Clear string (and hash) table(s)
	STO_BC	STRCT		; Set no entries in string table
	DEC	BC		; Get code for no prefix string (-1)
	PUSH	BC		; Save as first-time flag
	XOR	A		; Init table with one-byte strings...

GCR0:	POP	BC		; Set for no prefix string
	PUSH	BC		; (Resave first-time flag)
	PUSH	AF		; Save byte value
	CALL	STRADD		; Add to table
	POP	AF		; Recover byte
	INC	A		; Done all 256 bytes?
	JR	NZ,GCR0		; No, loop for next

	CALL	PUTSET		; Setup output registers

; Top of loop for next input code (top of stack holds previous code)

GCR:	EXX			; Save output regs first

GETCR:	CALL	0		; Get next input code
GETCRP	EQU	$-2		; (ptr to NGETCR or OGETCR stored here)

	POP	BC		; Recover previous input code (or -1)
	JP	C,UCREND	; But all done if end of input

	PUSH	HL		; Save new code for next loop
	CALL	STRPTR		; Point to string table entry for code
	INC	B		; Is this the first one in file?
	JR	NZ,GCR2		; No, skip

	INC	HL		; Yes,
	LD	A,(HL)		; Get first output byte

GCR1:	CALL	PUTCR		; Output final byte for this code
	JR	GCR		; Loop for next input code

GCR2:	DEC	B		; Correct prev code (stays in BC awhile)
	LD	A,(HL)		; Is new code in table?
	OR	A
	PUSH	AF		; (Save test result for later)
	JR	NZ,GCR3		; Yes, skip

	LD	H,B		; Else (special case), setup previous
	LD	L,C		;  code (it prefixes the new one)
	CALL	STRPTR		; Point to its table entry instead

	PAGE
; At this point, we have the table ptr for the new output string (except
; possibly its final byte, which is a special case to be handled later).
; Unfortunately, the table entries are linked in reverse order.  I.e.,
; we are pointing to the last byte to be output.  Therefore, we trace
; through the table to find the first byte of the string, reversing the
; link order as we go.  When done, we can output the string in forward
; order and restore the original link order.  (This is, we think, an
; innovative approach: it saves allocation of an extra 4K-byte stack,
; as in the MS-DOS ARC program, or an enormous program stack, as needed
; for the recursive algorithm of Steve Greenberg's UNCRunch program.)

; Careful:  The following value must be non-zero, so that the old-style
; hash (invoked by STRADD below) will not think a re-linked entry is
; unused!  (In a development version, we used zero; this worked fine for
; newer crunched files, but proved a difficult bug to squash when the
; old-style de-crunching failed randomly.)

GCR3:	LD	D,1		; Init previous entry ptr (01xxH = none)

GCR4:	LD	A,(HL)		; Test this entry
	CP	HIGH STRT	; Any prefix string?
	JR	C,GCR5		; No, we've reached the first byte

	LD	(HL),D		; Relink this entry
	LD	D,A		; (i.e. swap prev ptr with prefix ptr)
	DEC	HL
	LD	A,(HL)
	LD	(HL),E
	LD	E,A
	INC	HL
	EX	DE,HL		; Swap current ptr with prefix ptr
	JR	GCR4		; Loop for next entry

; HL points to table entry for first byte of output string.  We can now
; add the table entry for the string which the encoder placed in his
; table before sending us the current code.  (It's the previous code's
; string concatenated with the first byte of the new string).  Note that
; BC has been holding the previous code all this time.

GCR5:	INC	HL		; Point to byte
	POP	AF		; Recover special-case flag
	LD	A,(HL)		; Fetch byte
	PUSH	AF		; Re-save flag along with byte
	DEC	HL		; Restore table ptr
	PUSH	DE		; Save ptr to prev entry
	PUSH	HL		; Save ptr to this entry
	CALL	STRADD		; Add new code to table (for BC and A)
	POP	HL		; Setup table ptr for output loop

	PAGE
; Top of string output loop
; HL points to table entry for byte to output.
; Top of stack contains pointer to next table entry (or 01xxH).

GCR6:	INC	HL		; Point to byte
	LD	A,(HL)		; Fetch it
	PUSH	HL		; Save table ptr
	CALL	PUTCR		; Output the byte (finally!)
	EXX			; Save output regs
	POP	DE		; Recover ptr to this byte
	POP	HL		; Recover ptr to next byte's entry
	DEC	H		; Reached end of string?
	JR	Z,GCR7		; Yes, skip out of loop

	INC	H		; Correct next entry ptr from above test
	DEC	DE		; Restore ptr to this entry's mid byte
	LD	A,(HL)		; Relink the next entry
	LD	(HL),D		; (i.e. swap its "prefix" ptr with
	LD	D,A		;  ptr to this entry)
	DEC	HL
	LD	A,(HL)
	LD	(HL),E
	LD	E,A
	INC	HL
	PUSH	DE		; Save ptr to 2nd next entry
	JR	GCR6		; Loop to output next byte

; End of uncrunching loop
; All bytes of new string have been output, except possibly the final
; byte (which is the same as the first byte in this special case).

GCR7:	POP	AF		; Recover special-case flag and byte
	JR	NZ,GETCR	; If not set, loop for next input code

	JR	GCR1		; Else, go output final byte first

	PAGE
; Add entry to string table

; This routine receives a 12-bit prefix string code in BC and a suffix
; byte in A.  It then adds an entry to the string table (unless it's
; full) for the new string obtained by concatenating these.  Nothing
; is (or need be) returned to the caller.

	.COMMENT |

String table format:

The table (STRT) contains 4096 three-byte entries, each of which is
identified by a 12-bit code (table index).  The third byte (highest
address) of each entry contains the suffix byte for the string.  The
first two bytes contain a pointer (low-byte first) to the middle byte
of the table entry for the prefix string.  The null string (prefix to
the one-byte strings) is represented by a (16-bit) code value -1, which
yields a non-zero pointer below the base address of the table.  An
empty table entry contains a zero prefix pointer.

Our choice to represent prefix strings by pointers rather than codes
speeds up almost everything we do.  The high byte of the prefix pointer
(middle byte of an entry) may be tested for non-zero to determine if an
entry is occupied, and (since the table is page-aligned) it may be
further tested against the page address of the table's base (HIGH STRT)
to decide if it represents the null string.

Note that the entry for code 256 is not used in the newer version of
crunching.  This is reserved for a special signal to reset the string
table (handled by the hash and input routines, NHASH and NGETCR). |

STRADD:	LD	HL,(STRCT)	; Get count of strings in table
	BIT	4,H		; Is it the full 4K?

; Note:	Above test complicated by introduction of squashed files (which
;	allow 13-bit codes and 8K string table entries) and the non-Z80
;	emulation of the BIT instruction.  Following definitions handle
;	this.

IF Z80
STRBIT	EQU	$-1		; Byte to modify BIT instruction
BIT4H	EQU	64H		; High byte of BIT 4,H
BIT5H	EQU	6CH		; High byte of BIT 5,H
ELSE
STRBIT	EQU	BITMSK		; Byte to modify emulated BIT
BIT4H	EQU	1 SHL 4		; Mask to test bit 4
BIT5H	EQU	1 SHL 5		; Mask to test bit 5
ENDIF
	RET	NZ		; Yes, forget it

	INC	HL		; Bump count for one more
	LD	(STRCT),HL	; Save new string count
	PUSH	AF		; Save suffix byte
	PUSH	BC		; Save prefix code
	CALL	0		; Hash them to get pointer to new entry
HASHP	EQU	$-2		; (ptr to xHASH routine stored here)
	EX	(SP),HL		; Save result, recover prefix code
	CALL	STRPTR		; Get pointer to prefix entry
	EX	DE,HL		; Save it
	POP	HL		; Recover new entry pointer
	DEC	HL		; Point to low byte of entry
	LD	(HL),E		; Store prefix ptr in entry
	INC	HL		;  (low byte first)
	LD	(HL),D		;  (then high byte, in mid entry byte)
	INC	HL		; Point to high byte of new entry
	POP	AF		; Recover suffix byte
	LD	(HL),A		; Store
	RET			; All done

	PAGE
; Hash function for (new-style) crunched files

; Note:	"Hash" is of course a misnomer here, since strings are simply
;	added to the table sequentially with the newer crunch method.
;	This routine's main responsibility is to update the bit-length
;	for expected input codes, and to bypass the table entry for
;	code 256 (reserved for adaptive reset), at appropriate times.

NHASH:	LD	A,L		; Copy low byte of string count in HL
	DEC	L		; Get table offset for new entry
	OR	A		; But is count a multiple of 256?
	JR	NZ,STRPTR	; No, just return the table pointer

	LD	A,H		; Copy high byte of count
	DEC	H		; Complete double-register decrement
	LD	DE,STRCT	; Set to bump string count (bypasses
	JR	Z,NHASH1	;  next entry) if exactly 256

	CP	4096/256	; Else, is count the full 4K?
STRMAX	EQU	$-1		; (Byte to modify max string count test)
	JR	Z,STRPTR	; Yes (last table entry), skip

; Note the following cute test.  (It's mentioned in K & R, ex. 2-9.)

	AND	H		; Is count a power-of-two?
	JR	NZ,STRPTR	; No, skip

	LD	DE,BITS		; Yes, next input code is one bit longer

; Note:	By definition, there can be no input code residue at this point.
;	I.e. (BITSAV) = 0, since we have read a power-of-two (> 256) no.
;	of codes at the old length (total no. of bits divisible by 8).
;	By the same argument, (CODES) = 0 modulo 8 (see NGETCR).

NHASH1:	EX	DE,HL		; Swap in address value to increment
	INC	(HL)		; Bump the value (STRCT or BITS)
	EX	DE,HL		; Recover table offset

; Get pointer to string table entry

; This routine is input a 12-bit code in HL (or -1 for the null string).
; It returns a pointer in HL to the middle byte of the string table
; entry for that code (STRT-2 for the null string).  Destroys DE only.

STRPTR:	LD	D,H		; Copy code
	LD	E,L
	ADD	HL,HL		; Get 2 * code
	ADD	HL,DE		; Get 3 * code
	LD	DE,STRT+1	; Point to table base entry (2nd byte)
	ADD	HL,DE		; Compute pointer
	RET			; Return

	PAGE
; Get variable-length code from (new-style) crunched file

	.COMMENT |

These codes are packed in right-to-left order (lsb first).  The code
length (stored in BITS) begins at 9 bits and increases up to a maximum
of 12 bits (13 bits for squashed files) as the string table grows
(maintained by NHASH).  Location BITSAV holds residue bits remaining in
the last input byte after each call (must be initialized to 0, code
assumes BITSAV = BITS-1).

In comparison, the MS-DOS ARC program buffers 8 codes at a time (i.e.
n bytes, where n = bits/code) and flushes this buffer whenever the code
length changes (so that first code at new length begins on an even byte
boundary).  By coincidence (see NHASH) this buffer is always empty when
the code length increases as a result of normal string table growth.
Thus the only time this added bufferring affects us is when the code
length is reset back to 9 bits upon receipt of the special clear
request (code 256), at which time we must possibly bypass up to 10
input bytes (worst case = 7 codes at 1.5 bytes/code).  This is handled
by a simple down-counter in location CODES, whose mod-8 value indicates
the no. of codes which should be skipped (must be initialized to 0,
code assumes that CODES = BITSAV-1). |

; Note:	This can probably be made a lot faster (e.g. by unfolding into
;	8 separate cases and using a co-routine return), but that's a
;	lot of work.  For now, we KISS ("keep it short and simple").

NGETCR:	LD	HL,CODES	; First update code counter
	DEC	(HL)		;  for clear code processing
	INC	HL		; Point to BITSAV
	LD	A,(HL)		; Get saved residue bits
	INC	HL		; Point to BITS
	LD	B,(HL)		; Setup bit counter for new code
	LD	HL,7FFFH	; Init code (msb reset for end detect)

; Top of loop for next input bit

NGETC1:	SRL	A		; Shift out next input bit
	JR	Z,NGETC7	; But skip out if new byte needed

NGETC2:	RR	H		; Shift bit into high end of code word
	RR	L		;  (double-register shift)
	DJNZ	NGETC1		; Loop until have all bits needed

; Input complete, cleanup code word

NGETC3:	SRL	H		; Shift code down,
	RR	L		;  to right-justify it in HL
	JR	C,NGETC3	; Loop until end flag shifted out

	LD	(BITSAV),A	; Save input residue for next call
	LD	A,H		; But is it code 256?
	DEC	A		; (i.e. adaptive reset request)
	OR	L
	RET	NZ		; No, return (carry clear)

; Special handling to reset string table upon receipt of clear code

	LD	HL,BITS		; Point to BITS
	LD	C,(HL)		; Fetch current code length
	LD	(HL),9		; Go back to 9-bit codes
	DEC	HL		; Point to BITSAV
	LD	(HL),A		; Empty the residue buffer
	DEC	HL		; Point to CODES
	LD	A,(HL)		; Get code counter
	AND	7		; Modulo 8 is no. codes to flush
	JR	Z,NGETC6	; Skip if none

; Note:	It's a shame we have to do this at all.  With a minor change in
;	its implementation, the MS-DOS ARC program could have simply
;	shuffled down its buffer and avoided wasting up to 10 bytes in
;	the crunched file (not to mention a lot of unnecessary effort).

; Note:	Prior to UNARC 1.4, the following coding was simplified by the
;	(incorrect) assumption that 12-bit codes are being generated at
;	this point.  While true for .ARC files created by ARC 5.12 or
;	earlier, this is not necessarily the case for files created by
;	PKARC 1.1 or later.  Hence, some added effort here now...

	LD	B,A		; Save no. codes to flush
	XOR	A		; Reset no. bits to flush
	LD	(HL),A		; Reset code counter to 0 for next time

NGETC4:	ADD	A,C		; Add no. bits per code
	DJNZ	NGETC4		; Loop to compute total bits to flush

	RRA			; Divide by 8
	RRA
	RRA
	AND	0FH		; Max possible result 10 (11 squashed)
	LD	B,A		; Obtain no. input bytes to bypass

NGETC5:	PUSH	BC		; Loop to flush the (encoder's) buffer
	CALL	GETCX
	EXX			; (No need to test for end-of-file
	POP	BC		;  here, we'll pick it up later if
	DJNZ	NGETC5		;  it happens)

NGETC6:	LD	HL,STRT+(3*256)	; Clear out (all but one-byte) strings
	LD	BC,STRSZ-(3*256)-1
STRCSZ	EQU	$-1		; (Byte to modify string tbl clear size)
	CALL	STRTCL
	LD	HL,257		; Reset count for just one-byte strings
	LD	(STRCT),HL	;  plus the unused entry

; Kludge:  We rely here on the fact that the previous input code is at
;	top of caller's stack, where -1 indicates none.  This should
;	properly be done by the caller, but doing it here preserves
;	commonality of coding for old-style crunched files (i.e. caller
;	never knows this happened).

	POP	HL		; Get return address
	EX	(SP),HL		; Exchange with top of (caller's) stack
	LD	HL,-1		; Set no previous code
	EX	(SP),HL		; Replace on stack
	PUSH	HL		; Restore return
	JR	NGETCR		; Go again for next input code

; Read next input byte

NGETC7:	PUSH	BC		; Save bit count
	PUSH	HL		; Save partial code
	CALL	GETCX		; Get next input byte
	EXX			; Save output regs
	POP	HL		; Restore code
	POP	BC		; Restore count
	RET	C		; But stop if reached end of file

; Special test to speed things up a bit...
; (If need the whole byte, might as well save some bit fiddling)

	BIT	3,B		; At least 8 more bits needed?
	JR	NZ,NGETC8	; Yes, go do it faster

	SCF			; Else, set flag for end-of-byte detect
	RRA			; Shift out first bit of new byte
	JR	NGETC2		; Go back to bit-shifting loop

; Update code by (entire) new byte

NGETC8:	LD	L,H		; Shift code down 8 bits
	LD	H,A		; Insert new byte into code
	LD	A,B		; Get bit count
	SUB	8		; Reduce by 8
	LD	B,A		; Update remaining count
	JR	NZ,NGETC7	; Get another byte if still more needed

	JR	NGETC3		; Else, go exit early (note A=0)

	PAGE
; Hash functions for (old-style) crunched files

; This stuff exists for the sole purpose of processing files which were
; created by older releases of MS-DOS ARC (pre-version 5.0).  To quote
; that program's author:  "Please note how much trouble it can be to
; maintain upwards compatibility."  Amen!

; Note:	The multiplications required by the two hash function versions
;	are sufficiently specialized that we've hand-coded each of them
;	separately, for speed, rather than use a common multiply
;	subroutine.

; Versions 5 and 6...
; Compute hash key = upper 12 of lower 18 bits of unsigned square of:
;	(prefix code + suffix byte) OR 800H

; Note:	I'm sure there's a faster way to do this, but I didn't want to
;	exert myself unduly for an obsolete crunching method.

OHASH:	LD	DE,0		; Clear product
	LD	L,A		; Extend suffix byte
	LD	H,D		;  to 16 bits
	ADD	HL,BC		; Sum with prefix code
	SET	3,H		; Or in 800H

; We now have a 13-bit number which is to be squared, but we are only
; interested in the lower 18 bits of the 26-bit product.  The following
; reduces this to a 12-bit multiply which yields the correct product
; shifted right 2 bits.  This is acceptable (we discard the low 6 bits
; anyway) and allows us to compute desired result in a 16-bit register.

; For the algebraically inclined...
;   If n is even (n = 2m + 0):  n * n = 4(m * m)
;   If n is odd  (n = 2m + 1):  n * n = 4(m * (m+1)) + 1

	SRA	H		; Divide number by 2 (i.e. "m")
	RR	L		; HL will be multiplicand (m or m+1)
	LD	C,H		; Copy to multiplier in C (high byte)
	LD	A,L		;  and A (low byte)
	ADC_HL	DE		; If was odd, add 1 to multiplicand

; Note there is one anomalous case:  The first one-byte string (with
; prefix = -1 = 0FFFFH and suffix = 0) generates the 16-bit sum 0FFFFH,
; which should hash to 800H (not 0).  The following test handles this.

	JR	C,OHASH3	; Skip if special case (will get 800H)
	LD	B,12		; Setup count for 12 bits in multiplier

; Top of multiply loop (vanilla shift-and-add)

OHASH1:	SRL	C		; Shift out next multiplier bit
	RRA
	JR	NC,OHASH2	; Skip if 0

	EX	DE,HL		; Else, swap in product
	ADD	HL,DE		; Add multiplicand (carries ignored)
	EX	DE,HL		; Reswap

OHASH2:	ADD	HL,HL		; Shift multiplicand
	DJNZ	OHASH1		; Loop until done all multiplier bits

; Now have the desired hash key in upper 12 bits of the 16-bit product

	EX	DE,HL		; Obtain product in HL
	ADD	HL,HL		; Shift high bit into carry

OHASH3:	RLA			; Shift up 4 bits into A...
	ADD	HL,HL
	RLA
	ADD	HL,HL
	RLA
	ADD	HL,HL
	RLA
	LD	L,H		; Move down low 8 bits of final result
	JR	HASH		; Join common code to mask high 4 bits

; Version 7 (faster)...
; Compute hash key = lower 12 bits of unsigned product:
;	(prefix code + suffix byte) * 15073

FHASH:	LD	L,A		; Extend suffix byte
	LD	H,0		;  to 16 bits
	ADD	HL,BC		; Sum with prefix code

; Note:	15073 = 2785 mod 4096, so we need only multiply by 2785.

	LD	D,H		; Copy sum, and compute in HL:
	LD	E,L		;    1 * sum
	ADD	HL,HL		;    2 * sum
	ADD	HL,HL		;    4 * sum
	ADD	HL,DE		;    5 * sum
	ADD	HL,HL		;   10 * sum
	ADD	HL,HL		;   20 * sum
	ADD	HL,DE		;   21 * sum
	ADD	HL,HL		;   42 * sum
	ADD	HL,DE		;   43 * sum
	ADD	HL,HL		;   86 * sum
	ADD	HL,DE		;   87 * sum
	ADD	HL,HL		;  174 * sum
	ADD	HL,HL		;  348 * sum
	ADD	HL,HL		;  696 * sum
	ADD	HL,HL		; 1392 * sum
	ADD	HL,HL		; 2784 * sum
	ADD	HL,DE		; 2785 * sum
	LD	A,H		; Setup high byte of result

; Common code for old-style hashing

HASH:	AND	0FH		; Mask hash key to 12 bits
	LD	H,A
	PUSH	HL		; Save key as trial string table index
	CALL	STRPTR		; Point to string table entry
	POP	DE		; Restore its index
	LD	A,(HL)		; Is table entry used?
	OR	A
	RET	Z		; No (that was easy), return table ptr

; Hash collision occurred.  Trace down list of entries with duplicate
; keys (in auxilliary table HSHT) until the last duplicate is found.

	LD	BC,HSHT		; Setup collision table base
	PUSH	HL		; Create dummy stack level

HASH1:	POP	HL		; Discard last index
	EX	DE,HL		; Get next trial index
	PUSH	HL		; Save it
	ADD	HL,HL		; Get ptr to collision table entry
	ADD	HL,BC
	LD	E,(HL)		; Fetch entry
	INC	HL
	LD	D,(HL)
	LD	A,D		; Is it zero?
	OR	E
	JR	NZ,HASH1	; No, loop for next in chain

; We now have the index (top of stack) and pointer (HL) for the last
; entry in the duplicate key list.  In order to find an empty spot for
; the new string, we search the string table sequentially starting 101
; (circular) entries past that of the last duplicate.

	EX	(SP),HL		; Save collision ptr, swap its index
	LD	E,101		; Move 101 entries past it
	ADD	HL,DE

HASH2:	RES	4,H		; Mask table index to 12 bits
	PUSH	HL		; Save index
	CALL	STRPTR		; Point to string table entry
	POP	DE		; Restore its index
	LD	A,(HL)		; Fetch byte from entry
	OR	A		; Is it empty?
	JR	Z,HASH3		; Yes, found a spot in table

	EX	DE,HL		; Else,
	INC	HL		; Bump index to next entry
	JR	HASH2		; Loop until we find one free

; We now have the index (DE) and pointer (HL) for an available entry
; in the string table.  We just need to add the index to the chain of
; duplicates for this hash key, and then return the pointer to caller.

HASH3:	EX	(SP),HL		; Swap ptr to last duplicate key entry
	LD	(HL),D		; Add this index to duplicate chain
	DEC	HL
	LD	(HL),E
	POP	HL		; Recover string table ptr
	RET			; Return it to caller

	PAGE
; Get fixed-length code from (old-style) crunched file

; These codes are packed in left-to-right order (msb first).  Two codes
; fit in three bytes, so we alternate processing every other call based
; on a rotating flag word in BITS (initialized to 55H).  Location BITSAV
; holds the middle byte between calls (coding assumes BITSAV = BITS-1).

OGETCR:	CALL	GETCX		; Get next input byte
	EXX			; Save output regs
	RET	C		; Return (carry set) if end of file

	LD	E,A		; Copy byte (high or low part of code)
	LD	HL,BITS		; Point to rotating bit pattern
	RRC	(HL)		; Rotate it
	JR	C,OGETC1	; Skip if this is high part of code

	DEC	HL		; Point to saved byte from last call
	LD	A,(HL)		; Fetch saved byte
	AND	0FH		; Mask low nibble (high 4 bits of code)
	EX	DE,HL		; Get new byte in L (low 8 bits of code)
	LD	H,A		; Form 12-bit code in HL
	RET			; Return (carry clear from mask)

OGETC1:	PUSH	DE		; Save byte just read (high 8 code bits)
	CALL	GETCX		; Get next byte
	EXX			; Save output regs
	POP	HL		; Restore previous byte in L
	RET	C		; But return if eof

	LD	(BITSAV),A	; Save new byte for next call
	AND	0F0H		; Mask high nibble (low 4 bits of code)
	RLA			; Rotate once through carry
	LD	H,A		; Set for circular rotate of HL & carry
	REPT	4
	ADC_HL	HL		;;Form the 12-bit code
	ENDM
	RET			; Return (carry clear after last rotate)

; Output next byte decoded from crunched file

PUTCR:	EXX			; Swap in output registers
	JP	0		; Vector to the appropriate routine
PUTCRP	EQU	$-2		; (ptr to PUT or PUTUP stored here)

	PAGE
; Low-level output routines

; Register usage (once things get going):
;
;  B  = Flag for repeated byte expansion (1 = repeat count expected)
;  C  = Last byte output (saved for repeat expansion)
;  DE = Output buffer pointer
;  HL = CRC value

; Setup registers for output (preserves AF)

PUTSET:	LD	HL,(BUFPAG-1)	; Get buffer start address
	LD	L,0		; (It's always page aligned)
	EX	DE,HL
	LD	H,E		; Clear the CRC
	LD	L,E
	LD	B,E		; Clear repeat flag
	RET			; Return

; Table of starting output buffer pages
; (No. of entries must match ARCVER)

OBUFT:				; Header version:
	DB	HIGH BUFF	; 1 - Uncompressed (obsolete)
	DB	HIGH BUFF	; 2 - Uncompressed
	DB	HIGH BUFF	; 3 - Packed
	DB	HIGH BUFFSQ	; 4 - Squeezed
	DB	HIGH BUFFCX	; 5 - Crunched (unpacked) (old)
	DB	HIGH BUFFCX	; 6 - Crunched (packed) (old)
	DB	HIGH BUFFCX	; 7 - Crunched (packed, faster) (old)
	DB	HIGH BUFFCR	; 8 - Crunched (new)
	DB	HIGH BUFFCQ	; 9 - Squashed

	PAGE
; Unpack and output packed byte

PUTUP:	DJNZ	PUTUP4		; Expecting a repeat count?
	LD	B,A		; Yes ("byte REP count"), save count
	OR	A		; But is it zero?
	JR	NZ,PUTUP2	; No, enter expand loop (did one before)

	LD	A,REP		; Else ("REP 0"),
	JR	PUT		; Go output REP code as data

PUTUP1:	LD	A,C		; Get repeated byte
	CALL	PUT		; Output it

PUTUP2:	DJNZ	PUTUP1		; Loop until repeat count exhausted
	RET			; Return when done

PUTUP3:	INC	B		; Set flag for repeat count next
	RET			; Return (must wait for next call)

PUTUP4:	INC	B		; Normal byte, reset repeat flag
	CP	REP		; But is it the special flag code (REP)?
	JR	Z,PUTUP3	; Yes, go wait for next byte

	LD	C,A		; Save output byte for later repeat

; Output byte (and update CRC)

PUT:	LD	(DE),A		; Store byte in buffer
	XOR	L		; Include byte in lower CRC
	LD	L,A		;  to get lookup table index
	LD	A,H		; Save high (becomes new low) CRC byte
	LD	H,HIGH CRCTAB	; Point to table value low byte
	XOR	(HL)		; Include in CRC
	INC	H		; Point to table value high byte
	LD	H,(HL)		; Fetch to get new high CRC byte
	LD	L,A		; Copy new low CRC byte

	INC	E		; Now that CRC updated, bump buffer ptr
	RET	NZ		; Return if not end of page

	INC	D		; Point to next buffer page
	LD	A,(BUFLIM)	; Get buffer limit page
	CP	D		; Buffer full?
	RET	NZ		; No, return

	PAGE
; Output buffer

PUTBUF:	PUSH	HL		; Save register (i.e. CRC)
	LD	HL,(BUFPAG-1)	; Get buffer start address
	XOR	A		; (it's always page-aligned)
	LD	L,A
	EX	DE,HL		; Swap with buffer end ptr
	SBC_HL	DE		; Compute buffer length
	JR	Z,PUTB2		; But skip all the work if it's empty

	PUSH	BC		; Save register (i.e. repeat flag/byte)
	LD	B,H		; Copy buffer length
	LD	C,L
	LD	HL,(LEN)	; Get (remaining) output file length
	SBC_HL	BC		; Subtract size of buffer
	LD	(LEN),HL	; (Should be zero when we're all done)
	JR	NC,PUTB1	; Skip if double-precision not needed

	LD	HL,(LEN+2)	; Update upper word of length
	DEC	HL
	LD	(LEN+2),HL

PUTB1:	PUSH	DE		; Save buffer start
	CALL	WRTBUF		; Write the buffer
	POP	DE		; Reset output ptr for next refill
	POP	BC		; Restore register

PUTB2:	POP	HL		; Restore register
	RET			; Return to caller

	PAGE
; Write buffer to disk

WRTBUF:	LD	A,(OFLAG)	; Output file open?
	OR	A
	JR	Z,TYPBUF	; No, go typeout buffer instead

	LD	H,D		; Get buffer end ptr
	LD	L,E
	ADD	HL,BC
	JR	WRTB2		; Enter loop

WRTB1:	LD	(HL),CTLZ	; Fill last record with CP/M EOF...
	INC	HL
	INC	BC

WRTB2:	LD	A,L		; Buffer ends on a CP/M record boundary?
	AND	7FH
	JR	NZ,WRTB1	; No, loop until it does

	OR	B		; At least one page to write?
	JR	Z,WRTB4		; Skip if not

WRTB3:	PUSH	BC		; Save remaining byte count
	CALL	WRTREC		; Output 2 records to disk (i.e. 1 page)
	CALL	WRTREC		; (Note returns A=0 as expected below)
	POP	BC		; Restore count
	DJNZ	WRTB3		; Loop for all (full) pages in buffer

WRTB4:	OR	C		; Half-page left?
	RET	Z		; No, return

; Write record to disk

WRTREC:	LD	HL,128		; Get CP/M record length
	ADD	HL,DE		; Add buffer ptr
	PUSH	HL		; Save next record start
	CALL	SETDMA		; Set to write from buffer ptr
	LD	C,$WRITE	; Write a record to output file
	CALL	OFDOS
	POP	DE		; Restore ptr for next call
	DEC	A		; Write error?
	RET	Z		; No, return

	LD	DE,DSKFUL	; Disk is full, report error
	JP	PABORT		;  and abort

	PAGE
; Typeout buffer

TYPBUF:	LD	A,(CHECKF)	; Just checking file?
	OR	A
	RET	NZ		; Yes, ignore buffer

	LD	A,(PROUTF)	; Printer output enabled?
	OR	A
	JR	NZ,PRTBUF	; Yes, go print buffer instead

; Note:	The file typeout facility was originally added to this program
;	as an afterthought.  The primitive nature of this facility has
;	been enhanced considerably with the addition of screen pauses in
;	UNARC 1.4.  Areas for future improvement include intelligent
;	handling of screen width and terminal characteristics.

TYPB0:	LD	A,(DE)		; Fetch next byte from buffer
	CP	CTLZ		; Is it CP/M end-of-file?
	JP	Z,EXIT		; Yes, exit program early

	PUSH	BC		; Save remaining byte count
	INC	A		; Bump ASCII code (simplifies DEL test)
	AND	7FH		; Mask to 7 bits
	CP	' '+1		; Is it a printable char?
	DEC	A		; (Restore code)
	JR	C,TYPB3		; Skip if non-printable

TYPB1:	CALL	PCHAR		; Type char

TYPB2:	INC	DE		; Bump ptr to next byte
	POP	BC		; Restore byte count
	DEC	BC		; Reduce count
	LD	A,B		; Done all bytes?
	OR	C
	JR	NZ,TYPB0	; No, loop for next
	RET			; Yes, return to caller

TYPB3:	CP	HT		; Is (non-printing) char a tab?
	JR	Z,TYPB1		; Yes, go type it
	JR	C,TYPB2		; But ignore if low control char
	CP	CR		; Does char generate a new line?
	JR	NC,TYPB2	; No, ignore control char (incl. CR)

	CALL	CRLF		; Yes (LF/VT/FF), start a new line
	PUSH	DE		; Save buffer ptr
	CALL	CABORT		; Good place to check for CTRL-C abort
	POP	DE		; Restore ptr
	LD	HL,LINCT	; Point to line count
	INC	(HL)		; Bump for one more line
	JR	Z,TYPB2		; But skip if 256 (must be no limit)

	LD	A,(TYLIM)	; Get max allowed lines
	CP	(HL)		; Reached limit (e.g. for RCP/M)?
	JR	NZ,TYPB2	; No, go back to typeout loop
	CALL	WHLCK		; But is wheel byte set?
	JR	Z,TYPB2		; Yes, do not enforce limit

	LD	DE,TYPERR	; Else, report too many lines
	JP	PABORT		;  and abort

	PAGE
; Print buffer

; This added in UNARC 1.41 as a quick hack to allow printing of
; highly-compressed binary plot images.  It may not be suitable for
; general text file listing.  (In particular, CTRL-Z is not treated
; as a file terminator.)

PRTBUF:	EX	DE,HL		; Buffer ptr -> HL

PRTB1:	LD	E,(HL)		; Fetch next byte from buffer
	PUSH	HL		; Save buffer ptr
	PUSH	BC		; Save remaining byte count
	LD	C,$LIST		; Print byte (on listing device)
	CALL	BDOS
	CALL	CABORT		; Check for CTRL-C abort
	POP	BC		; Restore byte count
	POP	HL		; Restore ptr
	INC	HL		; Bump to next byte in buffer
	DEC	BC		; Reduce count
	LD	A,B		; Done all bytes?
	OR	C
	JR	NZ,PRTB1	; No, loop for next

	RET			; Yes, return to caller

	PAGE
	SUBTTL	Listing Routines

; List file information

LIST:	LD	HL,(TFILES)	; Get total files so far
	LD	A,H		; Test if this is first file
	OR	L
	INC	HL		; Add one more
	LD	(TFILES),HL	; Update total files
	CALL	Z,LTITLE	; If first file, list column titles

	LD	DE,SIZE		; Point to compressed file size
	PUSH	DE		; Save for later
	LD	HL,TSIZE	; Update total compressed size
	CALL	LADD

	LD	DE,LEN		; Point to uncompressed length
	PUSH	DE		; Save for later
	LD	HL,TLEN		; Update total length
	CALL	LADD

	LD	HL,LINE		; Setup listing line pointer
	LD	DE,OFCB+@FN	; List file name from output FCB
	LD	C,0		; (with blank fill)
	CALL	LNAME

	POP	DE		; Recover file length ptr
	PUSH	DE		; Save again for factor calculation
	CALL	LTODA		; List file length
	CALL	LDISK		; Compute and list disk space
	CALL	LSTOW		; List stowage method and version
	POP	BC		; Restore uncompressed length ptr
	POP	DE		; Restore compressed size ptr
	CALL	LSIZE		; List size and compression factor
	LD	A,(DATE)	; Check for valid file date
	OR	A		; (This anticipates no-date CP/M files)
	JR	NZ,LIST1	; Skip if valid

	LD	B,18		; Else, clear out date and time fields
	CALL	FILLB
	JR	LIST2		; Skip

LIST1:	CALL	LDATE		; List file date
	CALL	LTIME		; List file time

LIST2:	CALL	LCRC		; List CRC value

	PAGE
; Terminate and print listing line

LISTL:	LD	DE,LINE		; Setup listing line ptr
	JR	LIST3		; Go finish up and list it

; List file totals

LISTT:	LD	HL,LINE		; Setup listing line ptr
	LD_DE	(TFILES)	; List total files
	CALL	WTODA
	LD	DE,TLEN		; List total file length
	PUSH	DE		;  and save ptr for factor calculation
	CALL	LTODA
	LD_DE	(TDISK)		; List total disk space
	CALL	LDISK1
	LD	B,13		; Fill next columns with blanks
	CALL	FILLB
	POP	BC		; Recover total uncompressed length ptr
	LD	DE,TSIZE	; Get total compressed size ptr
	CALL	LSIZE		; List overall size, compression factor
	LD	B,20		; Fill next columns with blanks
	CALL	FILLB
	LD_DE	(TCRC)		; List sum of all CRC values
	CALL	WHEX
	LD	DE,TOTALS	; Point to totals string (precedes line)

LIST3:	LD	(HL),0		; Terminate listing line
	JR	PRINTL		; Go print it, followed by new line

; Print character

PCHAR:	CP	BEL		; Is it a noisy one?
	JR	NZ,PCHAR1	; No, skip
	LD	HL,BELLS	; Yes, is silence desired?
	AND	(HL)
	RET	Z		; Yes, keep quiet

PCHAR1:	PUSH	DE		; Save register

PCHAR2:	LD	E,A		; Setup char
	DEC	A		; But is it special program name marker?
	JR	Z,PNAME		; Yes, go insert name

	LD	C,$CONOUT	; Send to BDOS console output
	CALL	BDOS
	POP	DE		; Restore register
	RET			; Return

; Print program name string, followed by blank

PNAME:	LD	DE,USAGE	; Point to name string in help message

PNAME1:	LD	A,(DE)		; Reached trailing blank?
	CP	' '
	JR	Z,PCHAR2	; Yes, back to PCHAR to print it

	CALL	PCHAR		; Print name char
	INC	DE		; Point to next
	JR	PNAME1		; Loop until blank delimiter

; Print string on new line, then start another

PRINTX:	CALL	CRLF

; Print string, then start new line

PRINTL:	CALL	PRINTS

; Start new line
; Note:	Must preserve DE

CRLF:	LD	A,CR
	CALL	PCHAR
	LD	A,LF
	CALL	PCHAR

	LD	HL,LPSCT	; Reached end of screen?
	DEC	(HL)
	RET	NZ		; No, return

	LD	A,0		; But are screen pauses enabled?
LPS	EQU	$-1		; (lines per screen = 0 if not)
	OR	A
	RET	Z		; No, return

	LD	(HL),A		; Reset count of lines left
	PUSH	DE		; Save register
	LD	DE,MORE		; Print '[more]' on the new line
	CALL	PRINTS

CRLF1:	CALL	CABORT		; Wait for char (or ^C abort)
	JR	Z,CRLF1

	PUSH	AF		; Save input response
	LD	DE,NOMORE	; Blank out the '[more]' line
	CALL	PRINTS
	POP	AF		; Restore response
	POP	DE		; Restore register
	XOR	' '		; Was response the space bar?
	RET	NZ		; Anything else scrolls another screen

	INC	A		; Yes, set to pause after one more line
	LD	(LPSCT),A
	RET			; Return

	PAGE
; Print string on new line

; Note:	Restricted to at most 5 stack levels (c.f. CHECK).  CRLF will
;	not perform page pause during this restriction, but PCHAR will
;	execute PNAME (during ABOMSG print), so we're now at the limit!

PRINT:	CALL	CRLF

; Print NUL-terminated string

PRINTS:	LD	A,(DE)
	OR	A
	RET	Z

	CALL	P,PCHAR		; (Ignore help msg chars with MSB set)
	INC	DE
	JR	PRINTS

; Output warning message about extracted file

OWARN:	PUSH	DE
	LD	DE,WARN
	CALL	PRINTS
	POP	DE
	JR	PRINTL

	PAGE
; List column titles

; Note:	This saves some much-needed space, by using the same template
;	to generate the title line and the 'equal signs' separator line.

LTITLE:	CALL	CRLF
	LD	DE,TITLES
	PUSH	DE
	LD	A,(DE)

LTITL1:	CP	'='		; For titles, convert '=' to blank
	JR	NZ,LTITL2
	LD	A,' '

LTITL2:	CALL	PCHAR
	INC	DE
	LD	A,(DE)
	OR	A
	JR	NZ,LTITL1

	POP	DE
	CALL	CRLF

LTITL3:	LD	A,(DE)
	OR	A
	JR	Z,CRLF

	CP	' '		; Separator converts non-blank to '='
	JR	Z,LTITL4
	LD	A,'='

LTITL4:	CALL	PCHAR
	INC	DE
	JR	LTITL3

	PAGE
; List file name

; Note:	We use name in output file FCB, rather than original name in
;	archive header (illegal chars already filtered by GETNAM).
;	This routine also called by INIT to unparse ARC file name.

LNAME:	LD	B,12		; Setup count for name, '.', and type

LNAME1:	LD	A,B		; Get count
	CP	4		; At end of name?
	LD	A,'.'
	JR	Z,LNAME2	; Yes, go store separator

	LD	A,(DE)		; Get next char
	INC	DE
	CP	C		; Ignore blanks (possibly)
	JR	Z,LNAME3

LNAME2:	LD	(HL),A		; Store char
	INC	HL

LNAME3:	DJNZ	LNAME1		; Loop for all chars in name and type
	RET			; Return to caller

	PAGE
; Compute and list disk space for uncompressed file

LDISK:	PUSH	HL		; Save line ptr
	LD	HL,(LEN)	; Convert file length to 1k disk space
	LD	A,(LEN+2)	; (Most we can handle here is 16 Mb)
	LD	DE,1023		; First, round up to next 1k
	ADD	HL,DE
	ADC	A,0
	RRA			; Now, shift to divide by 1k
	RR	H
	RRA
	RR	H
	AND	3FH
	LD	L,H		; Result -> HL
	LD	H,A
	LD	A,(LBLKSZ)	; Get disk block size
	DEC	A		; Round up result accordingly
	LD	E,A
	LD	D,0
	ADD	HL,DE
	CPL			; Form mask for lower bits
	AND	L
	LD	E,A		; Final result -> DE
	LD	D,H
	LD	HL,(TDISK)	; Update total disk space used
	ADD	HL,DE
	LD	(TDISK),HL
	POP	HL		; Restore line ptr

LDISK1:	CALL	WTODA		; List result
	LD	(HL),'k'
	INC	HL
	RET

	PAGE
; List stowage method and version

LSTOW:	CALL	FILL2B		; Blanks first
	EX	DE,HL
	LD	HL,STOWTX	; Point to stowage text table
	LD	A,(VER)		; Get header version no.
	PUSH	AF		; Save for next column
	LD	BC,8		; Use to get correct text ptr
	CP	3
	JR	C,LSTOW1
	ADD	HL,BC
	JR	Z,LSTOW1
	ADD	HL,BC
	CP	4
	JR	Z,LSTOW1
	ADD	HL,BC
	CP	9
	JR	C,LSTOW1
	ADD	HL,BC
	JR	Z,LSTOW1
	ADD	HL,BC

LSTOW1:	LDIR			; List stowage text
	EX	DE,HL		; Restore line ptr
	POP	AF		; Recover version no.

LSTOW2:	LD	B,3		; List in 3 cols, blank-filled
	JP	BTODB		;  and return

	PAGE
; List compressed file size and compression factor

LSIZE:	PUSH	DE		; Save compressed size ptr
	PUSH	BC		; Save uncompressed length ptr
	CALL	LTODA		; List compressed size
	POP	DE		; Recover length ptr
	EX	(SP),HL		; Save line ptr, recover size ptr

; Compute compression factor = 100 - [100*size/length]
; (HL = ptr to size, DE = ptr to length, A = result)

	PUSH	DE		; Save length ptr
	CALL	LGET		; Get BCDE = size
	LD	H,B		; Compute 100*size
	LD	L,C		;  in HLIX:
	PUSH	DE
	POP_IX			;     size
	ADD_IX	IX
	ADC_HL	HL		;   2*size
	ADD_IX	DE
	ADC_HL	BC		;   3*size
	ADD_IX	IX
	ADC_HL	HL		;   6*size
	ADD_IX	IX
	ADC_HL	HL		;  12*size
	ADD_IX	IX
	ADC_HL	HL		;  24*size
	ADD_IX	DE
	ADC_HL	BC		;  25*size
	ADD_IX	IX
	ADC_HL	HL		;  50*size
	ADD_IX	IX
	ADC_HL	HL		; 100*size
	EX	(SP),HL		; Swap back length ptr, save upper
	CALL	LGET		; Get BCDE = length
	PUSH_IX
	POP	HL		; Now have (SP),HL = 100*size
	LD	A,B		; Length = 0?
	OR	C		; (Unlikely, but possible)
	OR	D
	OR	E
	JR	Z,LSIZE2	; Yes, go return result = 0

	LD	A,101		; Initialize down counter for result

LSIZE1:	DEC	A		; Divide by successive subtractions
	SBC_HL	DE
	EX	(SP),HL
	SBC_HL	BC
	EX	(SP),HL
	JR	NC,LSIZE1	; Loop until remainder < length

LSIZE2:	POP	HL		; Clean stack
	POP	HL		; Restore line ptr
	CALL	BTODA		; List the factor
	LD	(HL),'%'
	INC	HL
	RET			; Return

	PAGE
; List file creation date

; ARC files use MS-DOS 16-bit date format:
;
; Bits [15:9] = year - 1980
; Bits  [8:5] = month of year
; Bits  [4:0] = day of month
;
; (All zero means no date, checked before call to this routine)

LDATE:	LD	A,(DATE)	; Get date
	AND	1FH		; List day
	CALL	BTODA
	LD	(HL),' '	; Then a blank
	INC	HL
	EX	DE,HL		; Save listing line ptr
	LD	HL,(DATE)	; Get date again
	PUSH	HL		; Save for listing year (in upper byte)
	ADD	HL,HL		; Shift month into upper byte
	ADD	HL,HL
	ADD	HL,HL
	LD	A,H		; Get month
	AND	0FH
	CP	13		; Make sure it's valid
	JR	C,LDATE1
	XOR	A		; (Else will show as "???")
LDATE1:	LD	C,A		; Use to index to 3-byte string table
	LD	B,0
	LD	HL,MONTX
	ADD	HL,BC
	ADD	HL,BC
	ADD	HL,BC
	LD	C,3
	LDIR			; Move month text into listing line
	EX	DE,HL		; Restore line ptr
	LD	(HL),' '	; Then a blank
	INC	HL
	POP	AF		; Recover high byte of date
	SRL	A		; Get 1980-relative year
	ADD	A,80		; Get true year in century

LDATE2:	LD	BC,256*2+'0'	; Setup for 2 digits with high-zero fill
	JR	BTOD		;  and convert binary to decimal ASCII

	PAGE
; List file creation time

; ARC files use MS-DOS 16-bit time format:
;
; Bits [15:11] = hour
; Bits [10:5]  = minute
; Bits  [4:0]  = second/2 (not shown here)

LTIME:	EX	DE,HL		; Save listing line ptr
	LD	HL,(TIME)	; Fetch time
	LD	A,H		; Copy high byte
	RRA			; Get hour
	RRA
	RRA
	AND	1FH
	LD	B,'a'		; Assume am
	JR	Z,LTIME1	; Skip if 0 (12 midnight)

	CP	12		; Is it 1-11 am?
	JR	C,LTIME2	; Yes, skip

	LD	B,'p'		; Else, it's pm
	SUB	12		; Convert to 12-hour clock
	JR	NZ,LTIME2	; Skip if not 12 noon

LTIME1:	LD	A,12		; Convert 0 to 12

LTIME2:	PUSH	BC		; Save am/pm indicator
	ADD	HL,HL		; Shift minutes up to high byte
	ADD	HL,HL
	ADD	HL,HL
	PUSH	HL		; Save minutes
	EX	DE,HL		; Recover listing line ptr
	CALL	LSTOW2		; List hour
	LD	(HL),':'	; Then ":"
	INC	HL
	POP	AF		; Restore and list minutes
	AND	3FH
	CALL	LDATE2
	POP	AF		; Restore and list am/pm letter
	LD	(HL),A
	INC	HL
	RET			; Return

	PAGE
; List hex CRC value

LCRC:	CALL	FILL2B
	LD_DE	(CRC)
	PUSH	HL
	LD	HL,(TCRC)	; Update CRC total
	ADD	HL,DE
	LD	(TCRC),HL
	POP	HL

; List hex word in DE

WHEX:	CALL	DHEX
	LD	D,E

; List hex byte in D

DHEX:	LD	(HL),D
	RLD
	CALL	AHEX
	LD	A,D

; List hex nibble in A

AHEX:	OR	0F0H
	DAA
	CP	60H
	SBC	A,1FH
	LD	(HL),A
	INC	HL
	RET

; A few decimal ASCII conversion callers, for convenience

WTODA:	LD	B,5		; List blank-filled word in 5 cols
WTODB:	LD	C,' '		; List blank-filled word in B cols
	JR	WTOD		; List C-filled word in B cols

BTODA:	LD	B,4		; List blank-filled byte in 4 cols
BTODB:	LD	C,' '		; List blank-filled byte in B cols
	JR	BTOD		; List C-filled byte in B cols

LTODA:	LD	BC,9*256+' '	; List blank-filled long in 9 cols
;	JR	LTOD

	PAGE
; Convert Long (or Word or Byte) Binary to Decimal ASCII
; R. A. Freed
; 2.0	15 Mar 85

; Entry:	A  = Unsigned 8-bit byte value (BTOD)
;		DE = Unsigned 16-bit word value (WTOD)
;		DE = Pointer to low byte of 32-bit long value (LTOD)
;		B  = Max. string length (0 implies 256, i.e. no limit)
;		C  = High-zero fill (0 to suppress high-zero digits)
;		HL = Address to store ASCII byte string
;
; Return:	HL = Adress of next byte after last stored
;
; Stack:	n+1 levels, where n = no. significant digits in output
;
; Notes:	If B > n, (B-n) leading fill chars (C non-zero) stored.
;		If B < n, high-order (n-B) digits are suppressed.
;		If only word or byte values need be converted, use the
;		 shorter version of this routine (WTOD or BTOD) instead.

RADIX	EQU	10		; (Will work with any radix <= 10)

LTOD:	PUSH	DE		; Entry for 32-bit long pointed to by DE
	EXX			; Save caller's regs, swap in alt set
	POP	HL		; Get pointer and fetch value to HADE
	LD	E,(HL)
	INC	HL
	LD	D,(HL)
	INC	HL
	LD	A,(HL)
	INC	HL
	LD	H,(HL)
	EX	DE,HL		; Value now in DAHL
	JR	LTOD1		; Join common code

BTOD:	LD	E,A		; Entry for 8-bit byte in A
	LD	D,0		; Copy to 16-bit word in DE

WTOD:	PUSH	DE		; Entry for 16-bit word in DE, save it
	EXX			; Swap in alt regs for local use
	POP	HL		; Recover value in HL
	XOR	A		; Set to clear upper bits in DE
	LD	D,A

; Common code for all entries

LTOD1:	LD	E,A		; Now have 32-bit value in DEHL
	LD	C,RADIX		; Setup radix for divides
	SCF			; Set first-time flag
	PUSH	AF		; Save for stack emptier when done

	PAGE
; Top of conversion loop

; Method:  Generate output digits on stack in reverse order.  Each loop
; divides the value by the radix.  Remainder is the next output digit,
; quotient becomes the dividend for the next loop.  Stop when get zero
; quotient or no. of digits = max. string length.  (Always generates at
; least one digit, i.e. zero value has one "significant" digit.)

LTOD2:	CALL	DIVLB		; Divide to get next digit
	OR	'0'		; Convert to ASCII (clears carry)
	EXX			; Swap in caller's regs
	DJNZ	LTOD5		; Skip if still more room in string

; All done (value fills string), this is the output loop

LTOD3:	LD	(HL),A		; Store digit in string
	INC	HL		; Bump string ptr

LTOD4:	POP	AF		; Unstack next digit
	JR	NC,LTOD3	; Loop if any

	RET			; Return to caller

; Still more room in string, test if more significant digits

LTOD5:	PUSH	AF		; Stack this digit
	EXX			; Swap back local regs
	LD	A,H		; Last quotient = 0?
	OR	L
	OR	D
	OR	E
	JR	NZ,LTOD2	; No, loop for next digit

; Can stop early (no more digits), handle leading zero-fill (if any)

	EXX			; Swap back caller's regs
	OR	C		; Any leading fill wanted?
	JR	Z,LTOD4		; No, go to output loop

LTOD6:	LD	(HL),A		; Store leading fill
	INC	HL		; Bump string ptr
	DJNZ	LTOD6		; Repeat until fill finished
	JR	LTOD4		; Then go store the digits

	PAGE
	SUBTTL	Miscellaneous Support Routines

; Note:	The following general-purpose routine is currently used in this
;	program only to divide longs by 10 (by decimal convertor, LTOD).
;	Thus, a few unneeded code locations have been commented out.
;	(May be restored if program requirements change.)

; Unsigned Integer Division of Long (or Word or Byte) by Byte
; R. A. Freed

; Divisor in C, dividend in (A)DEHL or (A)HL or L (depends on call used)
; Quotient returned in DEHL (or just HL), remainder in A

;DIVXLB:OR	A		; 40-bit dividend in ADEHL (A < C)
;	JR	NZ,DIVLB1	; Skip if have more than 32 bits

DIVLB:	LD	A,D		; 32-bit dividend in DEHL
	OR	E		; But is it really only 16 bits?
	JR	Z,DIVWB		; Yes, skip (speeds things up a lot)

	XOR	A		; Clear high quotient for first divide

DIVLB1:	CALL	DIVLB2		; Get upper quotient first, then swap:
DIVLB2:	EX	DE,HL		; Upper quotient in DE, lower in HL

DIVXWB:	OR	A		; 24-bit dividend in AHL (A < C)
	JR	NZ,DIVWB1	; Skip if have more than 16 bits

DIVWB:	LD	A,H		; 16-bit dividend in HL
	CP	C		; Will quotient be less than 8 bits?
	JR	C,DIVBB1	; Yes, skip (small dividend speed-up)

	XOR	A		; Clear high quotient

DIVWB1:	LD	B,16		; Setup count for 16-bit divide
	JR	DIVB		; Skip to divide loop

;DIVBB:	XOR	A		; 8-bit dividend in L
DIVBB1:	LD	H,L		; For very small nos., pre-shift 8 bits
	LD	L,0		; High byte of quotient will be zero
	LD	B,8		; Setup count for 8-bit divide

; Top of divide loop (vanilla in-place shift-and-subtract)

DIVB:	ADD	HL,HL		; Divide AHL (B=16) or AH (B=8) by C
	RLA			; Shift out next remainder bit
;	JR	C,DIVB1		; (This needed only for divsors > 128)
	CP	C		; Greater than divisor?
	JR	C,DIVB2		; No, skip (next quotient bit is 0)

DIVB1:	SUB	C		; Yes, reduce remainder
	INC	L		;  and set quotient bit to 1

DIVB2:	DJNZ	DIVB		; Loop for no. bits in quotient
	RET			; Done (quotient in HL, remainder in A)

	PAGE
; Fetch a long (4-byte) value

LGET:	LD	E,(HL)		; Fetch BCDE from (HL)
	INC	HL
	LD	D,(HL)
	INC	HL
	LD	C,(HL)
	INC	HL
	LD	B,(HL)
	RET

; Add two longs

LADD:	LD	B,4		; (DE) + (HL) -> (HL)
	OR	A

LADD1:	LD	A,(DE)
	ADC	A,(HL)
	LD	(HL),A
	INC	HL
	INC	DE
	DJNZ	LADD1

	RET

; Fill routines

FILL2B:	LD	B,2		; Fill 2 blanks

FILLB:	LD	C,' '		; Fill B blanks

FILL:	LD	(HL),C		; Fill B bytes with char in C
	INC	HL
	DJNZ	FILL

	RET

; Convert character to upper case

UPCASE:	CP	'a'
	RET	C
	CP	'z'+1
	RET	NC

	ADD	A,'A'-'a'
	RET

	PAGE
IF NOT Z80

; EXX instruction emulator

EXX:
	IRP	AA,<HL,DE,BC>
	PUSH	AA
	LD	HL,(AA&SAV)
	EX	(SP),HL
	LD	(AA&SAV),HL
	ENDM
	POP	BC
	POP	DE
	POP	HL
	RET

; LDIR instruction emulator

LDIR:	PUSH	AF

LDIR1:	LD	A,(HL)
	LD	(DE),A
	INC	HL
	INC	DE
	DEC	BC
	LD	A,B
	OR	C
	JP	NZ,LDIR1

	POP	AF
	RET

; CPIR instruction emulator

CPIR1:	POP	AF

CPIR:	CP	(HL)
	INC	HL
	DEC	BC
	RET	Z

	PUSH	AF
	LD	A,B
	OR	C
	JP	NZ,CPIR1

	POP	AF
	RET

ENDIF
	PAGE
	SUBTTL	Messages and Initialized Data

IF Z80
NOTZ80:	DB	BEL,'Z80 required!$'
ELSE
USEZ80:	DB	'NOTE: The Z80 version is smaller and faster!',CR,LF,'$'
ENDIF
ABOMSG:	DB	BEL,1,'aborted!',0
CPMERR:	DB	'CP/M version 2 or higher required',0
NOROOM:	DB	'Not enough memory',0
NAMERR:	DB	'Ambiguous archive file name',0
OPNERR:	DB	'Cannot find archive file',0
FMTERR:	DB	'Invalid archive file format',0
HDRERR:	DB	BEL,'Warning: Bad archive file header, bytes skipped = '
HDRSKP:	DB	'00000',0
NOFILS:	DB	'No matching file(s) in archive',0
BADIDR:	DB	'Invalid archive file drive',0
BADODR:	DB	'Invalid output drive',0
ARCMSG:	DB	'Archive File = '
ARCNAM:	DB	'FILENAME.ARC',0
OUTMSG:	DB	'Output Drive = '
OUTDRV:	DB	'A:',0
CHKMSG:	DB	'Checking archive...',0
BADVER:	DB	'Cannot extract file (need newer version of UNARC?)',0
EXISTS:	DB	BEL,'Replace existing output file (y/n)? ',0
DSKFUL:	DB	'Disk full',0
DIRFUL:	DB	'Directory full',0
CLSERR:	DB	'Cannot close output file',0
UCRERR:	DB	'Incompatible crunched file format',0
TYPERR:	DB	'Typeout line limit exceeded',0
WARN:	DB	BEL,'Warning: Extracted file has incorrect ',0
CRCERR:	DB	'CRC',0
LENERR:	DB	'length',0
MORE:	DB	'[more]',0
NOMORE:	DB	CR,'       ',HT,CR,0

; Note:	Tab (HT) added above in UNARC 1.5 for proper following tab
;	expansion (since CP/M 2.2 BDOS does not reset its column
;	position after raw CR output).  The blanks are still generated
;	in case of BDOS implementations which do not expand tabs.

MONTX:	DB	'???JanFebMarAprMayJunJulAugSepOctNovDec'

STOWTX:	DB	'Unpacked'
	DB	' Packed '
	DB	'Squeezed'
	DB	'Crunched'
	DB	'Squashed'
	DB	'Unknown!'

TITLES:	DB	'Name========  =Length  Disk  =Method= Ver =Stored Save'
	DB	'd ===Date== =Time=  CRC='
LINLEN	EQU	$-TITLES
	DB	0

TOTALS:	DB	'        ====  =======  ====               =======  ==='
	DB	'                    ===='
	DB	CR,LF
	DB	'Total  '	; (LINE must follow!)

; .COM file ends here (except for non-Z80 self-unpacking startup code)

COMLEN	EQU	$-TBASE		; Length of initialized code and data

	PAGE
	SUBTTL	Data Storage

; Unitialized data last (does not contribute to .COM file size)

; Note:	Following macro introduced in UNARC 1.5 to avoid use of the
;	assembler DS directive, which generates unneeded records in the
;	.COM file when linked with L80 (unlike SLRNK).  (Also preserves
;	location counter for self-unpacking initialization code in the
;	non-Z80 version.)

DSS	MACRO	SYM,BYTES
SYM	EQU	$D
$D	DEFL	$D+(BYTES)
	ENDM

$D	DEFL	$		; Start of data storage (pseudo PC)
	DSS	LINE,LINLEN+1	; Listing line buffer (follow TOTALS!)

$D	DEFL	$D+(25*2)	; Program stack (25 levels)
STACK	EQU	$D		; (Too small will only garbage listing)

TOTS	EQU	$D		; Start of listing totals
	DSS	TFILES,2	;  Total files processed
	DSS	TLEN,4		;  Total uncompressed bytes
	DSS	TDISK,2		;  Total 1K disk blocks
	DSS	TSIZE,4		;  Total compressed bytes
	DSS	TCRC,2		;  Total of all CRC values
	DSS	LINCT,1		; Line count for file typeout
	DSS	ARKFLG,1	; Default file type flag (allows .ARC)
	DSS	PROUTF,1	; Printer output flag
	DSS	CHECKF,1	; Check archive validity flag
TOTC	EQU	$D-TOTS		; Count of bytes to clear

	DSS	GETPTR,2	; Input buffer pointer
	DSS	LPSCT,1		; Lines per screen counter
	DSS	LBLKSZ,1	; Disk allocation block size for listing
	DSS	TNAME,11	; Test pattern for selecting file names
	DSS	OFCB,@FCBSZ	; Output file FCB
;	DSS	IFCB,@FCBSX	; Input file FCB
IFCB	EQU	DFCB		; (Currently using default FCB instead)

HDRBUF	EQU	$D		; Archive file header buffer...
	DSS	VER,1		; Header version no. (stowage type)
	DSS	NAME,13		; Name string (NUL-terminated)
	DSS	SIZE,4		; Compressed bytes
	DSS	DATE,2		; Creation date
	DSS	TIME,2		; Creation time
	DSS	CRC,2		; Cyclic check of uncompressed file
	DSS	LEN,4		; Uncompressed bytes (version > 1)
HDRSIZ	EQU	$D-HDRBUF	; Header size (4 less if version = 1)

IF NOT Z80

; Data for Z80 instruction emulation

	DSS	HLSAV,2		; HL'
	DSS	DESAV,2		; DE'
	DSS	BCSAV,2		; BC'
	DSS	AFSAV,2		; AF'
	DSS	IXSAV,2		; IX

ENDIF

MINMEM	EQU	$D-1		; Min memory limit (no file output)

	PAGE
; Data for file output processing only

				; Following order required:
	DSS	BUFPAG,1	;  Output buffer start page
	DSS	BUFLIM,1	;  Output buffer limit page

				; Following order required:
	DSS	CODES,1		;  Code count for crunched input
	DSS	BITSAV,1	;  Bits save for crunched input
	DSS	BITS,1		;  Bit count for crunched input

	DSS	STRCT,2		; No. entries in crunched string table

; Tables and buffers for file output
; (All of the following must be page-aligned)

$D	DEFL	($D+255) AND 0FF00H ; Align to page boundary

	DSS	CRCTAB,256*2	; CRC lookup table (256 2-byte values)

BUFF	EQU	$D		; Output buff for non-squeezed/crunched

				; or:

TREE	EQU	$D		; Decoding tree for squeezed files
TREESZ	EQU	256*4		; (256 4-byte nodes)
BUFFSQ	EQU	TREE+TREESZ	; Output buffer for squeezed files

				; or:

STRT	EQU	$D		; String table for crunched files
STRSZ	EQU	4096*3		; (4K 3-byte entries)
BUFFCR	EQU	STRT+STRSZ	; Output buffer for newer crunched files

				; plus (for old-style crunched files):

HSHT	EQU	BUFFCR		; Extra table for hash code chaining
HSHSZ	EQU	4096*2		; (4K 2-byte entries)
BUFFCX	EQU	HSHT+HSHSZ	; Output buffer for older crunched files

				; or (for squashed files):

STQSZ	EQU	8192*3		; (8K 3-byte string table entries)
BUFFCQ	EQU	STRT+STQSZ	; Output buffer for squashed files

	PAGE
IF NOT Z80

; Initialization for self-unpacking archive file (non-Z80 version only)

; Note:	Following is needed only when UNARCA.COM is executed from a
;	self-unpacking archive file.  It is subsequently overlayed by
;	data during program execution, so the only additional run-time
;	overhead for self-unpacking support is the 26 bytes immediately
;	preceding BEGIN.  (The added disk space for this code is also
;	minimal, and none of this is included in the Z80-only version,
;	UNARC.COM, which applies to the majority of users.)

	.PHASE	$+26		; This code is offset 26 bytes in memory

SELFUP:	LD	C,$DISK		; Get current default disk drive no.
	CALL	BDOS		; (archive file drive)
	LD	B,A		; Save default for extracted files
	ADD	A,'A'		; Get ASCII drive letter
	LD	(SELFMD),A	; Store in archive file name message
	LD	DE,DFCB		; Point to default FCB
	LD	A,(DE)		; Disk drive specified on command line?
	OR	A
	JP	NZ,SELFU1	; Yes, skip to use it

	LD	A,B		; Recover default disk no.
	INC	A		; Convert to drive code

SELFU1:	LD	(SELFXD),A	; Store drive code for extracted files
	ADD	A,'A'-1		; Get ASCII drive letter
	LD	(SELFCD),A	; Store in command line
	LD	HL,SELFCB	; Point to fixed internal FCB
	LD	BC,SELFSZ	; Get no. bytes to move to system page
	CALL	SELFMV		; Move down fixed command parameters

	LD	DE,TBASE	; Setup normal .COM file base
	LD	HL,TBASE+26	; Setup current (offset) base in memory
	LD	BC,COMLEN	; Setup .COM file length
	CALL	SELFMV		; Relocate .COM file to its proper place

	LD	(CCPSV),A	; Force reboot later (and max. buffer)
	INC	A		; Set default disk block size to 1K
	LD	(DBLSZ),A	; (e.g., might be running CP/M-68K)

	LD	A,'$'		; Patch usage message
	LD	(USEA),A	;  for program identification
	LD	(USEB),A	;  and copyright displays only
	LD	DE,SELFCR	; Start with a blank display line
	CALL	SELFPR
	LD	DE,USAGE	; Show program id
	CALL	SELFPR
	LD	DE,USEC		; Show copyright
	CALL	SELFPR
	LD	DE,SELFMS	; Show archive file name (new user aid)
	CALL	SELFPR
	JP	BEGIN1		; Go begin (skip Z80 warning note)

; Brute force memory mover (can't use LDIR emulation yet)

SELFMV:	LD	A,(HL)
	LD	(DE),A
	INC	HL
	INC	DE
	DEC	BC
	LD	A,B
	OR	C
	JP	NZ,SELFMV
	RET			; Return with A = 0

; Print message via BDOS (can't use internal print routines yet)

SELFPR:	LD	C,$PRTSTR
	JP	BDOS

; Fixed FCB's and command line for self-unpacking file extraction

SELFCB:	DB	0		; Archive file drive (default always)
	SELF			; Archive file name
	REPT	SELFCB+9-$	; (pad with blanks to 8 chars)
	DB	' '
	ENDM
	DB	'COM'		; Archive file type (always .COM)
	DB	0,0,0,0		; Extent descriptor bytes

SELFXD:	DB	0		; Drive code for file extraction
	DB	'           '	; Files to extract (defaults to *.*)
	DB	0,0,0,0		; Extent descriptor bytes
	DB	0,0,0,0		; Current and random record nos.

	DB	SELFCE-SELFCL	; Command line length (moves to DBUF)
SELFCL:	DB	' '		; Command line tail...
	SELF
	DB	'.COM '		; (e.g. ' UNARC15.COM A: N')
SELFCD:	DB	'A: N'		; (extract all files, no screen pauses)
SELFCE:	DB	0		; (end of command line)

SELFSZ	EQU	$-SELFCB	; Size of fixed command data to move

; Message naming self-unpacking archive file

SELFMS:	DB	CR,LF,LF,'(Self-unpacking file '
SELFMD:	DB	'A:'
	SELF
	DB	'.COM)'
SELFCR:	DB	CR,LF,'$'

	.DEPHASE

; End of special self-unpacking code for non-Z80 version

ENDIF

; That's all, folks!

	IF	($ AND 7FH) NE 0

; Clear out final record of the .COM file
; (Needed only for precise M80/L80 compatibility with Z80ASM/SLRNK)

	REPT	128-($ AND 7FH)
	DB	0
	ENDM

	ENDIF

	END	BEGIN