Bill Moore's Computer Translation Tutorial Page
WHAT IS A SHORT SENTENCE TRANSLATOR?
0. BASIC CONCEPTS. All medical English is divided into
STOP WORDS (punctuation, numerals, articles, prepositions, pronouns,
common adjectives and verbs) and KEY WORDS (everything else).
In the IAD, these words are contained
global-variables on the MUMPS-based translator.
Stop words (particularly of, the, and) are the most common words
in any free-text English document, according to George K. Zipf,
a Harvard professor of humanities, who wrote a classic treatise
on this subject in the late 1940s (mostly using works of James
Joyce), then promptly died, so that the whole world could feel
safe in ignoring his work. Zipf's Law states that, if you rank
the words in a large document in descending order of word-frequency
(i.e., most-frequent-word has rank-one, second-most-frequent-word
has rank-two, etc.), then word-rank is inversely proportional
to word-frequency. According to Zipf's Law, if the rank-one
(i.e., most-frequent) word has, say 15,000 occurrences, then the
rank-two word will have 7,500 occurrences, the rank-three word
will have 3,750 occurrences, etc. There are studies on large
documents in English, German, and Chinese, which demonstrate
this relationship.
Stop words are words that nobody would want to search for
in an index. An important corollary of Zipf's Law is that
the most frequent hundred words in a large document are almost
exclusively stop words, and account for well over half the
verbiage in the document. They clutter up the indexing globals,
and they distract the user from making a well-targeted search.
Many stop words are common to all English (articles, prepositions,
etc.), but some are idiosyncratic to a particular field, and
must be identified and classified by an expert. I am still
debating this, but I think that 'disease', 'patient', and 'syndrome'
are medical stop-words, because they are so common as to be
nearly meaningless. These words would probably NOT be stop words
in any other context. By executive fiat, I have declared all
one-letter, two-letter, and numeric strings as stop-words,
but there are notable exceptions.
1. PREPROCESSING. The first step in indexing a free-text
document is to drop the whole document to lower-case, and perform
a preprocessing step, so that certain important concepts won't be lost.
In the good old days, medical English consisted of numerous,
long Latin words, whose only drawback was that they were
frequently misspelled. Poor spelling in a free-text
document is death to the indexer. These days, we have decent
spelling checkers on electronic documents, but medical English
is filling up with short words that appear as stop words
to the IAD computer program (cd4, p53, etc.). You cannot fail
to index these wordlets, or else many important concepts will become
unavailable to the user. The IAD approach, not necessarily a very
good one, is to perseverate the alphabetic character, so as to form
a three-letter term, which is then saved by the index. Therefore,
p-53 is indexed under ppp, ki-1 is indexed under kii, etc. Concepts
containing stop words, like 't cell' or 'in vivo' are likewise
transformed by global ^IADBPREP() into tcell and invivo.
2. STOP WORDS. After the free-text document has been
preprocessed, it is purged of all stop words, including punctuation,
numerals, articles, prepositions, pronouns, common adjectives
and verbs.
3. EXACT WORDS. The exact form of every key word (i.e.,
non-stop-word) in the document is indexed by my program.
However, this is only the beginning of the indexing process.
4. WORD PREFIXES. There are many medical prefixes in which
the user may recall only the suffix, but would like to locate cases
with the entire word. For example, a user querying for 'carcinoma'
might also wish to locate cases of 'adenocarcinoma', where 'adeno'
is a common medical prefix. Therefore, my program redundantly
indexes 'adenocarcinoma' both as its exact form, ADENOCARCINOMA,
and as its reduced form, CARCINOMA.
5. SINGULAR, COMMON NOUN. It would be annoying for the user
to query for LUNG and miss all cases in which contain LUNGS.
For that matter, the user asking for LUNG should also not miss
cases containing PULMONARY or PULMONIC or even (from the wretched
pedant who wrote up that case) PULMON. Therefore, I maintain
a global which maps each keyword into the most common,
singular, noun form for that word. For example:
lungs^^lung
pulmon^^lung
pulmonaries^^lung
pulmonary^^lung
pulmonic^^lung
Similarly:
livers^^liver
hepar^^liver
hepatic^^liver
hepato^^liver
hepatocellular^^liver^cell
Similarly:
cystic^^cyst
cysts^^cyst
6. HYPHENATED TERMS. Most concepts in medical English
exist as multiple word terms, which may be connected by a blank
space (mitral valve), a hyphen (mitral-valve), or a null-string
(mitralvalve). The word database has undergone this evolution
over the past two decades. It is interesting that English,
which is historically an amalgam of French and Germanic languages,
has a typically French method of expressing a compound-word
concept (hyphenation) and a typically German method (agglomeration).
Space-connected words cannot be recognized by the computer
program unless they appear in an appropriate list, which can be
obtained by the Barrier Word Method. The IAD program indexes
multiple word terms redundantly both as the single component words,
or as the corresponding multiple-words, recognized by global
^IADBMULT().
WHAT IS THE BARRIER WORD METHOD?
In the following sample text, the stop-words are lower case,
and the KEY-WORDS are UPPER CASE:
ACTINIC DEGENERATION . BASOPHILIC DEGENERATION is present
in the UPPER DERMIS . the COLLAGEN BUNDLES there have been
replaced by AMORPHOUS MATERIAL staining faintly BASOPHILIC ,
when stained by HEMATOXYLIN and EOSIN . ELECTRON MICROSCOPIC
EXAMINATION of areas of ACTINIC DEGENERATION shows ELASTOTIC
MATERIAL as the main component . even though this ELASTOTIC
MATERIAL resembles ELASTIC TISSUE in its CHEMICAL COMPOSITION ,
it differs significantly in appearance from AGED ELASTIC FIBERS ,
as seen in UNEXPOSED AGED SKIN .
That is:
stop-words: KEY-WORDS:
. ACTINIC
is DEGENERATION
present BASOPHILIC
in DEGENERATION
the ........ UPPER ........
This division of the universe into stop-words and key-words
supports a valuable trick: the detection of multiple-word terms,
such as ACTINIC DEGENERATION, BASOPHILIC DEGENERATION,
UPPER DERMIS, COLLAGEN BUNDLES, etc. You will notice that in the
above text, the medical multiple-word terms are bounded on either
side by a stop-word barrier (including punctuation, numerals, etc.).
It is an easy matter to write a program which collects a file
of multiple-word terms using this method. You can then revise
the list of stop words and key words, and apply the method to the
document, to refine the list of stop words.
8. SNOMED TRANSLATION. Currently, SNOMED-translation of words
and concepts in the IAD is performed by simple word-substitution
or multiple-word-term substitution on the free-text document
(after the above processing), using an enriched SNOMED synonym
list.
WHY IS SNOMED USED?
SNOMED International, copyright 1994, by the College of American
Pathologists (CAP), is much richer than it was in its 1980s edition,
and is particularly rich in terms used by pathologists. Furthermore,
the current version is rich in Veterinary Medicine terms. The IAD
has a limited license to use 'SNOMED-compatible terms', such as
'Lung, NOS' (NOS=not otherwise specified), but NOT T-28000.
WHY IS UMLS NOT USED FOR THE IAD?
Why not MeSH or UMLS? It is our understanding, that
the products of the U.S. National Library of Medicine,
may not be freely distributed outside the USA. Since the
Internet is an international resource, this restriction
effectively rules out the UMLS for the IAD.
WHY IS ICD NOT USED FOR THE IAD?
How about the International Classification of Disease (ICD)?
This is a product of the World Health Organization, with ABSOLUTELY
NO RESTRICTIONS on worldwide distribution. This is also the coding
system used by the U.S. Health Care Financing Administration (HCFA),
who pay many medical bills in the USA. Only one problem: it's not
as rich in raw terms as SNOMED.
WHY IS READ CLASSIFICATION NOT USED FOR THE IAD?
How about the Read classification, used in the United Kingdom?
This is the intellectual property of Her Majesty, Elizabeth II.
Even single user license are quite expensive.
WHAT IS ZIPF'S LAW?
The extremely high frequency of a few words in any running text
may be characterized by `Zipf's Law'. If one determines
the frequency of each word in the document, and sorts the words
in the descending order of frequency, then the most frequent word
is said to have `rank 1', the second-most frequent word has `rank 2',
etc. Zipf's Law states that word-frequency is inversely proportional
to word-rank. On linear graph paper, the plot of word-frequency
against word-rank is a hyperbola; on log-log graph paper, the plot
of word-frequency against word-rank is a straight line with a
negative slope. Large samples of English, German, and Chinese text
have been shown to satisfy Zipf's Law. The essential feature
of Zipf's Law is that a short list of words account for a large
proportion of all word-occurrences in any natural language text.
Similarly, there is a short list of grammar-formulas, which
account for half of all grammar-formula-occurrences in a given
text. We assert that, as with words, grammar formulas sufficient
to translate a majority of the document belong to a short list,
a sort of `Zipf's Law' for grammar formulas.
The consistent presence of Zipf's Law as a property of both words
and grammar formulas in natural language text suggests a strategy for
constructing both the LEXICON and GRAMMAR. First, one obtains
a frequency distribution for both words and parsing formulas.
Then one should fashion the major design considerations around these
high-frequency elements in the LEXICON or GRAMMAR. Next,
moderate-frequency elements should be fashioned around the models
created for high-frequency forms. Finally, low-frequency forms
which do not fit the existing design may be flagged for pre- or post-
editing.
There is a very general relationship between rank, r, in
a descending order word list and the frequency, f, of each word,
called Zipf's Law. Let r=1 for the most common word, r=2 for the
second-most common word, etc. Then f*r=k for some constant, k.
That is, there is a hyperbolic relationship (straight line with
negative slope on log-log graph paper) between word-rank and
word-frequency in large text documents. This relationship
is approximately true for (1) the works of James Joyce;
(2) the two million words in the century-old autopsy files
of The Johns Hopkins Hospital; (3) nine million German words
from the Heidelberg Cancer Center; and (4) twenty-five million
Chinese characters from the Xinhua Printing Factory in Beijing.
HOW CAN I BUILD A STOPWORD LIST?
You can produce a first-rate list in a few hours. First, guess.
You can obtain a first-draft list of stop words consisting
of punctuation, numerals, single letters, double letters,
articles, conjunctions, prepositions, auxiliary
verbs, and common adjectives and verbs (mild, moderate,
come, exhibit, demonstrate, go, etc.). Next, produce a list
of multiple-word terms consisting of consecutive non-stop-words
bounded on either side by stop words. Finally, eyeball
the multiple-word term list, and new stop words will jump out
at you. Enrich your stop word list with these new stop words,
and repeat the process all over again. There are a few problems
with this method: for example, 'in situ' and 'in vitro' will
not be discovered. These terms must be added manually to your
multiple-word term list.
REFERENCES
Moore GW, Boitnott JK, Miller RE, Eggleston JC,
Hutchins GM. Integrated pathology reporting, indexing, and retrieval
system using natural language diagnoses. Modern Pathol.
1988; 1:44-50.
Berman JJ, Moore GW, Donnelly WH, Massey JK, Craig B. A SNOMED
analysis of three years' accessioned cases of (40,124) of a surgical
pathology department: implications for pathology-based demographic
studies. Journal of the American Medical Informatics Association
(JAMIA), Symposium Supplement 1994 and the Proceedings of the 18th
Annual Symposium for Computer Appllications in Medicine (SCAMC),
pp. 188-192, 1994
Berman JJ, Moore GW. SNOMED-Encoded surgical pathology
databases: a tool for epidemiologic investigation. In press,
Modern Pathology.
Moore GW, Berman JJ, Hanzlick RL, Buchino JJ,
Hutchins GM. A prototype international autopsy
database: 1625 consecutive fetal and neonatal autopsy
facesheets spanning twenty years. Archives Pathol Lab Med
120:782-785, 1996.
Moore GW, Berman JJ. Automatic versus manual SNOMED coding:
quantitating the differences. Am J Clin Pathol 101:253-256, 1994.
Moore GW, Berman JJ. Automatic SNOMED Coding. Journal of the
American Medical Informatics Association (JAMIA), Symposium Supplement
1994 and the Proceedings of the 18th Annual Symposium for Computer
Applications in Medical Care (SCAMC), pp 225-229, 1994
Berman JJ, Moore GW, Donnelly WH, Massey, JK,
Craig B. SNOMED Analysis of 40,124 Surgical Pathology
Cases. Am J Clin Pathol 102:539-540, 1994
Berman JJ, Moore GW. SNOMED-Encoded surgical pathology
databases: a tool for epidemiologic investigation.
Modern Pathology 9:944-950, 1996.
Silverberg SG. SNOMED-Encoded surgical pathology databases:
's no big deal - or is it?. Modern Pathology 9:953-954, 1996
Moore GW, Berman JJ. Automatic SNOMED Coding.
Journal of the American Medical Informatics Association
(JAMIA), Symposium Supplement 1994 and the Proceedings
of the 18th Annual Symposium for Computer Appllications
in Medicine (SCAMC), pp 225-229, 1994
Moore GW, Berman JJ. Automatic versus manual
SNOMED coding: quantitating the differences. Am J
Clin Pathol 101:253-256, 1994
Moore GW, Hutchins GM, Berman JJ. Object oriented
programming system for autopsy retrieval. Modern Pathol
5:100A, 1992 and Lab Invest 66:100A, 1992
(abstract). Moore GW, Berman JJ, Hanzlick RL, Buchino JJ,
Hutchins GM. A prototype national autopsy
databank: 1,625 consecutive fetal and neonatal
autopsy facesheets spanning twenty years.
Conference on Restructuring Autopsy Practice
for Health Care Reform, May 25-26, 1995. Willard
Inter-Continental Hotel, Washington, D.C.
Moore GW, Berman JJ. Object-oriented
English-to-Snomed translator using Transoft + Hyperpad.
Symposium on Computer Applications in Medical Care
15:973-975, 1991
Moore GW, Hutchins GM, Berman JJ. Object-oriented
retrieval system for the Johns Hopkins autopsy database.
MedInfo 92, Lun KC, Degoulet P, Piemme TW, Rienhoff, eds.
Elsevier, Amstedam, 1992, p 1613
Berman JJ. Solving Quality assurance problems
with object scripting languages. Artificial Intelligence
in Medicine 3:161-171, 1991
Moore GW, Boitnott JK, Miller RE, Eggleston JC,
Hutchins GM. Integrated pathology reporting, indexing, and retrieval
system using natural language diagnoses. Modern Pathol.
1988; 1:44-50.
Moore GW, Miller RE, Hutchins GM. Indexing by MeSH titles
of natural language pathology phrases identified on first encounter
using the barrier word method. In: Scherrer JR, Cote RA, Mandil SH,
eds. Computerized Natural Medical Language Processing for Knowledge
Representation. Amsterdam: North-Holland; 1989: 29-39.
Hutchins GM and the Autopsy Committee of the College
of American Pathologists: Practice guidelines for autopsy pathology:
autopsy reporting. Arch Pathol Lab Med. 1995; 119:123-130.
Hutchins GM and the Autopsy Committee of the College
of American Pathologists: Practice guidelines for autopsy
pathology: autopsy reporting. Arch Pathol Lab Med. 1995;
119:123-130.
Cote RA, Rothwell DJ. The classification-nomenclature issues
in medicine: a return to natural language. Medical Informatics
14:25-41, 1989.
Rothwell DJ, Cote RA, Brochu L. The systematized nomenclature
of human and veterinary medicine, SNOMED International Microglossary
for pathology. College of American Pathologists,
Northfield, IL, 1993.
Berman JJ, Moore GW, Donnelly WH, Massey JK, Craig B. SNOMED
Analysis of 40,124 Surgical Pathology Cases. Am J Clin Pathol
102:539-540, 1994.
Moore GW, Miller RE, Hutchins GM: Microcomputer translator
for medical text: theorem verification for chapter two of Zeman's
modal logic. Adv Math Comput Med 7:1621-1633, 1986.
Woods W: Transition network grammars for natural language
analysis. Commun Assn Comp Mach 13:591-606, 1970.
Legreaux RM and Diehl S. Ask Byte: Se Habla ASCII?
Byte. 17(6):365,1992.
Schwanke M. Machinelle Uebersetzung. Ein Ueberblick ueber
Theorie und Praxis. Berlin, Heidelberg, New York: Springer-Verlag,
1991, pp. 11-45.
Pogson G: Controlled English: Enlightenment through Constraint.
Language Technology 6:22-25, 1988.
Ogden CK: The System of Basic English. New York, 1934.
Moore GW, Boitnott JK, Miller RE, Eggleston JC,
Hutchins GM. Integrated pathology reporting, indexing, and
retrieval system using natural language diagnoses. Modern
Pathol. 1988; 1:44-50.
Berman JJ, Moore GW, Donnelly WH, Massey JK, Craig B.
A SNOMED analysis of three years' accessioned cases of
(40,124) of a surgical pathology department: implications
for pathology-based demographic studies. Journal of the
American Medical Informatics Association (JAMIA), Symposium
Supplement 1994 and the Proceedings of the 18th Annual
Symposium for Computer Appllications in Medicine (SCAMC),
pp. 188-192, 1994
Moore GW, Hutchins GM, Miller RE. Strategies for
searching medical language text: Distribution of words
in the anatomical diagnoses of 7000 autopsied patients.
Am J Pathol 115:36-41, 1984.
Moore GW, Hutchins GM, Miller RE. Examination
of disease names using non-Abelian symbolic logic.
Methods Inform. Med. 25:109-115, 1986.
Moore GW, Polacsek RA, Erozan YS, de la Monte SM,
Miller RE, Hutchins GM, Riede UN. Multilingual translation
techniques in the analysis of narrative medical text.
Comput. Programs Biomed. 22:35-42, 1986.
Moore GW, Miller RE, Hutchins GM. Determining cause
of death in 45,564 autopsy reports. Theoret. Med.
9:179-186, 1988.
Moore GW, Wakai I, Satomura Y, Giere W. TRANSOFT:
Medical translation expert system. Artif Intell Med
1:149-157, 1989.
Moore GW, Berman JJ. Performance Analysis of Manual
and Automated Systematized Nomenclature of Medicine
(SNOMED) Coding. Am J Clin Pathol. 101:253-256, 1994.
Moore GW, Miller RE, Hutchins GM, Riede UN,
Polacsek RA. Multilingual translation techniques in the
analysis of narrative medical text. Proc. 9th Ann. Symp.
Comp. Appl. Med. Care, November 10-13, 1985, Baltimore, MD.
Moore GW, Miller RE, Hutchins GM. Determining cause
of death in 45,504 autopsy reports. Workshop on Ethical
and Quality Aspects of Computerized Medical Person
Registers. Stockholm, Sweden, March 26-27, 1987.
Moore GW, Hutchins GM, Boitnott JK, Miller RE,
Polacsek RA. Word root translation of 45,564 autopsy
reports into MeSH titles. Proceedings of Symposium on
Computer Applications in Medical Care, Washington DC,
November 1-4, 1987.
Moore GW, Miller RE, Hutchins GM. Indexing by MeSH
titles of natural language pathology phrases identified
on first encounter using the barrier word method.
In, Scherrer JR, Cote RA, and Mandil SH, eds.,
Computerized Natural Medical Language Processing
for Knowledge Representation. North-Holland,
Amsterdam, 1989.
Tersmette KWF, Scott AF, Moore GW, Matheson NW,
Miller RE. Barrier word method for detecting molecular
biology multiple word terms. Proceedings of 12th Annual
Symposium of Computer Applications in Medical Care,
November 6-9, 1988, Washington, D.C.
Moore GW, Berman JJ. Automatic SNOMED Coding.
Proceedings of JAMIA Eighteenth Annual Symposium
on Computer Applications in Medical Care 18:225, 1994.
Berman JJ, Moore GW, Donnelly WH, Massey JK, Craig B.
A SNOMED analysis of three years' accessioned cases
(40,124) of a surgical pathology department: implications
for pathology-based demographic studies. Symposium
on Computer Applications in Medical Care 18: 188, 1994.
Altman RB, Oliver DE (1994) Extraction of SNOMED
Concepts from Medical Record Texts. Proceedings of JAMIA
Eighteenth Annual Symposium on Computer Applications
in Medical Care. November 5-9, Washington D.C. p. 179, 1994.
Barrows RC, Cimino JJ, Clayton PD. (1994) Mapping Clinically
Useful Terminology to a Controlled Medical Vocabulary. Proceedings
of JAMIA Eighteenth Annual Symposium on Computer Applications
in Medical Care. November 5-9, Washington D.C. p. 211, 1994.
Bell DS, Cimino JJ, Evans DA, Hersh WR, Huff ST. (1994)
Toward a Medical-Concept Representation Language. J Am Med
Informatics Assoc, May-Jun; 1(3):207-217, 1994.
Cimino JJ, Hripcsak G, Johnson BB, Clayton PD: Designing
an Introspective,Controlled Medical Vocabulary, in Kingsland LK,
ed.: Proceedings of the Thirteenth Annual; Symposium on Computer
Applications in Medical Care; Washington, DC;
November 1989:513-518, 1989.
Cimino JJ. Coding systems in health care. In van Bemmel JH,
McCray AT, eds.: Yearbook of Medical Informatics, International
Medical InformaticsAssociation, Rotterdam, 1995:71-85, 1995.
Cimino JJ. Vocabulary and health care information technology:
state of the art. Journal of the American Society for Information
Science;1995;45(10):777-782, 1995.
Sager N, Lyman M, Nhan NT, Tick LJ. (1994) Automatic Encoding
into SNOMED III: A Preliminary Investigation. Proceedings of JAMIA
Eighteenth Annual Symposium on Computer Applications in Medical
Care. November 5-9, Washington D.C. p. 230, 1994.
Wingert F. (1988) Medical Linguistics: Automated Indexing
into SNOMED. Crit Rev Med Inform; 1(4):333-403, 1988.
Wingert F, Rothwell D, Cote RA. (1988) Automated Indexing
into SNOMED and ICD. Proceedings of IFIP-IMIA WG6 International
Working Conference on Computerized Natural Medical Language
Processing for Knowledge Representation, WHO Geneva,
September 12-15, 1988.
Wingert F. (1987) Automated Indexing of SNOMED Statements
into ICD. Methods Inform Med, Jul; 26(3):93-8, 1987.
Wingert F. An Indexing System for SNOMED. Methods Inform Med,
Jan;25(1):22-30, 1986.
Wingert F. Automated Indexing Based on SNOMED.
Methods Inf Med, Jan;24(1): 27-34, 1985.
Moore GW, Boitnott JK, Miller RE, Eggleston JC, Hutchins GM.
Integrated pathology reporting, indexing, and retrieval system
using natural language diagnoses. Modern. Pathol. 1:44-50, 1988.
Berman JJ, Moore GW, Donnelly WH, Massey JK, Craig B.
SNOMED analysis of 40,124 surgical pathology cases.
ASCP/CAP fall, 1994, meeting, Washington, DC.
Moore GW, Miller RE, Hutchins GM. Determining cause
of death in 45,504 autopsy reports. Workshop on Ethical
and Quality Aspects of Computerized Medical Person Registers.
Stockholm, Sweden, March 26-27, 1987.
Moore GW, Hutchins GM, Boitnott JK, Miller RE, Polacsek RA.
Word root translation of 45,564 autopsy reports into MeSH titles.
Proceedings of SCAMC -- Symposium on Computer Applications
in Medical Care, Washington DC, November 1-4, 1987.
Moore GW, Miller RE, Hutchins GM. Indexing by MeSH titles
of natural language pathology phrases identified on first
encounter using the barrier word method. In, Scherrer JR, Cote RA,
and Mandil SH, eds., Computerized Natural Medical Language
Processing for Knowledge Representation. North-Holland,
Amsterdam, 1989.
Moore GW, Berman JJ. Automatic SNOMED Coding. Symposium
on Computer Applications in Medical Care 18: 1994.
Berman JJ, Moore GW, Donnelly WH, Massey JK, Craig B.
A SNOMED analysis of three years' accessioned cases (40,124)
of a surgical pathology department: implications for pathology-
based demographic studies. Symposium on Computer Applications
in Medical Care 18: 1994.
Moore GW, Miller RE, Hutchins GM. Indexing by MeSH titles
of natural language pathology phrases identified on first encounter
using the barrier word method. In: Scherrer JR, Cote RA, Mandil SH,
eds. Computerized Natural Medical Language Processing for Knowledge
Representation. Amsterdam: North-Holland; 1989: 29-39.
Moore GW, Berman JJ. Automatic versus manual SNOMED coding:
quantitating the differences. Am J Clin Pathol 101:253-256, 1994
Giere W, Moore GW. Translating English into German
using VA File Manager. M Computing, 1:16-23, 1993.
Baumann RP: Knowledge representation in an autopsy data
bank. In, Computerized Natural Medical Language Processing for
Knowledge Representation (Cote R, Scherrer J-R, Mandil S, (eds),
North-Holland, Amsterdam, 1988.
Baumann RP and Brandon G: SNOMED in four languages:
English, French, German, and MUMPS. In, MEDINFO 83 (van Bemmel
JH, Ball MJ, Wigertz O (eds), North-Holland, Amsterdam, 1983,
pp 112-115.
Locke, W.N. and Booth, A.D. (eds.): Historical
Introduction. In, Machine Translation of Languages.
The Technology Press of Massachusetts Institute of Technology
and John Wiley & Sons, New York, 1955.
Bar-Hillel B: Language and Information. Selected Essays
on Their Theory and Application. Addison-Wesley Publishing Co.,
Inc., Reading, Mass., 1964.
Chomsky N: Aspects of the Theory of Syntax.
The Massachusetts Institute of Technology Press, Cambridge, Mass., 1965.
Language and Machines. Computers in Translation and
Linguistics. Publication 1416. A report by the Automatic
Language Processing Advisory Committee, Division of Behavioral
Sciences, National Academy of Sciences, National Research Council,
Washington, D.C., 1966, pp. 1-34.
Loh S-C: Machine translation: past, present, and future.
ALLC Bulletin 4:105-114, 1976.
Jordan SR, Brown AFR, Hutton FC: Computerized
Russian translation at ORNL. J. Am. Soc. Info. Sci.
28:26-33 1977.
Hutchins WJ: Progress in documentation. Machine
translation and machine-aided translation. J. Document
34:119-159, 1978.
Carbonell, J.G., Cullingford, R.E., Gershman, A.V.:
Steps toward knowledge-based machine translation. IEEE Trans.
PAMI-3:376-392, 1981.
Winograd, T.: Computer software for working with language.
Sci. Am. 251:130-145, 1984.
Tucker AB Jr: A perspective on machine translation:
Theory and Practice. Commun. ACM 27:322-329, 1984.
Hutchins, W. G.: Machine Translation: Past, Present and
Future. Ellis Horwood, 1986.
Wingert, F. Lecture notes in Medical Informatics. In,
Medical Informatics. An Introduction (Lindberg, D.A.B.
Reichertz, P.L. (eds.)). Springer-Verlag, New York, 1981, pp 22-25.
McCray AT, Sponsler JL, Brylawski B, Browne A: The role of
lexical knowledge in biomedical text understanding. Proceedings
of the Eleventh Annual Symposium on Computer Applications
in Medical Care Nov 1-4, 1987. Washington, DC: IEEE Computer
Society Press. 1987 pp. 103-107.
Bar-Hillel Y: The present status of automatic translation
of languages. Adv Comput l:9l-l63, l960.
Aho AV and Johnson SC: LR parsing. Comput Surveys 6:99-124, 1974.
Bochmann GV: Semantic evaluation from left to right.
Commun ACM l9:55-62, l976.
Pager D: A practical general method for constructing LR(k)
parsers. Acta Inf 7:249-268, l977.
Alekseev VD: Automatic construction of descending left-hand
parsers. Auto Document Math Ling l6:4l-47, l982.
Aho AV and Ullman JD: Principles of Compiler Design.
Addison-Wesley Publishing Co., Reading, Mass, l979.
Barnett GO, Justice NS, et al: COSTAR - A computer-based
medical information system and data language for ambulatory
practices. Computer l2:9-l7, l979.
Horowitz GL, Bleich HL: PaperChase: A computer program
to search the medical literature. N Engl J Med 305:924-930, 1981.
Robboy SJ, Altshuler BS, Chen HY: Retrieval in a computer-
assisted pathology encoding and reporting system (CAPER).
Am J Clin Pathol 75:654-661, 1981.
Wong RL, Reno JD, Hain TC, Platt RC, Gaynon PS, Joseph DM:
Profile of a dictionary compiled from scanning over one million
words of surgical pathology narrative text. Comput Biomed Res
13:382-398, 1980.
Salton G: Dynamic Information and Library Processing.
Prentice-Hall, Inc, Englewood Cliffs, NJ, 1975.
Giere W: Foundations of clinical data automation in
cooperative programs. 5th Ann Symp Comp Applic Med Care
(Heffernan HG, ed). Washington, DC, 1981, pp. 1142-1148.
Zhang Q: Easy entry of Chinese character set symbols.
Proc 5th Ann Symp Comp Appl Med Care (Heffernan HG, ed).
Washington, D.C., 1981, pp 143-149.
Zipf GK: On the Economy of Words. Chapter 2 in, Human
Behavior and The Principle of Least Effort. An Introduction
to Human Ecology. Addison-Wesley Press, Inc., Cambridge, MA,
1949, pp. 19-55.
Condon EU: Statistics of vocabulary. Science 67:300, 1928.
Josselson HH: Automatic translation of languages since
1960: A linguist's view. Adv Comput 11:1-58, 1971.
Joshi AK. Natural language processing.
Science 253:1242-1249.
Aho AV, Ullman JD: Principles of Compiler Design.
Addison-Wesley Publishing Co., Reading, Mass, 1979.
Sager N, Grisham R: The restriction language for
computer grammars of natural language. Commun ACM 18:390-400, 1975.
Hirschman L, Story G, Marsh E, Lyman M and Sager N:
An experiment in automated health care evaluation from narrative
medical records. Comput Biomed Res 14:447-463, 1981.
Story G, Hirschman L: Data base design for natural
language medical data. J Med Syst 6:77-88, 1982.
Sager N, Bross IDJ, Story G, Bastedo P, Marsh E, Shedd D:
Automated encoding of clinical narrative. Comput Biol Med
12:43-56, 1982.
Horowitz GL and Bleich HL: PaperChase: A computer program
to search the medical literature. N Engl J Med 305:924-930, 1981.
Robboy SJ, Altshuler BS, Chen HY: Retrieval in a computer-
assisted pathology encoding and reporting system (CAPER).
Am J Clin Pathol 75:654-661, 1981.
Ledley RS and Lusted LB: Reasoning foundations of medical
diagnosis. Science 130:9-21, 1959.
Kayser K: Logic and diagnosis.
Methods Inf Med 14:76-80, 1975.
Vasconcellos M: Overview: machine translation.
BYTE 18:152-166, 1993.
Hovy E: How MT works. BYTE 18:167-176, 1993.
Resource guide: machine-translation software.
BYTE 18:185-186, 1993.
Pogson G: Controlled English: Enlightenment through
Constraint. Language Technology 6:22-25, 1988.
Ogden CK: The System of Basic English. New York, 1934.
Schwanke M: Machinelle Uebersetzung. Ein Ueberblick
ueber Theorie und Praxis. Berlin, Heidelberg, New York:
Springer-Verlag, 1991, pp. 91-95.
Woods W: Transition network grammars for natural language
analysis. Commun Assn Comp Mach 13:591-606, 1970.
Vasconcellos M: Overview: machine translation.
BYTE 18:152-166, 1993.
Nirenburg S (ed): Machine Translation. Theoretical and
Methodological Issues. London: Cambridge University Press, 1987.
Vasconcellos M, Leon M: SPANAM and ENGSPAN:
Machine translation at the Pan American Health Organization.
Comput Linguist 11:122-136, 1985.
INTERNET: The Oxford Text Archive: ftp://ota.ox.ac.uk.
A fantastic collection of various wordlists, including
an 8 megabyte unabridged dictionary; various English language
lists (British, American, Australian, Shakespeare), as well as
lists in Afrikaans, Chinese, Croatian, Danish, Dutch, Esperanto,
Finnish, French, German, Hindi, Hungarian, Italian, Japanese,
Latin, Norwegian, Polish, Russian, Spanish, Swahili, Swedish,
Yiddish.
Vasconcellos M: Overview: machine translation.
BYTE 18:152-166, 1993.
Hovy E: How MT works. BYTE 18:167-176, 1993.
Resource guide: machine-translation software.
BYTE 18:185-186, 1993.
King M: Are There any Lessons to be Learned from Machine
Translation? In, Scherrer JR, Ct, RA, and Mandil SH (eds.),
Computerized Natural Medical Language Processing for Knowledge
Representation. Amsterdam: Elsevier Science Publishers B.V.
pp. 73-82, 1989.
Nirenburg, S. (ed.): Machine Translation. Theoretical
and Methodological Issues. London: Cambridge University Press, 1987.
Hutchins WJ: Machine Translation: Past, Present, Future.
Chichester: Ellis Horwood Ltd, 1986.
Vasconcellos M and Leon M: SPANAM and ENGSPAN: Machine
translation at the Pan American Health Organization. Comput Linguist
11:122-136, 1985.
Tucker A: A perspective on machine translation: theory
and practice. Commun Assn Comp Mach 27:322-329, 1984.
Davis RG: FileMan: A User Manual. Bethesda, MD:
National Association of VA Physicians, 1987.
Woods W: Transition network grammars for natural language
analysis. Commun Assn Comp Mach 13:591-606, 1970.
Barnett GO, Cimino JJ, Hupp JA, Hoffer EP: DXplain, an
evolving diagnostic decision support system. JAMA 258:67-74, 1987.
Zipf GK: On the economy of words. In, Human Behavior
and the Principle of Least Effort, ch. 2, p.19. Cambridge, MA:
Addison-Wesley Press, Inc., 1949.
Moore GW, Boitnott JK, Miller RE, Eggleston JC, Hutchins GM:
Integrated pathology reporting, indexing, and retrieval system using
natural language diagnoses. Modern Pathol 1:44-50, 1988.
Giere W: Foundations of clinical data automation
in cooperative programs. 5th Ann Symp Comp Applic Med Care
(Heffernan HG, ed.) Washington, DC: IEEE Computer Society Press,
1981. pp. 1142-1148.
Zhang Q: Easy entry of Chinese character set symbols.
5th Ann Symp Comp Applic Med Care (Heffernan HG, ed.) Washington,
DC: IEEE Computer Society Press, 1981. pp. 143-149.
Giere W: Befunddokumentation und Arztbriefschreibung
im Krankenhaus. Taunusstein: Media, 1986.
Giere W: Foundations of clinical data automation
in cooperative programs. 5th Ann Symp Comp Applic Med Care
(Heffernan HG, ed.) Washington, DC: IEEE Computer Society Press,
1981. pp. 1142-1148.
Zhang Q: Easy entry of Chinese character set symbols.
5th Ann Symp Comp Applic Med Care (Heffernan HG, ed.)
Washington, DC: IEEE Computer Society Press, 1981. pp. 143-149.
Tucker A: A perspective on machine translation: theory
and practice. Commun Assn Comp Mach 27:322-329, 1984.
Vasconcellos M and Leon M: SPANAM and ENGSPAN: Machine
translation at the Pan American Health Organization. Comput Linguist
11:122-136, 1985.
Hutchins WJ: Machine Translation: Past, Present, Future.
Chichester: Ellis Horwood Ltd, 1986.
Moore GW, Riede UN, Polacsek RA, Miller RE, and Hutchins GM:
Automated translation of German to English medical text. Am J Med
81:103-111, 1986.
Davis RG: FileMan: A User Manual. Bethesda, MD: National
Association of VA Physicians, 1987.
Nirenburg, S. (ed.): Machine Translation. Theoretical
and Methodological Issues. London: Cambridge University Press, 1987.
Moore GW, Boitnott JK, Miller RE, Eggleston JC, Hutchins GM:
Integrated pathology reporting, indexing, and retrieval system
using natural language diagnoses. Modern Pathol 1:44-50, 1988.
Wilks Y: A preferential, pattern-seeking, semantics
for natural language inference. Artif. Intel. 6:53-74, 1975.
Arnold D: General view of the design methodology.
In: Multilingua 5(1986)3: special issue. HL Somers (ed.)
Amsterdam: Mouton de Gruyter. EUROTRA Special Issue, pp. 136-139.
King M: Are there any lessons to be learned from machine
translation? Proceedings of the International Working Conference,
World Health Organization, Geneva, Switzerland, 1988, pp 2.1-2.10.
Offerhaus GJA, Tersmette AC, Moore GW, Hershey J, Polacsek RA:
Dutch respelling rules for English and German medical word lists.
Method. Inform. Med. 26:99-103, 1987.
Yu C-Y, Moore GW, Unschuld PU: Romanized Chinese respelling
rules for an English medical word list. Proceedings from SCAMC,
Symposium on Computer Applications in Medical Care, Washington, DC,
November 1-4, 1987, pp 133-136.
Wingert F. Lecture notes in Medical Informatics. In, Medical
Informatics. An Introduction (Lindberg DAB & Reichertz PL (eds.)).
Springer-Verlag, New York, 1981, pp 22-25.
McCray AT, Sponsler JL, Brylawski B, Browne A: The role
of lexical knowledge in biomedical text understanding. Proceedings
of the Eleventh Annual Symposium on Computer Applications in Medical Care
Nov 1-4, 1987. Washington, D.C.: IEEE Computer Society Press. 1987
pp 103-107.
Josselson HH: Automatic translation of languages since 1960:
A linguist's view. Adv Comput 11:1-58, 1971.
Chomsky N: Aspects of the Theory of Syntax. The Massachusetts
Institute of Technology Press, Cambridge, Mass., 1965.
Chomsky N: Language and Mind. Harcourt Brace Jovanovich,
New York, 1968.
Loh S-C: Machine translation: past, present, and future.
ALLC Bulletin 4:l05-ll4, 1976.
Wilks Y: An intelligent analyzer and understander of English.
Commun ACM l8:264-274, l975.
Wilks Y: A preferential, pattern-seeking, semantics for
natural language inference. Artif Intel 6:53-74, l975.
Wilks Y: Frames for machine translation. New Scientist
??:802-803, 1977.
Hutchins WJ: Progress in documentation. Machine translation
and machine-aided translation. J Document 34:119-159, 1978.
Garfield E.: Current comments. Artificial intelligence:
Using computers to think about thinking. Part l. Representing
knowledge. Curr Contents 49:5-17, 1983.
Schank RC, Childers PG: Experiments in Artificial
Intelligence. Computerworld 10:1-28, 1984.
Knuth DE: A characterization of parenthesis languages.
Inf Control 11:269-289, l967.
Bell JR: A new method for determining linear precedence
functions for precedence grammars. Commun ACM l2:3l6-333, l969.
Earley J: An efficient context-free parsing algorithm.
Commun ACM 13:94-102, 1970.
Aho AV, Johnson SC: LR parsing. Comput Surveys 6:99-,24, 1974.
Bochmann GV: Semantic evaluation from left to right.
Commun ACM 19:55-62, 1976.
Pager D: A practical general method for constructing
LR(k) parsers. Acta Inf 7:249-268, 1977.
Alekseev VD: Automatic construction of descending
left-hand parsers. Auto Document Math Ling l6:41-47, 1982.
Aho AV, Ullman JD: Principles of Compiler Design.
Addison-Wesley Publishing Co., Reading, Mass, l979.
Sager N, Grisham R: The restriction language for computer
grammars of natural language. Commun ACM 18:390-400, 1975.
Hirschman L, Story G, Marsh E, Lyman M, Sager N:
An experiment in automated health care evaluation from narrative
medical records. Comput Biomed Res 14:447-463, 1981.
Story G and Hirschman L: Data base design for natural language
medical data. J Med Syst 6:77-88, 1982.
Sager N, Bross IDJ, Story G, Bastedo P, Marsh E, Shedd D:
Automated encoding of clinical narrative.
Comput Biol Med 12:43-56, 1982.
Wolff S: The use of morphosemantic regularities in the
medical vocabulary for automatic lexical coding. Methods Inf Med
23:195-203, 1984.
Fenichel RR and Barnett GO: An application-independent
subsystem for free-text analysis. Comput Biomed Res 9:159-167, 1976.
Weiner JM: Text analysis and basic concept structures.
Inf Proc Mgt 19:313-319, 1983.
Moore GW, Haupt HM, Hutchins GM: A hypothesis test for causal
explanations in human pathology: Evaluation of pulmonary edema in 181
autopsied patients with leukemia. Math Biosci 62:253-279, 1982.
Moore GW and Hutchins GM: Symbolic logic analysis
of congenital heart disease. Pathol Res Pract 171:59-85, 1981.
Moore GW, Hutchins GM: Symbolic logic analysis of cause
of death in humans: Application to l08 patients after coronary
artery bypass graft surgery. J Theor Biol 92:267-291, 1981.
Riede UN, Moore GW, Sandritter W: Symbolic logic as a new
method in quantitative organelle pathology. Exp Mol Pathol
33:259-282, 1980.
Riede UN and Moore GW: Symbolic logic as a new method
in pathobiology and toxicology. CRC Press, 1983.
Moore GW, Hutchins GM: Consistency versus completeness
in medical decision making: Application to l55 patients autopsied
after coronary artery bypass graft surgery. Med Inf 8:197-207, 1983.
Moore GW and Hutchins GM: A Hintikka possible worlds model
for certainty levels in medical decision making.
Synthese 48:89-119, 1981.
Moore GW, Hutchins GM, Bulkley BH: Certainty levels in the
nullity method of symbolic logic: Application to the pathogenesis
of congenital heart malformations. J Theor Biol 76:53-81, 1979.
Gladkij AV and Mel'cuk IA: Elements of Mathematical
Linguistics. (Lehrberger, J., ed.). Mouton Publishers, Berlin, 1983.
Feigenbaum EA and McCorduck P: The Fifth Generation:
Artificial Intelligence and Japan's Computer Challenge to the World.
Addison Wesley, Reading, Ma., 1983.
Becker JD: Multilingual word processing.
Sci Am 25l:96-107, 1984.
Walther von AJ: Translators gain fluency. Multilingual word
processors point to universal communication.
InfoWorld 6:35-37, 1984.
Bulkeley WM: Computers gain as language translators even
though perfect not they always. Wall Street Journal,
February 6, 1985, p 29.
Tucker AB, Jr: A perspective on machine translation:
Theory and Practice. Commun ACM 27:322-329, 1984.
Jordan SR, Brown, AFR, Hutton FC: Computerized Russian
translation at ORNL. J Am Soc Inf Sci 28:26-33, 1977.
Garcia-Hidalgo I, Dunham G: An experiment in English
Spanish automated translation of medical language data.
Methods Inf Med 20:38-46, 1981.
Hoffman La Roche (editor): Roche Lexikon Medizin,
Urban & Schwarzenberg, Munich, l984.
Dunham GS, Pacak MG, Pratt AW: Automated indexing
of pathology data. J Am Soc Inf Sci 29:81-90, 1978.
Platt RC, Wong RL, Lantner KW, Gaynon PS:
Polars: A pathology on-line logging and reporting system.
Comput Biomed Res 7:83-99, 1974.
Sharpe TC, Clark DE: General purpose information
handling techniques for pathological data. Comput Biol Med
5:221-233, 1975.
Okubo RS, Russell WS, Dimsdale B, Lamson BG:
Natural language storage and retrieval of medical diagnostic
information. Experience at the UCLA Hospital and Clinics
over a 10-year period. Comput Prog Biomed 75:105-130, 1975.
Frieder G and Slocum PJ: The lexical attributes of medical
name files. Comput Biol Med 8:81-87, 1978.
Wong RL, Reno JD, Hain TC, Platt RC, Gaynon PS, Joseph DM:
Profile of a dictionary compiled from scanning over one million words
of surgical pathology narrative text.
Comput Biomed Res 13:382-398, 1980.
Hieb BR: A word processor-based pathology reporting system.
Am J Clin Pathol 75:357-363, 1981.
Aller RD: Computers in anatomic pathology.
Clin Lab Med 3:133-147, 1983.
Coles EC, Slavin G: An evaluation of automatic coding
of surgical pathology reports. J Clin Pathol 29:621-625, 1976.
Aller RD, Robboy SJ, Poitras JW, Altshuler BS, Cameron M,
Prior MC, Miao S, and Barnett GO: Computer-assisted pathology
encoding and reporting system (CAPER). An on-line computer system
developed at the Massachusetts General Hospital.
Am J Clin Pathol 68:715-720, 1977.
Foulis PR, Norbut AM, Mendelow H and Kessler, G.F.: Pathology
accessioning and retrieval system with encoding by computer (PARSEC).
Am J Clin Pathol 73:748-753, 1980.
Robboy SJ, Altshuler BS and Chen HY: Retrieval in a
computer-assisted pathology encoding and reporting system (CAPER).
Am J Clin Pathol 75:654-661, 1981.
Cote RA, Robboy S: Progress in medical information management:
Systematized nomenclature of medicine (SNOMED).
JAMA 243:756-762, 1980.
Blekhman MS: Formalization of inexplicit anaphoric links
in systems for processing natural-language texts (with reference
to English-language texts). Auto Document Math Ling 16:98-109, 1982.
Allen JF, Perrault CR: Analyzing intention in dialogues.
Artif Intel 15:143-178, 1980.
Blois MS: Clinical judgement and computers.
N Engl J Med 303:192-197, 1980.
Damerau FJ: A technique for computer detection and correction
of spelling errors. Commun ACM 7:171-176, 1964.
Zamora EM, Pollock JJ and Zamora A: The use of trigram
analysis for spelling error detection. Inf Proc Mgt 17:305-316, 1983.
Hall PAV and Dowling GR: Approximate string matching.
Comput Surveys 12:381-402, 1980.
Peterson JL: Computer programs for detecting and correcting
spelling errors. Commun ACM 23:676-687, 1980.
Durham I, Lamb DA and Saxe JB: Spelling correction in user
interfaces. Commun ACM 26:764-773, 1983.
Pollack JJ, Zamora A: Automatic spelling correction
in scientific and scholarly text. Commun ACM 27:358-368, 1984.
Humphrey SM: File maintenance of MeSH Headings in MEDLINE.
J Am Soc Inf Sci 35:34-44, 1984.
Moore GW, Hutchins GM, Miller RE: Strategies for searching
medical natural language text: Distribution of words in the
anatomical diagnoses of 7000 autopsied patients.
Am J Pathol 115:36-41, 1984.
Arbib MA and Manes EG: Monoids and Groups. Chapter 3 in,
Arrows, Structures and Functors. The Categorial Imperative.
Academic Press, New York, 1975.
Wos L, Overbeek R, Lusk E and Boyle J: Automated Reasoning.
Introduction and Applications. Prentice Hall, Englewood Cliffs,
New Jersey, 1984.
Clocksin WF, Mellish CS: Programming in Prolog.
Springer-Verlag, Heidelberg, 1981.
Malpas J: Programming in logic. Dr Dobb's J l0:36-4l, l985.
von Hansemann D: Ueber das konditionale Denken in der Medizin
und seine Bedeutung fr die Praxis. August Hirschwald Verlag,
Berlin, 1912, pp. 1-31.
Sadegh-zadeh K: Grundlagenproblem einer Theorie
der klinischen Praxis. Teil l: Explikation des medizinischen
Diagnosebegriffs. Metamed 1:76-102, 1977.
Ledley RS, Lusted LB: Reasoning foundations of medical
diagnosis. Science 130:9-21, 1959.
Warner HR, Toronto AF, Veasey LG, Stephenson R:
A mathematical approach to medical diagnosis. Application
to congenital heart disease. JAMA l77:177-183, 1961.
Stein MA, Winter J: Theory development in medical
decision-making. Int J Biomed Comput 5:147-159, 1974.
Kayser K: Logic and diagnosis.
Methods Inf Med 14:76-80, 1975.
Weiss SM, Kulikowski CA, Amarel S, Safir A: A model-based
method for computer-aided medical decision making.
Artif Intel 11:145-172, 1978.
Weiss SM, Kulikowski CA and Safir A: Glaucoma consultation
by computer. Comput Biol Med 8:25-40, 1978.
Pauker SG, Gorry GA, Kassirer JP, Schwartz WB: Towards
the simulation of clinical cognition. Taking a present illness by
computer. Am J Med 60:981-996, 1976.
Szolovitz P, Pauker SG: Categorical and probabilistic
reasoning in medical diagnosis. Artif Intel 11:115-144, 1978.
Shortliffe EH, Buchanan BG: A model of inexact reasoning
in medicine. Math Biosci 23:351-379, 1975.
Shortliffe EH, Buchanan BG and Feigenbaum EA: Knowledge
engineering for medical decision making: A review of computer-based
clinical decision aids. Proc IEEE 67:1207-1224, 1979.
Duda RO and Shortliffe EH: Expert systems research.
Science 220:261-268, 1983.
Pauker SG and Kassirer JP: The threshold approaches
to clinical decision making. N Engl J Med 302:1109-1117, 1980.
Schwartz WB, Wolfe HJ and Pauker SG: Pathology
and probabilities. A new approach to interpreting and reporting
biopsies. N Engl J Med 305:917-923, 1981.
Miller RA, Pople HE Jr and Myers JD: Internist-I,
an experimental computer-based diagnostic consultant for
general internal medicine. N Engl J Med 307:468-476, 1982.
Miller RA: Internist-l/Caduceus: Problems facing expert
consultant programs. Methods Inf Med 23:9-l4, l984.
Halpern JY and Rabin MO: A logic to reason about likelihood.
IBM Research Report RJ 4313, 1984.
Zeman JJ: Modal Logic. The Lewis-Modal Systems,
Oxford at the Clarendon Press, London, 1973.
Anderson JM and Johnstone HW Jr: Natural Deduction.
The Logical Basis of Axiom Systems. Wadsworth Publishing Co, Inc.,
Belmont, CA, 1962.
Quine WV: Theory of Deducation, Harvard Cooperative Society,
Cambridge, Mass, 1948, pp 65-81.
Quine WV: Methods of Logic. Henry Holt & Co, New York, 1950.
Quine WV: The problem of simplifying truth functions.
Am Math Monthly 59:521-531, 1952.
Quine WV: A way to simplify truth functions. Am Math Monthly
62:627-631, 1955.
McCluskey EJ Jr: Minimization of Boolean Functions.
Bell Syst Tech J 36:1417-1444, 1956.
Swets JA: Information retrieval systems.
Science 141:245-250, 1963.
Lusted LB: Introduction to Medical Decision Making.
Charles C Thomas, Springfield, Ill, 1968, pp. 98-140.
Lusted LB: Decision-making studies in patient management.
N Engl J Med 284:416-424, 1971.
Swets JA: Signal Detection in Medical Diagnosis. In, Computer
Diagnosis and Diagnostic Methods (Jacquez JA, ed). Charles C Thomas,
Springfield, Ill, 1972, pp. 8-28.
Meadow CT: The Analysis of Information Systems.
Second Edition. Melville Publishing Co., Los Angeles, CA, 1973.
Lancaster FW: Evaluating the Effectiveness of Information
Services. Chapter 9, In, Information Retrieval Systems:
Characteristics, Testing and Evaluation. Second Edition.
John Wiley & Sons, New York, 1979, pp 121-139.
Date CJ: An Introduction to Database Systems. Addison-Wesley
Publishing Co., Reading, Mass., 1975.
Greenes RA, Pappalardo AN, Marble CW, Barnett GO: Design and
implementation of a clinical data management system. Comput Biomed
Res 2:469-485, 1969.
Bowie J and Barnett GO: MUMPS - An economical and efficient
time-sharing system for information management. Comput Programs
Biomed 6:11-22, 1976.
Dayhoff R (editor): ANS MUMPS Programmers' Reference Manual
1981 and ANS MUMPS Language Standard. MUMPS Users Group,
Washington, DC, 1981.
Watanabe T, Ohsawa T, Suzuki T: A simple database language
for personal computers. Commun ACM 26: 646-653, 1983.
Barnett GO, Justice NS, et al: COSTAR - A computer-based
medical information system and data language for ambulatory
practices. Computer 12:9-17, 1979.
Horowitz GL and Bleich HL: PaperChase: A computer program
to search the medical literature. N Engl J Med 305:924-930, 1981.
Robboy SJ, Altshuler BS, Chen HY: Retrieval in a
computer-assisted pathology encoding and reporting system (CAPER).
Am J Clin Pathol 75:654-661, 1981.
Hutchins WG: Machine Translation: Past, Present and Future.
Ellis Horwood, 1986.
Arnold D: General view of the design methodology. In:
Multilingua 5(1986)3: special issue. HL Somers (ed.) Amsterdam:
Mouton de Gruyter. EUROTRA Special Issue pp. 136-139.
Evans DA, Hobbs JR, Miller PL, Carbonell JG: New directions
for natural language processing in biomedicine. In: Proceedings
of the Eleventh Annual Symposium on Computer Applications
in Medical Care Nov 1-4, 1987.
Salton G: Dynamic Information and Library Processing.
Prentice-Hall, Inc, Englewood Cliffs, NJ, 1975.
Canadian Government Terminology Bank TERMIUM. Booklet
prepared by Terminology Directorate Translation Bureau, Dept.
of the Secretary of State, Ottawa, CANADA, 1985.
Giere W: Foundations of clinical data automation
in cooperative programs. 5th Ann Symp Comp Applic Med Care
(Heffernan HG, ed). Washington, DC, 1981, pp. 1142-1148.
Zhang Q: Easy entry of Chinese character set symbols.
Proc 5th Ann Symp Comp Appl Med Care (Heffernan HG, ed).
Washington, D.C., 1981, pp 143-149.
Zipf, GK: On the Economy of Words. Chapter 2 in,
Human Behavior and The Principle of Least Effort.
An Introduction to Human Ecology. Addison-Wesley Press, Inc.,
Cambridge, MA, 1949, pp 19-55.
Condon EU: Statistics of vocabulary. Science 67:300, 1928.
Chen Y-S and Leimkuhler FF: A relationship between Lotka's
Law, Bradford's Law, and Zipf's Law.
J Am Soc Info Sci 37:307-314, 1986.
Giere W: Foundations of clinical data automation
in cooperative programs. 5th Ann Symp Comp Applic Med Care
(Heffernan HG, ed). Washington, DC, 1981, pp. 1142-1148.
Hall PAV and Dowling GR: Approximate string matching.
ACM Comput Surv 12:381-402, 1980.
Moore GW, Riede UN, Polacsek RA, Miller RE, Hutchins GM:
Automated translation of German to English medical text.
Am J Med 81:103-111, 1986.
Robboy SJ, Altshuler BS and Chen HY: Retrieval in a
computer-assisted pathology encoding and reporting system (CAPER).
Am J Clin Pathol 75: 654-66l, 1981.
Taft RL: Name Search Techniques. Bureau of Systems
Development, New York State Identification and Intelligence System,
Albany, New York, 1984.
Zipf, GK: On the Economy of Words. Chapter 2 in, Human
Behavior and The Principle of Least Effort. An Introduction
to Human Ecology. Addison-Wesley Press, Inc., Cambridge, MA, 1949,
pp. 19-55.
Sun M: Japanese scientific and technological literature
Information: The demand in the U.S. remains low.
Science 238:1032-1033, 1987.
Sun M.: Translation service throws in the towel.
Science 239:1482, 1988.
Kayser K: Logic and diagnosis.
Methods Inf Med 14:76-80, 1975.
Weiss SM, Kulikowski CA, Amarel S, Safir A: A model-based
method for computer-aided medical decision making.
Artif Intel 11:145-172, 1978.
Weiss SM, Kulikowski CA and Safir A: Glaucoma consultation
by computer. Comput Biol Med 8:25-40, 1978.
Pauker SG, Gorry GA, Kassirer JP, Schwartz WB: Towards the
simulation of clinical cognition. Taking a present illness
by computer. Am J Med 60:981-996, 1976.
Szolovitz P, Pauker SG: Categorical and probabilistic
reasoning in medical diagnosis. Artif Intel 11:115-144, 1978.
Shortliffe EH, Buchanan BG: A model of inexact reasoning
in medicine. Math Biosci 23:351-379, 1975.
Shortliffe EH, Buchanan BG and Feigenbaum EA: Knowledge
engineering for medical decision making: A review of computer-based
clinical decision aids. Proc IEEE 67:1207-1224, 1979.
Duda RO, Shortliffe EH: Expert systems research.
Science 220:261-268, 1983.
Pauker SG and Kassirer JP: The threshold approaches
to clinical decision making. N Engl J Med 302:1109-1117, 1980.
Schwartz WB, Wolfe HJ and Pauker SG: Pathology and
probabilities. A new approach to interpreting and reporting
biopsies. N Engl J Med 305:917-923, 1981.
Blum RL: Discovery, confirmation, and incorporation
of causal relationships from a large time-oriented clinical
data base: The RX project. Comput Biomed Res 15:164-187, 1982.
Miller RA, Pople HE Jr and Myers JD: Internist-I,
an experimental computer-based diagnostic consultant for general
internal medicine. N Engl J Med 307:468-476, 1982.
Miller RA: Internist-l/Caduceus: Problems facing expert
consultant programs. Methods Inf Med 23:9-14, 1984.
Halpern JY, Rabin MO: A logic to reason about likelihood.
IBM Research Report RJ 4313, 1984.
Zeman JJ: Modal Logic. The Lewis-Modal Systems,
Oxford at the Clarendon Press, London, 1973.
Anderson JM, Johnstone HW Jr: Natural Deduction.
The Logical Basis of Axiom Systems. Wadsworth Publishing Co, Inc,
Belmont, CA, l962.
Quine WV: Theory of Deducation, Harvard Cooperative Society,
Cambridge, Mass, 1948, pp 65-81.
Quine WV: Methods of Logic. Henry Holt & Co, New York, 1950.
Quine WV: The problem of simplifying truth functions.
Am Math Monthly 59:521-531, 1952.
Quine WV: A way to simplify truth functions. Am Math Monthly
62:627-631, 1955.
McCluskey EJ Jr: Minimization of Boolean Functions.
Bell Syst Tech J 36:1417-1444, 1956.
Swets JA: Information retrieval systems.
Science 141:245-250, 1963.
Lusted LB: Introduction to Medical Decision Making.
Charles C Thomas, Springfield, Ill, 1968, p 98-140.
Lusted LB: Decision-making studies in patient management.
N Engl J Med 284:416-424, 1971.
Swets JA: Signal Detection in Medical Diagnosis. In, Computer
Diagnosis and Diagnostic Methods (Jacquez JA, ed). Charles C Thomas,
Springfield, Ill, 1972, p 8-28.
Meadow CT: The Analysis of Information Systems. Second
Edition. Melville Publishing Co., Los Angeles, CA, 1973.
Date CJ: An Introduction to Database Systems.
Addison-Wesley Publishing Co., Reading, Mass., 1975.
Greenes RA, Pappalardo AN, Marble CW, Barnett GO: Design and
implementation of a clinical data management system. Comput Biomed
Res 2:469-485, 1969.
Bowie J and Barnett GO: MUMPS - An economical and efficient
time-sharing system for information management. Comput Programs
Biomed 6:11-22, 1976.
Moore GW, Miller RE, Hutchins GM. Indexing by MeSH titles
of natural language pathology phrases identified on first
encounter using the barrier word method. In: Scherrer JR,
Cote RA, Mandil SH, eds. Computerized Natural Medical Language
Processing for Knowledge Representation. Amsterdam:
North-Holland; 1989: 29-39.
CONFIDENTIALITY AND PRIVACY.
Schneier B. Applied Cryptography. Protocols, Algorithms,
and Source Code in C. New York: John Wiley & Sons, Inc. 1996.
Berman JJ, Moore GW. SNOMED-Encoded surgical pathology
databases: a tool for epidemiologic investigation.
Modern Pathology 9:944-950, 1996.
Berman JJ, Moore GW, Hutchin GM. Maintaining patient
confidentiality in the public domain internet autopsy
database. Journal of the American Medical Informatics
Association (JAMIA), Symposium Supplement, pp 328-332, 1996.
Clayton EW, Steinberg KK, Khoury MJ, Thomson E,
Andrews L, Kahn MJ, Kopelman LM, Weiss JO. Informed consent
for genetic research on stored tissue samples. JAMA 1996;1786-92.
RETURN TO BILL MOORE'S HOME PAGE.
Return to Bill Moore's Home Page
ADDITIONAL INFORMATION.
For additional information, contact Bill Moore at:
gwmoore@erols.com
Last Updated: July 3, 1999.