Bill Moore's Computer Translation Tutorial Page


  • WHAT IS A SHORT SENTENCE TRANSLATOR?
    0. BASIC CONCEPTS. All medical English is divided into STOP WORDS (punctuation, numerals, articles, prepositions, pronouns, common adjectives and verbs) and KEY WORDS (everything else). In the IAD, these words are contained global-variables on the MUMPS-based translator. Stop words (particularly of, the, and) are the most common words in any free-text English document, according to George K. Zipf, a Harvard professor of humanities, who wrote a classic treatise on this subject in the late 1940s (mostly using works of James Joyce), then promptly died, so that the whole world could feel safe in ignoring his work. Zipf's Law states that, if you rank the words in a large document in descending order of word-frequency (i.e., most-frequent-word has rank-one, second-most-frequent-word has rank-two, etc.), then word-rank is inversely proportional to word-frequency. According to Zipf's Law, if the rank-one (i.e., most-frequent) word has, say 15,000 occurrences, then the rank-two word will have 7,500 occurrences, the rank-three word will have 3,750 occurrences, etc. There are studies on large documents in English, German, and Chinese, which demonstrate this relationship.
    Stop words are words that nobody would want to search for in an index. An important corollary of Zipf's Law is that the most frequent hundred words in a large document are almost exclusively stop words, and account for well over half the verbiage in the document. They clutter up the indexing globals, and they distract the user from making a well-targeted search. Many stop words are common to all English (articles, prepositions, etc.), but some are idiosyncratic to a particular field, and must be identified and classified by an expert. I am still debating this, but I think that 'disease', 'patient', and 'syndrome' are medical stop-words, because they are so common as to be nearly meaningless. These words would probably NOT be stop words in any other context. By executive fiat, I have declared all one-letter, two-letter, and numeric strings as stop-words, but there are notable exceptions.
    1. PREPROCESSING. The first step in indexing a free-text document is to drop the whole document to lower-case, and perform a preprocessing step, so that certain important concepts won't be lost.
    In the good old days, medical English consisted of numerous, long Latin words, whose only drawback was that they were frequently misspelled. Poor spelling in a free-text document is death to the indexer. These days, we have decent spelling checkers on electronic documents, but medical English is filling up with short words that appear as stop words to the IAD computer program (cd4, p53, etc.). You cannot fail to index these wordlets, or else many important concepts will become unavailable to the user. The IAD approach, not necessarily a very good one, is to perseverate the alphabetic character, so as to form a three-letter term, which is then saved by the index. Therefore, p-53 is indexed under ppp, ki-1 is indexed under kii, etc. Concepts containing stop words, like 't cell' or 'in vivo' are likewise transformed by global ^IADBPREP() into tcell and invivo.
    2. STOP WORDS. After the free-text document has been preprocessed, it is purged of all stop words, including punctuation, numerals, articles, prepositions, pronouns, common adjectives and verbs.
    3. EXACT WORDS. The exact form of every key word (i.e., non-stop-word) in the document is indexed by my program. However, this is only the beginning of the indexing process.
    4. WORD PREFIXES. There are many medical prefixes in which the user may recall only the suffix, but would like to locate cases with the entire word. For example, a user querying for 'carcinoma' might also wish to locate cases of 'adenocarcinoma', where 'adeno' is a common medical prefix. Therefore, my program redundantly indexes 'adenocarcinoma' both as its exact form, ADENOCARCINOMA, and as its reduced form, CARCINOMA.
    5. SINGULAR, COMMON NOUN. It would be annoying for the user to query for LUNG and miss all cases in which contain LUNGS. For that matter, the user asking for LUNG should also not miss cases containing PULMONARY or PULMONIC or even (from the wretched pedant who wrote up that case) PULMON. Therefore, I maintain a global which maps each keyword into the most common, singular, noun form for that word. For example:
    lungs^^lung
    pulmon^^lung
    pulmonaries^^lung
    pulmonary^^lung
    pulmonic^^lung

    Similarly:
    livers^^liver
    hepar^^liver
    hepatic^^liver
    hepato^^liver
    hepatocellular^^liver^cell

    Similarly:
    cystic^^cyst
    cysts^^cyst
    6. HYPHENATED TERMS. Most concepts in medical English exist as multiple word terms, which may be connected by a blank space (mitral valve), a hyphen (mitral-valve), or a null-string (mitralvalve). The word database has undergone this evolution over the past two decades. It is interesting that English, which is historically an amalgam of French and Germanic languages, has a typically French method of expressing a compound-word concept (hyphenation) and a typically German method (agglomeration).
    Space-connected words cannot be recognized by the computer program unless they appear in an appropriate list, which can be obtained by the Barrier Word Method. The IAD program indexes multiple word terms redundantly both as the single component words, or as the corresponding multiple-words, recognized by global ^IADBMULT().

    WHAT IS THE BARRIER WORD METHOD?
    In the following sample text, the stop-words are lower case, and the KEY-WORDS are UPPER CASE:
    ACTINIC DEGENERATION . BASOPHILIC DEGENERATION is present in the UPPER DERMIS . the COLLAGEN BUNDLES there have been replaced by AMORPHOUS MATERIAL staining faintly BASOPHILIC , when stained by HEMATOXYLIN and EOSIN . ELECTRON MICROSCOPIC EXAMINATION of areas of ACTINIC DEGENERATION shows ELASTOTIC MATERIAL as the main component . even though this ELASTOTIC MATERIAL resembles ELASTIC TISSUE in its CHEMICAL COMPOSITION , it differs significantly in appearance from AGED ELASTIC FIBERS , as seen in UNEXPOSED AGED SKIN .
    That is:
    stop-words: KEY-WORDS:
    . ACTINIC
    is DEGENERATION
    present BASOPHILIC
    in DEGENERATION
    the ........ UPPER ........

    This division of the universe into stop-words and key-words supports a valuable trick: the detection of multiple-word terms, such as ACTINIC DEGENERATION, BASOPHILIC DEGENERATION, UPPER DERMIS, COLLAGEN BUNDLES, etc. You will notice that in the above text, the medical multiple-word terms are bounded on either side by a stop-word barrier (including punctuation, numerals, etc.). It is an easy matter to write a program which collects a file of multiple-word terms using this method. You can then revise the list of stop words and key words, and apply the method to the document, to refine the list of stop words.
    8. SNOMED TRANSLATION. Currently, SNOMED-translation of words and concepts in the IAD is performed by simple word-substitution or multiple-word-term substitution on the free-text document (after the above processing), using an enriched SNOMED synonym list.
    WHY IS SNOMED USED?
    SNOMED International, copyright 1994, by the College of American Pathologists (CAP), is much richer than it was in its 1980s edition, and is particularly rich in terms used by pathologists. Furthermore, the current version is rich in Veterinary Medicine terms. The IAD has a limited license to use 'SNOMED-compatible terms', such as 'Lung, NOS' (NOS=not otherwise specified), but NOT T-28000.

    WHY IS UMLS NOT USED FOR THE IAD?
    Why not MeSH or UMLS? It is our understanding, that the products of the U.S. National Library of Medicine, may not be freely distributed outside the USA. Since the Internet is an international resource, this restriction effectively rules out the UMLS for the IAD.

    WHY IS ICD NOT USED FOR THE IAD?
    How about the International Classification of Disease (ICD)? This is a product of the World Health Organization, with ABSOLUTELY NO RESTRICTIONS on worldwide distribution. This is also the coding system used by the U.S. Health Care Financing Administration (HCFA), who pay many medical bills in the USA. Only one problem: it's not as rich in raw terms as SNOMED.

    WHY IS READ CLASSIFICATION NOT USED FOR THE IAD?
    How about the Read classification, used in the United Kingdom? This is the intellectual property of Her Majesty, Elizabeth II. Even single user license are quite expensive.

    WHAT IS ZIPF'S LAW?
    The extremely high frequency of a few words in any running text may be characterized by `Zipf's Law'. If one determines the frequency of each word in the document, and sorts the words in the descending order of frequency, then the most frequent word is said to have `rank 1', the second-most frequent word has `rank 2', etc. Zipf's Law states that word-frequency is inversely proportional to word-rank. On linear graph paper, the plot of word-frequency against word-rank is a hyperbola; on log-log graph paper, the plot of word-frequency against word-rank is a straight line with a negative slope. Large samples of English, German, and Chinese text have been shown to satisfy Zipf's Law. The essential feature of Zipf's Law is that a short list of words account for a large proportion of all word-occurrences in any natural language text.
    Similarly, there is a short list of grammar-formulas, which account for half of all grammar-formula-occurrences in a given text. We assert that, as with words, grammar formulas sufficient to translate a majority of the document belong to a short list, a sort of `Zipf's Law' for grammar formulas.
    The consistent presence of Zipf's Law as a property of both words and grammar formulas in natural language text suggests a strategy for constructing both the LEXICON and GRAMMAR. First, one obtains a frequency distribution for both words and parsing formulas. Then one should fashion the major design considerations around these high-frequency elements in the LEXICON or GRAMMAR. Next, moderate-frequency elements should be fashioned around the models created for high-frequency forms. Finally, low-frequency forms which do not fit the existing design may be flagged for pre- or post- editing.
    There is a very general relationship between rank, r, in a descending order word list and the frequency, f, of each word, called Zipf's Law. Let r=1 for the most common word, r=2 for the second-most common word, etc. Then f*r=k for some constant, k. That is, there is a hyperbolic relationship (straight line with negative slope on log-log graph paper) between word-rank and word-frequency in large text documents. This relationship is approximately true for (1) the works of James Joyce; (2) the two million words in the century-old autopsy files of The Johns Hopkins Hospital; (3) nine million German words from the Heidelberg Cancer Center; and (4) twenty-five million Chinese characters from the Xinhua Printing Factory in Beijing.

    HOW CAN I BUILD A STOPWORD LIST?
    You can produce a first-rate list in a few hours. First, guess. You can obtain a first-draft list of stop words consisting of punctuation, numerals, single letters, double letters, articles, conjunctions, prepositions, auxiliary verbs, and common adjectives and verbs (mild, moderate, come, exhibit, demonstrate, go, etc.). Next, produce a list of multiple-word terms consisting of consecutive non-stop-words bounded on either side by stop words. Finally, eyeball the multiple-word term list, and new stop words will jump out at you. Enrich your stop word list with these new stop words, and repeat the process all over again. There are a few problems with this method: for example, 'in situ' and 'in vitro' will not be discovered. These terms must be added manually to your multiple-word term list.
  • REFERENCES
  • Moore GW, Boitnott JK, Miller RE, Eggleston JC, Hutchins GM. Integrated pathology reporting, indexing, and retrieval system using natural language diagnoses. Modern Pathol. 1988; 1:44-50.
  • Berman JJ, Moore GW, Donnelly WH, Massey JK, Craig B. A SNOMED analysis of three years' accessioned cases of (40,124) of a surgical pathology department: implications for pathology-based demographic studies. Journal of the American Medical Informatics Association (JAMIA), Symposium Supplement 1994 and the Proceedings of the 18th Annual Symposium for Computer Appllications in Medicine (SCAMC), pp. 188-192, 1994
  • Berman JJ, Moore GW. SNOMED-Encoded surgical pathology databases: a tool for epidemiologic investigation. In press, Modern Pathology.
  • Moore GW, Berman JJ, Hanzlick RL, Buchino JJ, Hutchins GM. A prototype international autopsy database: 1625 consecutive fetal and neonatal autopsy facesheets spanning twenty years. Archives Pathol Lab Med 120:782-785, 1996.
  • Moore GW, Berman JJ. Automatic versus manual SNOMED coding: quantitating the differences. Am J Clin Pathol 101:253-256, 1994.
  • Moore GW, Berman JJ. Automatic SNOMED Coding. Journal of the American Medical Informatics Association (JAMIA), Symposium Supplement 1994 and the Proceedings of the 18th Annual Symposium for Computer Applications in Medical Care (SCAMC), pp 225-229, 1994
  • Berman JJ, Moore GW, Donnelly WH, Massey, JK, Craig B. SNOMED Analysis of 40,124 Surgical Pathology Cases. Am J Clin Pathol 102:539-540, 1994
  • Berman JJ, Moore GW. SNOMED-Encoded surgical pathology databases: a tool for epidemiologic investigation. Modern Pathology 9:944-950, 1996.
  • Silverberg SG. SNOMED-Encoded surgical pathology databases: 's no big deal - or is it?. Modern Pathology 9:953-954, 1996
  • Moore GW, Berman JJ. Automatic SNOMED Coding. Journal of the American Medical Informatics Association (JAMIA), Symposium Supplement 1994 and the Proceedings of the 18th Annual Symposium for Computer Appllications in Medicine (SCAMC), pp 225-229, 1994
  • Moore GW, Berman JJ. Automatic versus manual SNOMED coding: quantitating the differences. Am J Clin Pathol 101:253-256, 1994
  • Moore GW, Hutchins GM, Berman JJ. Object oriented programming system for autopsy retrieval. Modern Pathol 5:100A, 1992 and Lab Invest 66:100A, 1992 (abstract). Moore GW, Berman JJ, Hanzlick RL, Buchino JJ,
  • Hutchins GM. A prototype national autopsy databank: 1,625 consecutive fetal and neonatal autopsy facesheets spanning twenty years. Conference on Restructuring Autopsy Practice for Health Care Reform, May 25-26, 1995. Willard Inter-Continental Hotel, Washington, D.C.
  • Moore GW, Berman JJ. Object-oriented English-to-Snomed translator using Transoft + Hyperpad. Symposium on Computer Applications in Medical Care 15:973-975, 1991
  • Moore GW, Hutchins GM, Berman JJ. Object-oriented retrieval system for the Johns Hopkins autopsy database. MedInfo 92, Lun KC, Degoulet P, Piemme TW, Rienhoff, eds. Elsevier, Amstedam, 1992, p 1613
  • Berman JJ. Solving Quality assurance problems with object scripting languages. Artificial Intelligence in Medicine 3:161-171, 1991
  • Moore GW, Boitnott JK, Miller RE, Eggleston JC, Hutchins GM. Integrated pathology reporting, indexing, and retrieval system using natural language diagnoses. Modern Pathol. 1988; 1:44-50.
  • Moore GW, Miller RE, Hutchins GM. Indexing by MeSH titles of natural language pathology phrases identified on first encounter using the barrier word method. In: Scherrer JR, Cote RA, Mandil SH, eds. Computerized Natural Medical Language Processing for Knowledge Representation. Amsterdam: North-Holland; 1989: 29-39.
  • Hutchins GM and the Autopsy Committee of the College of American Pathologists: Practice guidelines for autopsy pathology: autopsy reporting. Arch Pathol Lab Med. 1995; 119:123-130.
  • Hutchins GM and the Autopsy Committee of the College of American Pathologists: Practice guidelines for autopsy pathology: autopsy reporting. Arch Pathol Lab Med. 1995; 119:123-130.
  • Cote RA, Rothwell DJ. The classification-nomenclature issues in medicine: a return to natural language. Medical Informatics 14:25-41, 1989.
  • Rothwell DJ, Cote RA, Brochu L. The systematized nomenclature of human and veterinary medicine, SNOMED International Microglossary for pathology. College of American Pathologists, Northfield, IL, 1993.
  • Berman JJ, Moore GW, Donnelly WH, Massey JK, Craig B. SNOMED Analysis of 40,124 Surgical Pathology Cases. Am J Clin Pathol 102:539-540, 1994.
  • Moore GW, Miller RE, Hutchins GM: Microcomputer translator for medical text: theorem verification for chapter two of Zeman's modal logic. Adv Math Comput Med 7:1621-1633, 1986.
  • Woods W: Transition network grammars for natural language analysis. Commun Assn Comp Mach 13:591-606, 1970.
  • Legreaux RM and Diehl S. Ask Byte: Se Habla ASCII? Byte. 17(6):365,1992.
  • Schwanke M. Machinelle Uebersetzung. Ein Ueberblick ueber Theorie und Praxis. Berlin, Heidelberg, New York: Springer-Verlag, 1991, pp. 11-45.
  • Pogson G: Controlled English: Enlightenment through Constraint. Language Technology 6:22-25, 1988.
  • Ogden CK: The System of Basic English. New York, 1934.
  • Moore GW, Boitnott JK, Miller RE, Eggleston JC, Hutchins GM. Integrated pathology reporting, indexing, and retrieval system using natural language diagnoses. Modern Pathol. 1988; 1:44-50.
  • Berman JJ, Moore GW, Donnelly WH, Massey JK, Craig B. A SNOMED analysis of three years' accessioned cases of (40,124) of a surgical pathology department: implications for pathology-based demographic studies. Journal of the American Medical Informatics Association (JAMIA), Symposium Supplement 1994 and the Proceedings of the 18th Annual Symposium for Computer Appllications in Medicine (SCAMC), pp. 188-192, 1994
  • Moore GW, Hutchins GM, Miller RE. Strategies for searching medical language text: Distribution of words in the anatomical diagnoses of 7000 autopsied patients. Am J Pathol 115:36-41, 1984.
  • Moore GW, Hutchins GM, Miller RE. Examination of disease names using non-Abelian symbolic logic. Methods Inform. Med. 25:109-115, 1986.
  • Moore GW, Polacsek RA, Erozan YS, de la Monte SM, Miller RE, Hutchins GM, Riede UN. Multilingual translation techniques in the analysis of narrative medical text. Comput. Programs Biomed. 22:35-42, 1986.
  • Moore GW, Miller RE, Hutchins GM. Determining cause of death in 45,564 autopsy reports. Theoret. Med. 9:179-186, 1988.
  • Moore GW, Wakai I, Satomura Y, Giere W. TRANSOFT: Medical translation expert system. Artif Intell Med 1:149-157, 1989.
  • Moore GW, Berman JJ. Performance Analysis of Manual and Automated Systematized Nomenclature of Medicine (SNOMED) Coding. Am J Clin Pathol. 101:253-256, 1994.
  • Moore GW, Miller RE, Hutchins GM, Riede UN, Polacsek RA. Multilingual translation techniques in the analysis of narrative medical text. Proc. 9th Ann. Symp. Comp. Appl. Med. Care, November 10-13, 1985, Baltimore, MD.
  • Moore GW, Miller RE, Hutchins GM. Determining cause of death in 45,504 autopsy reports. Workshop on Ethical and Quality Aspects of Computerized Medical Person Registers. Stockholm, Sweden, March 26-27, 1987.
  • Moore GW, Hutchins GM, Boitnott JK, Miller RE, Polacsek RA. Word root translation of 45,564 autopsy reports into MeSH titles. Proceedings of Symposium on Computer Applications in Medical Care, Washington DC, November 1-4, 1987.
  • Moore GW, Miller RE, Hutchins GM. Indexing by MeSH titles of natural language pathology phrases identified on first encounter using the barrier word method. In, Scherrer JR, Cote RA, and Mandil SH, eds., Computerized Natural Medical Language Processing for Knowledge Representation. North-Holland, Amsterdam, 1989.
  • Tersmette KWF, Scott AF, Moore GW, Matheson NW, Miller RE. Barrier word method for detecting molecular biology multiple word terms. Proceedings of 12th Annual Symposium of Computer Applications in Medical Care, November 6-9, 1988, Washington, D.C.
  • Moore GW, Berman JJ. Automatic SNOMED Coding. Proceedings of JAMIA Eighteenth Annual Symposium on Computer Applications in Medical Care 18:225, 1994.
  • Berman JJ, Moore GW, Donnelly WH, Massey JK, Craig B. A SNOMED analysis of three years' accessioned cases (40,124) of a surgical pathology department: implications for pathology-based demographic studies. Symposium on Computer Applications in Medical Care 18: 188, 1994.
  • Altman RB, Oliver DE (1994) Extraction of SNOMED Concepts from Medical Record Texts. Proceedings of JAMIA Eighteenth Annual Symposium on Computer Applications in Medical Care. November 5-9, Washington D.C. p. 179, 1994.
  • Barrows RC, Cimino JJ, Clayton PD. (1994) Mapping Clinically Useful Terminology to a Controlled Medical Vocabulary. Proceedings of JAMIA Eighteenth Annual Symposium on Computer Applications in Medical Care. November 5-9, Washington D.C. p. 211, 1994.
  • Bell DS, Cimino JJ, Evans DA, Hersh WR, Huff ST. (1994) Toward a Medical-Concept Representation Language. J Am Med Informatics Assoc, May-Jun; 1(3):207-217, 1994.
  • Cimino JJ, Hripcsak G, Johnson BB, Clayton PD: Designing an Introspective,Controlled Medical Vocabulary, in Kingsland LK, ed.: Proceedings of the Thirteenth Annual; Symposium on Computer Applications in Medical Care; Washington, DC; November 1989:513-518, 1989.
  • Cimino JJ. Coding systems in health care. In van Bemmel JH, McCray AT, eds.: Yearbook of Medical Informatics, International Medical InformaticsAssociation, Rotterdam, 1995:71-85, 1995.
  • Cimino JJ. Vocabulary and health care information technology: state of the art. Journal of the American Society for Information Science;1995;45(10):777-782, 1995.
  • Sager N, Lyman M, Nhan NT, Tick LJ. (1994) Automatic Encoding into SNOMED III: A Preliminary Investigation. Proceedings of JAMIA Eighteenth Annual Symposium on Computer Applications in Medical Care. November 5-9, Washington D.C. p. 230, 1994.
  • Wingert F. (1988) Medical Linguistics: Automated Indexing into SNOMED. Crit Rev Med Inform; 1(4):333-403, 1988.
  • Wingert F, Rothwell D, Cote RA. (1988) Automated Indexing into SNOMED and ICD. Proceedings of IFIP-IMIA WG6 International Working Conference on Computerized Natural Medical Language Processing for Knowledge Representation, WHO Geneva, September 12-15, 1988.
  • Wingert F. (1987) Automated Indexing of SNOMED Statements into ICD. Methods Inform Med, Jul; 26(3):93-8, 1987.
  • Wingert F. An Indexing System for SNOMED. Methods Inform Med, Jan;25(1):22-30, 1986.
  • Wingert F. Automated Indexing Based on SNOMED. Methods Inf Med, Jan;24(1): 27-34, 1985.
  • Moore GW, Boitnott JK, Miller RE, Eggleston JC, Hutchins GM. Integrated pathology reporting, indexing, and retrieval system using natural language diagnoses. Modern. Pathol. 1:44-50, 1988.
  • Berman JJ, Moore GW, Donnelly WH, Massey JK, Craig B. SNOMED analysis of 40,124 surgical pathology cases. ASCP/CAP fall, 1994, meeting, Washington, DC.
  • Moore GW, Miller RE, Hutchins GM. Determining cause of death in 45,504 autopsy reports. Workshop on Ethical and Quality Aspects of Computerized Medical Person Registers. Stockholm, Sweden, March 26-27, 1987.
  • Moore GW, Hutchins GM, Boitnott JK, Miller RE, Polacsek RA. Word root translation of 45,564 autopsy reports into MeSH titles. Proceedings of SCAMC -- Symposium on Computer Applications in Medical Care, Washington DC, November 1-4, 1987.
  • Moore GW, Miller RE, Hutchins GM. Indexing by MeSH titles of natural language pathology phrases identified on first encounter using the barrier word method. In, Scherrer JR, Cote RA, and Mandil SH, eds., Computerized Natural Medical Language Processing for Knowledge Representation. North-Holland, Amsterdam, 1989.
  • Moore GW, Berman JJ. Automatic SNOMED Coding. Symposium on Computer Applications in Medical Care 18: 1994.
  • Berman JJ, Moore GW, Donnelly WH, Massey JK, Craig B. A SNOMED analysis of three years' accessioned cases (40,124) of a surgical pathology department: implications for pathology- based demographic studies. Symposium on Computer Applications in Medical Care 18: 1994.
  • Moore GW, Miller RE, Hutchins GM. Indexing by MeSH titles of natural language pathology phrases identified on first encounter using the barrier word method. In: Scherrer JR, Cote RA, Mandil SH, eds. Computerized Natural Medical Language Processing for Knowledge Representation. Amsterdam: North-Holland; 1989: 29-39.
  • Moore GW, Berman JJ. Automatic versus manual SNOMED coding: quantitating the differences. Am J Clin Pathol 101:253-256, 1994
  • Giere W, Moore GW. Translating English into German using VA File Manager. M Computing, 1:16-23, 1993.
  • Baumann RP: Knowledge representation in an autopsy data bank. In, Computerized Natural Medical Language Processing for Knowledge Representation (Cote R, Scherrer J-R, Mandil S, (eds), North-Holland, Amsterdam, 1988.
  • Baumann RP and Brandon G: SNOMED in four languages: English, French, German, and MUMPS. In, MEDINFO 83 (van Bemmel JH, Ball MJ, Wigertz O (eds), North-Holland, Amsterdam, 1983, pp 112-115.
  • Locke, W.N. and Booth, A.D. (eds.): Historical Introduction. In, Machine Translation of Languages. The Technology Press of Massachusetts Institute of Technology and John Wiley & Sons, New York, 1955.
  • Bar-Hillel B: Language and Information. Selected Essays on Their Theory and Application. Addison-Wesley Publishing Co., Inc., Reading, Mass., 1964.
  • Chomsky N: Aspects of the Theory of Syntax. The Massachusetts Institute of Technology Press, Cambridge, Mass., 1965.
  • Language and Machines. Computers in Translation and Linguistics. Publication 1416. A report by the Automatic Language Processing Advisory Committee, Division of Behavioral Sciences, National Academy of Sciences, National Research Council, Washington, D.C., 1966, pp. 1-34.
  • Loh S-C: Machine translation: past, present, and future. ALLC Bulletin 4:105-114, 1976.
  • Jordan SR, Brown AFR, Hutton FC: Computerized Russian translation at ORNL. J. Am. Soc. Info. Sci. 28:26-33 1977.
  • Hutchins WJ: Progress in documentation. Machine translation and machine-aided translation. J. Document 34:119-159, 1978.
  • Carbonell, J.G., Cullingford, R.E., Gershman, A.V.: Steps toward knowledge-based machine translation. IEEE Trans. PAMI-3:376-392, 1981.
  • Winograd, T.: Computer software for working with language. Sci. Am. 251:130-145, 1984.
  • Tucker AB Jr: A perspective on machine translation: Theory and Practice. Commun. ACM 27:322-329, 1984.
  • Hutchins, W. G.: Machine Translation: Past, Present and Future. Ellis Horwood, 1986.
  • Wingert, F. Lecture notes in Medical Informatics. In, Medical Informatics. An Introduction (Lindberg, D.A.B. Reichertz, P.L. (eds.)). Springer-Verlag, New York, 1981, pp 22-25.
  • McCray AT, Sponsler JL, Brylawski B, Browne A: The role of lexical knowledge in biomedical text understanding. Proceedings of the Eleventh Annual Symposium on Computer Applications in Medical Care Nov 1-4, 1987. Washington, DC: IEEE Computer Society Press. 1987 pp. 103-107.
  • Bar-Hillel Y: The present status of automatic translation of languages. Adv Comput l:9l-l63, l960.
  • Aho AV and Johnson SC: LR parsing. Comput Surveys 6:99-124, 1974.
  • Bochmann GV: Semantic evaluation from left to right. Commun ACM l9:55-62, l976.
  • Pager D: A practical general method for constructing LR(k) parsers. Acta Inf 7:249-268, l977.
  • Alekseev VD: Automatic construction of descending left-hand parsers. Auto Document Math Ling l6:4l-47, l982.
  • Aho AV and Ullman JD: Principles of Compiler Design. Addison-Wesley Publishing Co., Reading, Mass, l979.
  • Barnett GO, Justice NS, et al: COSTAR - A computer-based medical information system and data language for ambulatory practices. Computer l2:9-l7, l979.
  • Horowitz GL, Bleich HL: PaperChase: A computer program to search the medical literature. N Engl J Med 305:924-930, 1981.
  • Robboy SJ, Altshuler BS, Chen HY: Retrieval in a computer- assisted pathology encoding and reporting system (CAPER). Am J Clin Pathol 75:654-661, 1981.
  • Wong RL, Reno JD, Hain TC, Platt RC, Gaynon PS, Joseph DM: Profile of a dictionary compiled from scanning over one million words of surgical pathology narrative text. Comput Biomed Res 13:382-398, 1980.
  • Salton G: Dynamic Information and Library Processing. Prentice-Hall, Inc, Englewood Cliffs, NJ, 1975.
  • Giere W: Foundations of clinical data automation in cooperative programs. 5th Ann Symp Comp Applic Med Care (Heffernan HG, ed). Washington, DC, 1981, pp. 1142-1148.
  • Zhang Q: Easy entry of Chinese character set symbols. Proc 5th Ann Symp Comp Appl Med Care (Heffernan HG, ed). Washington, D.C., 1981, pp 143-149.
  • Zipf GK: On the Economy of Words. Chapter 2 in, Human Behavior and The Principle of Least Effort. An Introduction to Human Ecology. Addison-Wesley Press, Inc., Cambridge, MA, 1949, pp. 19-55.
  • Condon EU: Statistics of vocabulary. Science 67:300, 1928.
  • Josselson HH: Automatic translation of languages since 1960: A linguist's view. Adv Comput 11:1-58, 1971.
  • Joshi AK. Natural language processing. Science 253:1242-1249.
  • Aho AV, Ullman JD: Principles of Compiler Design. Addison-Wesley Publishing Co., Reading, Mass, 1979.
  • Sager N, Grisham R: The restriction language for computer grammars of natural language. Commun ACM 18:390-400, 1975.
  • Hirschman L, Story G, Marsh E, Lyman M and Sager N: An experiment in automated health care evaluation from narrative medical records. Comput Biomed Res 14:447-463, 1981.
  • Story G, Hirschman L: Data base design for natural language medical data. J Med Syst 6:77-88, 1982.
  • Sager N, Bross IDJ, Story G, Bastedo P, Marsh E, Shedd D: Automated encoding of clinical narrative. Comput Biol Med 12:43-56, 1982.
  • Horowitz GL and Bleich HL: PaperChase: A computer program to search the medical literature. N Engl J Med 305:924-930, 1981.
  • Robboy SJ, Altshuler BS, Chen HY: Retrieval in a computer- assisted pathology encoding and reporting system (CAPER). Am J Clin Pathol 75:654-661, 1981.
  • Ledley RS and Lusted LB: Reasoning foundations of medical diagnosis. Science 130:9-21, 1959.
  • Kayser K: Logic and diagnosis. Methods Inf Med 14:76-80, 1975.
  • Vasconcellos M: Overview: machine translation. BYTE 18:152-166, 1993.
  • Hovy E: How MT works. BYTE 18:167-176, 1993. Resource guide: machine-translation software. BYTE 18:185-186, 1993.
  • Pogson G: Controlled English: Enlightenment through Constraint. Language Technology 6:22-25, 1988.
  • Ogden CK: The System of Basic English. New York, 1934.
  • Schwanke M: Machinelle Uebersetzung. Ein Ueberblick ueber Theorie und Praxis. Berlin, Heidelberg, New York: Springer-Verlag, 1991, pp. 91-95.
  • Woods W: Transition network grammars for natural language analysis. Commun Assn Comp Mach 13:591-606, 1970.
  • Vasconcellos M: Overview: machine translation. BYTE 18:152-166, 1993.
  • Nirenburg S (ed): Machine Translation. Theoretical and Methodological Issues. London: Cambridge University Press, 1987.
  • Vasconcellos M, Leon M: SPANAM and ENGSPAN: Machine translation at the Pan American Health Organization. Comput Linguist 11:122-136, 1985.
  • INTERNET: The Oxford Text Archive: ftp://ota.ox.ac.uk. A fantastic collection of various wordlists, including an 8 megabyte unabridged dictionary; various English language lists (British, American, Australian, Shakespeare), as well as lists in Afrikaans, Chinese, Croatian, Danish, Dutch, Esperanto, Finnish, French, German, Hindi, Hungarian, Italian, Japanese, Latin, Norwegian, Polish, Russian, Spanish, Swahili, Swedish, Yiddish.
  • Vasconcellos M: Overview: machine translation. BYTE 18:152-166, 1993.
  • Hovy E: How MT works. BYTE 18:167-176, 1993.
  • Resource guide: machine-translation software. BYTE 18:185-186, 1993.
  • King M: Are There any Lessons to be Learned from Machine Translation? In, Scherrer JR, Ct, RA, and Mandil SH (eds.), Computerized Natural Medical Language Processing for Knowledge Representation. Amsterdam: Elsevier Science Publishers B.V. pp. 73-82, 1989.
  • Nirenburg, S. (ed.): Machine Translation. Theoretical and Methodological Issues. London: Cambridge University Press, 1987.
  • Hutchins WJ: Machine Translation: Past, Present, Future. Chichester: Ellis Horwood Ltd, 1986.
  • Vasconcellos M and Leon M: SPANAM and ENGSPAN: Machine translation at the Pan American Health Organization. Comput Linguist 11:122-136, 1985.
  • Tucker A: A perspective on machine translation: theory and practice. Commun Assn Comp Mach 27:322-329, 1984.
  • Davis RG: FileMan: A User Manual. Bethesda, MD: National Association of VA Physicians, 1987.
  • Woods W: Transition network grammars for natural language analysis. Commun Assn Comp Mach 13:591-606, 1970.
  • Barnett GO, Cimino JJ, Hupp JA, Hoffer EP: DXplain, an evolving diagnostic decision support system. JAMA 258:67-74, 1987.
  • Zipf GK: On the economy of words. In, Human Behavior and the Principle of Least Effort, ch. 2, p.19. Cambridge, MA: Addison-Wesley Press, Inc., 1949.
  • Moore GW, Boitnott JK, Miller RE, Eggleston JC, Hutchins GM: Integrated pathology reporting, indexing, and retrieval system using natural language diagnoses. Modern Pathol 1:44-50, 1988.
  • Giere W: Foundations of clinical data automation in cooperative programs. 5th Ann Symp Comp Applic Med Care (Heffernan HG, ed.) Washington, DC: IEEE Computer Society Press, 1981. pp. 1142-1148.
  • Zhang Q: Easy entry of Chinese character set symbols. 5th Ann Symp Comp Applic Med Care (Heffernan HG, ed.) Washington, DC: IEEE Computer Society Press, 1981. pp. 143-149.
  • Giere W: Befunddokumentation und Arztbriefschreibung im Krankenhaus. Taunusstein: Media, 1986.
  • Giere W: Foundations of clinical data automation in cooperative programs. 5th Ann Symp Comp Applic Med Care (Heffernan HG, ed.) Washington, DC: IEEE Computer Society Press, 1981. pp. 1142-1148.
  • Zhang Q: Easy entry of Chinese character set symbols. 5th Ann Symp Comp Applic Med Care (Heffernan HG, ed.) Washington, DC: IEEE Computer Society Press, 1981. pp. 143-149.
  • Tucker A: A perspective on machine translation: theory and practice. Commun Assn Comp Mach 27:322-329, 1984.
  • Vasconcellos M and Leon M: SPANAM and ENGSPAN: Machine translation at the Pan American Health Organization. Comput Linguist 11:122-136, 1985.
  • Hutchins WJ: Machine Translation: Past, Present, Future. Chichester: Ellis Horwood Ltd, 1986.
  • Moore GW, Riede UN, Polacsek RA, Miller RE, and Hutchins GM: Automated translation of German to English medical text. Am J Med 81:103-111, 1986.
  • Davis RG: FileMan: A User Manual. Bethesda, MD: National Association of VA Physicians, 1987.
  • Nirenburg, S. (ed.): Machine Translation. Theoretical and Methodological Issues. London: Cambridge University Press, 1987.
  • Moore GW, Boitnott JK, Miller RE, Eggleston JC, Hutchins GM: Integrated pathology reporting, indexing, and retrieval system using natural language diagnoses. Modern Pathol 1:44-50, 1988.
  • Wilks Y: A preferential, pattern-seeking, semantics for natural language inference. Artif. Intel. 6:53-74, 1975.
  • Arnold D: General view of the design methodology. In: Multilingua 5(1986)3: special issue. HL Somers (ed.) Amsterdam: Mouton de Gruyter. EUROTRA Special Issue, pp. 136-139.
  • King M: Are there any lessons to be learned from machine translation? Proceedings of the International Working Conference, World Health Organization, Geneva, Switzerland, 1988, pp 2.1-2.10.
  • Offerhaus GJA, Tersmette AC, Moore GW, Hershey J, Polacsek RA: Dutch respelling rules for English and German medical word lists. Method. Inform. Med. 26:99-103, 1987.
  • Yu C-Y, Moore GW, Unschuld PU: Romanized Chinese respelling rules for an English medical word list. Proceedings from SCAMC, Symposium on Computer Applications in Medical Care, Washington, DC, November 1-4, 1987, pp 133-136.
  • Wingert F. Lecture notes in Medical Informatics. In, Medical Informatics. An Introduction (Lindberg DAB & Reichertz PL (eds.)). Springer-Verlag, New York, 1981, pp 22-25.
  • McCray AT, Sponsler JL, Brylawski B, Browne A: The role of lexical knowledge in biomedical text understanding. Proceedings of the Eleventh Annual Symposium on Computer Applications in Medical Care Nov 1-4, 1987. Washington, D.C.: IEEE Computer Society Press. 1987 pp 103-107.
  • Josselson HH: Automatic translation of languages since 1960: A linguist's view. Adv Comput 11:1-58, 1971.
  • Chomsky N: Aspects of the Theory of Syntax. The Massachusetts Institute of Technology Press, Cambridge, Mass., 1965.
  • Chomsky N: Language and Mind. Harcourt Brace Jovanovich, New York, 1968.
  • Loh S-C: Machine translation: past, present, and future. ALLC Bulletin 4:l05-ll4, 1976.
  • Wilks Y: An intelligent analyzer and understander of English. Commun ACM l8:264-274, l975.
  • Wilks Y: A preferential, pattern-seeking, semantics for natural language inference. Artif Intel 6:53-74, l975.
  • Wilks Y: Frames for machine translation. New Scientist ??:802-803, 1977.
  • Hutchins WJ: Progress in documentation. Machine translation and machine-aided translation. J Document 34:119-159, 1978.
  • Garfield E.: Current comments. Artificial intelligence: Using computers to think about thinking. Part l. Representing knowledge. Curr Contents 49:5-17, 1983.
  • Schank RC, Childers PG: Experiments in Artificial Intelligence. Computerworld 10:1-28, 1984.
  • Knuth DE: A characterization of parenthesis languages. Inf Control 11:269-289, l967.
  • Bell JR: A new method for determining linear precedence functions for precedence grammars. Commun ACM l2:3l6-333, l969.
  • Earley J: An efficient context-free parsing algorithm. Commun ACM 13:94-102, 1970.
  • Aho AV, Johnson SC: LR parsing. Comput Surveys 6:99-,24, 1974.
  • Bochmann GV: Semantic evaluation from left to right. Commun ACM 19:55-62, 1976.
  • Pager D: A practical general method for constructing LR(k) parsers. Acta Inf 7:249-268, 1977.
  • Alekseev VD: Automatic construction of descending left-hand parsers. Auto Document Math Ling l6:41-47, 1982.
  • Aho AV, Ullman JD: Principles of Compiler Design. Addison-Wesley Publishing Co., Reading, Mass, l979.
  • Sager N, Grisham R: The restriction language for computer grammars of natural language. Commun ACM 18:390-400, 1975.
  • Hirschman L, Story G, Marsh E, Lyman M, Sager N: An experiment in automated health care evaluation from narrative medical records. Comput Biomed Res 14:447-463, 1981.
  • Story G and Hirschman L: Data base design for natural language medical data. J Med Syst 6:77-88, 1982.
  • Sager N, Bross IDJ, Story G, Bastedo P, Marsh E, Shedd D: Automated encoding of clinical narrative. Comput Biol Med 12:43-56, 1982.
  • Wolff S: The use of morphosemantic regularities in the medical vocabulary for automatic lexical coding. Methods Inf Med 23:195-203, 1984.
  • Fenichel RR and Barnett GO: An application-independent subsystem for free-text analysis. Comput Biomed Res 9:159-167, 1976.
  • Weiner JM: Text analysis and basic concept structures. Inf Proc Mgt 19:313-319, 1983.
  • Moore GW, Haupt HM, Hutchins GM: A hypothesis test for causal explanations in human pathology: Evaluation of pulmonary edema in 181 autopsied patients with leukemia. Math Biosci 62:253-279, 1982.
  • Moore GW and Hutchins GM: Symbolic logic analysis of congenital heart disease. Pathol Res Pract 171:59-85, 1981.
  • Moore GW, Hutchins GM: Symbolic logic analysis of cause of death in humans: Application to l08 patients after coronary artery bypass graft surgery. J Theor Biol 92:267-291, 1981.
  • Riede UN, Moore GW, Sandritter W: Symbolic logic as a new method in quantitative organelle pathology. Exp Mol Pathol 33:259-282, 1980.
  • Riede UN and Moore GW: Symbolic logic as a new method in pathobiology and toxicology. CRC Press, 1983.
  • Moore GW, Hutchins GM: Consistency versus completeness in medical decision making: Application to l55 patients autopsied after coronary artery bypass graft surgery. Med Inf 8:197-207, 1983.
  • Moore GW and Hutchins GM: A Hintikka possible worlds model for certainty levels in medical decision making. Synthese 48:89-119, 1981.
  • Moore GW, Hutchins GM, Bulkley BH: Certainty levels in the nullity method of symbolic logic: Application to the pathogenesis of congenital heart malformations. J Theor Biol 76:53-81, 1979.
  • Gladkij AV and Mel'cuk IA: Elements of Mathematical Linguistics. (Lehrberger, J., ed.). Mouton Publishers, Berlin, 1983.
  • Feigenbaum EA and McCorduck P: The Fifth Generation: Artificial Intelligence and Japan's Computer Challenge to the World. Addison Wesley, Reading, Ma., 1983.
  • Becker JD: Multilingual word processing. Sci Am 25l:96-107, 1984.
  • Walther von AJ: Translators gain fluency. Multilingual word processors point to universal communication. InfoWorld 6:35-37, 1984.
  • Bulkeley WM: Computers gain as language translators even though perfect not they always. Wall Street Journal, February 6, 1985, p 29.
  • Tucker AB, Jr: A perspective on machine translation: Theory and Practice. Commun ACM 27:322-329, 1984.
  • Jordan SR, Brown, AFR, Hutton FC: Computerized Russian translation at ORNL. J Am Soc Inf Sci 28:26-33, 1977.
  • Garcia-Hidalgo I, Dunham G: An experiment in English Spanish automated translation of medical language data. Methods Inf Med 20:38-46, 1981.
  • Hoffman La Roche (editor): Roche Lexikon Medizin, Urban & Schwarzenberg, Munich, l984.
  • Dunham GS, Pacak MG, Pratt AW: Automated indexing of pathology data. J Am Soc Inf Sci 29:81-90, 1978.
  • Platt RC, Wong RL, Lantner KW, Gaynon PS: Polars: A pathology on-line logging and reporting system. Comput Biomed Res 7:83-99, 1974.
  • Sharpe TC, Clark DE: General purpose information handling techniques for pathological data. Comput Biol Med 5:221-233, 1975.
  • Okubo RS, Russell WS, Dimsdale B, Lamson BG: Natural language storage and retrieval of medical diagnostic information. Experience at the UCLA Hospital and Clinics over a 10-year period. Comput Prog Biomed 75:105-130, 1975.
  • Frieder G and Slocum PJ: The lexical attributes of medical name files. Comput Biol Med 8:81-87, 1978.
  • Wong RL, Reno JD, Hain TC, Platt RC, Gaynon PS, Joseph DM: Profile of a dictionary compiled from scanning over one million words of surgical pathology narrative text. Comput Biomed Res 13:382-398, 1980.
  • Hieb BR: A word processor-based pathology reporting system. Am J Clin Pathol 75:357-363, 1981.
  • Aller RD: Computers in anatomic pathology. Clin Lab Med 3:133-147, 1983.
  • Coles EC, Slavin G: An evaluation of automatic coding of surgical pathology reports. J Clin Pathol 29:621-625, 1976.
  • Aller RD, Robboy SJ, Poitras JW, Altshuler BS, Cameron M, Prior MC, Miao S, and Barnett GO: Computer-assisted pathology encoding and reporting system (CAPER). An on-line computer system developed at the Massachusetts General Hospital. Am J Clin Pathol 68:715-720, 1977.
  • Foulis PR, Norbut AM, Mendelow H and Kessler, G.F.: Pathology accessioning and retrieval system with encoding by computer (PARSEC). Am J Clin Pathol 73:748-753, 1980.
  • Robboy SJ, Altshuler BS and Chen HY: Retrieval in a computer-assisted pathology encoding and reporting system (CAPER). Am J Clin Pathol 75:654-661, 1981.
  • Cote RA, Robboy S: Progress in medical information management: Systematized nomenclature of medicine (SNOMED). JAMA 243:756-762, 1980.
  • Blekhman MS: Formalization of inexplicit anaphoric links in systems for processing natural-language texts (with reference to English-language texts). Auto Document Math Ling 16:98-109, 1982.
  • Allen JF, Perrault CR: Analyzing intention in dialogues. Artif Intel 15:143-178, 1980.
  • Blois MS: Clinical judgement and computers. N Engl J Med 303:192-197, 1980.
  • Damerau FJ: A technique for computer detection and correction of spelling errors. Commun ACM 7:171-176, 1964.
  • Zamora EM, Pollock JJ and Zamora A: The use of trigram analysis for spelling error detection. Inf Proc Mgt 17:305-316, 1983.
  • Hall PAV and Dowling GR: Approximate string matching. Comput Surveys 12:381-402, 1980.
  • Peterson JL: Computer programs for detecting and correcting spelling errors. Commun ACM 23:676-687, 1980.
  • Durham I, Lamb DA and Saxe JB: Spelling correction in user interfaces. Commun ACM 26:764-773, 1983.
  • Pollack JJ, Zamora A: Automatic spelling correction in scientific and scholarly text. Commun ACM 27:358-368, 1984.
  • Humphrey SM: File maintenance of MeSH Headings in MEDLINE. J Am Soc Inf Sci 35:34-44, 1984.
  • Moore GW, Hutchins GM, Miller RE: Strategies for searching medical natural language text: Distribution of words in the anatomical diagnoses of 7000 autopsied patients. Am J Pathol 115:36-41, 1984.
  • Arbib MA and Manes EG: Monoids and Groups. Chapter 3 in, Arrows, Structures and Functors. The Categorial Imperative. Academic Press, New York, 1975.
  • Wos L, Overbeek R, Lusk E and Boyle J: Automated Reasoning. Introduction and Applications. Prentice Hall, Englewood Cliffs, New Jersey, 1984.
  • Clocksin WF, Mellish CS: Programming in Prolog. Springer-Verlag, Heidelberg, 1981.
  • Malpas J: Programming in logic. Dr Dobb's J l0:36-4l, l985.
  • von Hansemann D: Ueber das konditionale Denken in der Medizin und seine Bedeutung fr die Praxis. August Hirschwald Verlag, Berlin, 1912, pp. 1-31.
  • Sadegh-zadeh K: Grundlagenproblem einer Theorie der klinischen Praxis. Teil l: Explikation des medizinischen Diagnosebegriffs. Metamed 1:76-102, 1977.
  • Ledley RS, Lusted LB: Reasoning foundations of medical diagnosis. Science 130:9-21, 1959.
  • Warner HR, Toronto AF, Veasey LG, Stephenson R: A mathematical approach to medical diagnosis. Application to congenital heart disease. JAMA l77:177-183, 1961.
  • Stein MA, Winter J: Theory development in medical decision-making. Int J Biomed Comput 5:147-159, 1974.
  • Kayser K: Logic and diagnosis. Methods Inf Med 14:76-80, 1975.
  • Weiss SM, Kulikowski CA, Amarel S, Safir A: A model-based method for computer-aided medical decision making. Artif Intel 11:145-172, 1978.
  • Weiss SM, Kulikowski CA and Safir A: Glaucoma consultation by computer. Comput Biol Med 8:25-40, 1978.
  • Pauker SG, Gorry GA, Kassirer JP, Schwartz WB: Towards the simulation of clinical cognition. Taking a present illness by computer. Am J Med 60:981-996, 1976.
  • Szolovitz P, Pauker SG: Categorical and probabilistic reasoning in medical diagnosis. Artif Intel 11:115-144, 1978.
  • Shortliffe EH, Buchanan BG: A model of inexact reasoning in medicine. Math Biosci 23:351-379, 1975.
  • Shortliffe EH, Buchanan BG and Feigenbaum EA: Knowledge engineering for medical decision making: A review of computer-based clinical decision aids. Proc IEEE 67:1207-1224, 1979.
  • Duda RO and Shortliffe EH: Expert systems research. Science 220:261-268, 1983.
  • Pauker SG and Kassirer JP: The threshold approaches to clinical decision making. N Engl J Med 302:1109-1117, 1980.
  • Schwartz WB, Wolfe HJ and Pauker SG: Pathology and probabilities. A new approach to interpreting and reporting biopsies. N Engl J Med 305:917-923, 1981.
  • Miller RA, Pople HE Jr and Myers JD: Internist-I, an experimental computer-based diagnostic consultant for general internal medicine. N Engl J Med 307:468-476, 1982.
  • Miller RA: Internist-l/Caduceus: Problems facing expert consultant programs. Methods Inf Med 23:9-l4, l984.
  • Halpern JY and Rabin MO: A logic to reason about likelihood. IBM Research Report RJ 4313, 1984.
  • Zeman JJ: Modal Logic. The Lewis-Modal Systems, Oxford at the Clarendon Press, London, 1973.
  • Anderson JM and Johnstone HW Jr: Natural Deduction. The Logical Basis of Axiom Systems. Wadsworth Publishing Co, Inc., Belmont, CA, 1962.
  • Quine WV: Theory of Deducation, Harvard Cooperative Society, Cambridge, Mass, 1948, pp 65-81.
  • Quine WV: Methods of Logic. Henry Holt & Co, New York, 1950.
  • Quine WV: The problem of simplifying truth functions. Am Math Monthly 59:521-531, 1952.
  • Quine WV: A way to simplify truth functions. Am Math Monthly 62:627-631, 1955.
  • McCluskey EJ Jr: Minimization of Boolean Functions. Bell Syst Tech J 36:1417-1444, 1956.
  • Swets JA: Information retrieval systems. Science 141:245-250, 1963.
  • Lusted LB: Introduction to Medical Decision Making. Charles C Thomas, Springfield, Ill, 1968, pp. 98-140.
  • Lusted LB: Decision-making studies in patient management. N Engl J Med 284:416-424, 1971.
  • Swets JA: Signal Detection in Medical Diagnosis. In, Computer Diagnosis and Diagnostic Methods (Jacquez JA, ed). Charles C Thomas, Springfield, Ill, 1972, pp. 8-28.
  • Meadow CT: The Analysis of Information Systems. Second Edition. Melville Publishing Co., Los Angeles, CA, 1973.
  • Lancaster FW: Evaluating the Effectiveness of Information Services. Chapter 9, In, Information Retrieval Systems: Characteristics, Testing and Evaluation. Second Edition. John Wiley & Sons, New York, 1979, pp 121-139.
  • Date CJ: An Introduction to Database Systems. Addison-Wesley Publishing Co., Reading, Mass., 1975.
  • Greenes RA, Pappalardo AN, Marble CW, Barnett GO: Design and implementation of a clinical data management system. Comput Biomed Res 2:469-485, 1969.
  • Bowie J and Barnett GO: MUMPS - An economical and efficient time-sharing system for information management. Comput Programs Biomed 6:11-22, 1976.
  • Dayhoff R (editor): ANS MUMPS Programmers' Reference Manual 1981 and ANS MUMPS Language Standard. MUMPS Users Group, Washington, DC, 1981.
  • Watanabe T, Ohsawa T, Suzuki T: A simple database language for personal computers. Commun ACM 26: 646-653, 1983.
  • Barnett GO, Justice NS, et al: COSTAR - A computer-based medical information system and data language for ambulatory practices. Computer 12:9-17, 1979.
  • Horowitz GL and Bleich HL: PaperChase: A computer program to search the medical literature. N Engl J Med 305:924-930, 1981.
  • Robboy SJ, Altshuler BS, Chen HY: Retrieval in a computer-assisted pathology encoding and reporting system (CAPER). Am J Clin Pathol 75:654-661, 1981.
  • Hutchins WG: Machine Translation: Past, Present and Future. Ellis Horwood, 1986.
  • Arnold D: General view of the design methodology. In: Multilingua 5(1986)3: special issue. HL Somers (ed.) Amsterdam: Mouton de Gruyter. EUROTRA Special Issue pp. 136-139.
  • Evans DA, Hobbs JR, Miller PL, Carbonell JG: New directions for natural language processing in biomedicine. In: Proceedings of the Eleventh Annual Symposium on Computer Applications in Medical Care Nov 1-4, 1987.
  • Salton G: Dynamic Information and Library Processing. Prentice-Hall, Inc, Englewood Cliffs, NJ, 1975.
  • Canadian Government Terminology Bank TERMIUM. Booklet prepared by Terminology Directorate Translation Bureau, Dept. of the Secretary of State, Ottawa, CANADA, 1985.
  • Giere W: Foundations of clinical data automation in cooperative programs. 5th Ann Symp Comp Applic Med Care (Heffernan HG, ed). Washington, DC, 1981, pp. 1142-1148.
  • Zhang Q: Easy entry of Chinese character set symbols. Proc 5th Ann Symp Comp Appl Med Care (Heffernan HG, ed). Washington, D.C., 1981, pp 143-149.
  • Zipf, GK: On the Economy of Words. Chapter 2 in, Human Behavior and The Principle of Least Effort. An Introduction to Human Ecology. Addison-Wesley Press, Inc., Cambridge, MA, 1949, pp 19-55.
  • Condon EU: Statistics of vocabulary. Science 67:300, 1928.
  • Chen Y-S and Leimkuhler FF: A relationship between Lotka's Law, Bradford's Law, and Zipf's Law. J Am Soc Info Sci 37:307-314, 1986.
  • Giere W: Foundations of clinical data automation in cooperative programs. 5th Ann Symp Comp Applic Med Care (Heffernan HG, ed). Washington, DC, 1981, pp. 1142-1148.
  • Hall PAV and Dowling GR: Approximate string matching. ACM Comput Surv 12:381-402, 1980.
  • Moore GW, Riede UN, Polacsek RA, Miller RE, Hutchins GM: Automated translation of German to English medical text. Am J Med 81:103-111, 1986.
  • Robboy SJ, Altshuler BS and Chen HY: Retrieval in a computer-assisted pathology encoding and reporting system (CAPER). Am J Clin Pathol 75: 654-66l, 1981.
  • Taft RL: Name Search Techniques. Bureau of Systems Development, New York State Identification and Intelligence System, Albany, New York, 1984.
  • Zipf, GK: On the Economy of Words. Chapter 2 in, Human Behavior and The Principle of Least Effort. An Introduction to Human Ecology. Addison-Wesley Press, Inc., Cambridge, MA, 1949, pp. 19-55.
  • Sun M: Japanese scientific and technological literature Information: The demand in the U.S. remains low. Science 238:1032-1033, 1987.
  • Sun M.: Translation service throws in the towel. Science 239:1482, 1988.
  • Kayser K: Logic and diagnosis. Methods Inf Med 14:76-80, 1975.
  • Weiss SM, Kulikowski CA, Amarel S, Safir A: A model-based method for computer-aided medical decision making. Artif Intel 11:145-172, 1978.
  • Weiss SM, Kulikowski CA and Safir A: Glaucoma consultation by computer. Comput Biol Med 8:25-40, 1978.
  • Pauker SG, Gorry GA, Kassirer JP, Schwartz WB: Towards the simulation of clinical cognition. Taking a present illness by computer. Am J Med 60:981-996, 1976.
  • Szolovitz P, Pauker SG: Categorical and probabilistic reasoning in medical diagnosis. Artif Intel 11:115-144, 1978.
  • Shortliffe EH, Buchanan BG: A model of inexact reasoning in medicine. Math Biosci 23:351-379, 1975.
  • Shortliffe EH, Buchanan BG and Feigenbaum EA: Knowledge engineering for medical decision making: A review of computer-based clinical decision aids. Proc IEEE 67:1207-1224, 1979.
  • Duda RO, Shortliffe EH: Expert systems research. Science 220:261-268, 1983.
  • Pauker SG and Kassirer JP: The threshold approaches to clinical decision making. N Engl J Med 302:1109-1117, 1980.
  • Schwartz WB, Wolfe HJ and Pauker SG: Pathology and probabilities. A new approach to interpreting and reporting biopsies. N Engl J Med 305:917-923, 1981.
  • Blum RL: Discovery, confirmation, and incorporation of causal relationships from a large time-oriented clinical data base: The RX project. Comput Biomed Res 15:164-187, 1982.
  • Miller RA, Pople HE Jr and Myers JD: Internist-I, an experimental computer-based diagnostic consultant for general internal medicine. N Engl J Med 307:468-476, 1982.
  • Miller RA: Internist-l/Caduceus: Problems facing expert consultant programs. Methods Inf Med 23:9-14, 1984.
  • Halpern JY, Rabin MO: A logic to reason about likelihood. IBM Research Report RJ 4313, 1984.
  • Zeman JJ: Modal Logic. The Lewis-Modal Systems, Oxford at the Clarendon Press, London, 1973.
  • Anderson JM, Johnstone HW Jr: Natural Deduction. The Logical Basis of Axiom Systems. Wadsworth Publishing Co, Inc, Belmont, CA, l962.
  • Quine WV: Theory of Deducation, Harvard Cooperative Society, Cambridge, Mass, 1948, pp 65-81.
  • Quine WV: Methods of Logic. Henry Holt & Co, New York, 1950.
  • Quine WV: The problem of simplifying truth functions. Am Math Monthly 59:521-531, 1952.
  • Quine WV: A way to simplify truth functions. Am Math Monthly 62:627-631, 1955.
  • McCluskey EJ Jr: Minimization of Boolean Functions. Bell Syst Tech J 36:1417-1444, 1956.
  • Swets JA: Information retrieval systems. Science 141:245-250, 1963.
  • Lusted LB: Introduction to Medical Decision Making. Charles C Thomas, Springfield, Ill, 1968, p 98-140.
  • Lusted LB: Decision-making studies in patient management. N Engl J Med 284:416-424, 1971.
  • Swets JA: Signal Detection in Medical Diagnosis. In, Computer Diagnosis and Diagnostic Methods (Jacquez JA, ed). Charles C Thomas, Springfield, Ill, 1972, p 8-28.
  • Meadow CT: The Analysis of Information Systems. Second Edition. Melville Publishing Co., Los Angeles, CA, 1973.
  • Date CJ: An Introduction to Database Systems. Addison-Wesley Publishing Co., Reading, Mass., 1975.
  • Greenes RA, Pappalardo AN, Marble CW, Barnett GO: Design and implementation of a clinical data management system. Comput Biomed Res 2:469-485, 1969.
  • Bowie J and Barnett GO: MUMPS - An economical and efficient time-sharing system for information management. Comput Programs Biomed 6:11-22, 1976.
  • Moore GW, Miller RE, Hutchins GM. Indexing by MeSH titles of natural language pathology phrases identified on first encounter using the barrier word method. In: Scherrer JR, Cote RA, Mandil SH, eds. Computerized Natural Medical Language Processing for Knowledge Representation. Amsterdam: North-Holland; 1989: 29-39.

    CONFIDENTIALITY AND PRIVACY.

  • Schneier B. Applied Cryptography. Protocols, Algorithms, and Source Code in C. New York: John Wiley & Sons, Inc. 1996.
  • Berman JJ, Moore GW. SNOMED-Encoded surgical pathology databases: a tool for epidemiologic investigation. Modern Pathology 9:944-950, 1996.
  • Berman JJ, Moore GW, Hutchin GM. Maintaining patient confidentiality in the public domain internet autopsy database. Journal of the American Medical Informatics Association (JAMIA), Symposium Supplement, pp 328-332, 1996.
  • Clayton EW, Steinberg KK, Khoury MJ, Thomson E, Andrews L, Kahn MJ, Kopelman LM, Weiss JO. Informed consent for genetic research on stored tissue samples. JAMA 1996;1786-92.
  • RETURN TO BILL MOORE'S HOME PAGE. Return to Bill Moore's Home Page
  • ADDITIONAL INFORMATION.
  • For additional information, contact Bill Moore at:
    gwmoore@erols.com

  • Last Updated: July 3, 1999.