Readme

August 5, 2003
 Yusuke Kubota, Kyoko Takano, Camille Yamada & Yuka Tateisi

Contents

  1. Introduction
    1. Outline of Fixing Procedure
  2. Segmentation of Sentences
    1. Indicators of  Fixed  Segments
    2. Periods
    3. Parentheses and Commas
    4.  Quotations and Apostrophes
    5. Hypens and Minuses
    6. CANNOT
    7. Ratio
    8. Slashes
    9. Bibliographic Information
  3. Tagging POS
    1. Symbols for Aiding Manual POS-fixing
      1. "!" assigned by the tagger
      2. Automatically inserted alternatives POSes and comments
        1. Alternative POSes (word/POS1.pos2)
        2. Automatically inserted comments (word/POS[comment])
      3. Asterisks for marking technical terms
    2. Indicatiors of Fixed Tags
    3. JJ or NN
      1. Words to be tagged as JJ
      2. Words to be tagged as JJ or NN depending on the context (expect -ing  words)
      3. Words to be tagged as NN
    4. JJ or VBN
    5. JJ, NN or VBG
      1. "Binding"
      2. -ing  words other than "binding"
    6. Symbols
      1. Symbols to be tagged as NN(S)
      2. Symbols not to be tagged as NN(S)
    7. NN or NNP
    8. Idioms
    9. Latin Words etc
    10. Other Foreign Words
    11. Suffixes
    12. Acronyms
    13. NN(P) or NN(P)S
      1. Acronyms and abbreviations of units
      2. Other cases
    14. Slashes
    15. Mistakes in the original text
    16. Misc

1 Introduction

1.1 Outline of the fixing procedure

Words should be tagged according to the Penn Treebank Manual (henceforth PTB Manual) unless stated otherwise in this document. Annotators are given files in which words are automatically tagged by a POS-tagging program. You should fix (i) segmentation (tokenization) of words and sentences and (ii) POS tags assigned to words at the same time.

2 Segmentation of sentences and words

POS-tagged files should contain one sentence per line. Words in a sentence should be separated by spaces. Special attention should be paid to periods, parentheses, commas and quotations.

Any inconsistent segmentations of a given word in the same abstract should be left as they are unless the inconsistency occurs around hyphens, parenthesis and commas. Inconsistencies found around hyphens, parenthesis and commas should be fixed according to the principles stated in sections 2.3 and 2.5.

E.g:
HpaII and Hha I

"HpaII" and "Hha I" are inconsistently segmented here (one with a space and the other without one) but should be left as they are since there are no hyphens, parenthesis and commas concerned.

2.1 Indicators of fixed segmentations

When you have fixed the segmentation of a given word or sentence, put one of the following marks to indicate the fixed place for later reference.

unchanged (with confidence)
[no mark]

unchanged (without confidence)
(@ w1 w2 w3 @)
If you are uncertain about your segmentation of a given word or phrase, enclosed it with the marks "(@ and "@)"
cut
#
including cases of cutting one sentence into two as well as one word into two
concatenated
=
The POS-tagger sometimes drops a word due to some bugs. In that case, supplement the dropped word and put the mark "*" after the (POS-tagged) word in question.
dropped word filled
*

The "cut" mark should be inserted between the separated two words. The "concatenated" mark should be inserted immediately after the concatenated word.

Marks indicating places where something has been fixed such as "#" and "=" or those indicating uncertainty on the part of the annotator such as "@@" and "?" are introduced for the purpose of facilitating error checking at the final stage of POS-fixing. So, there is really no precise guideline for where to put them and where not to put them or the order among them when more than one marks are assigned to one word. Annotators should not be troubled with these questions and should make decisions according to their own judgements.

Some examples follow:

To cut one word into two:
abcde/NN ab/NN # cde/NN
abcdefgh/NN ab/JJ # cd/JJ # ef/JJ # gh/NN
To concatenate two words into one:
ab/JJ cd/NN abcd/NN =
ab/NN cd/NN ef/NN gh/NN abcdefgh/NN =
To fill in a dropped word:
B/NN! B/Rel/NN! *
To cut one sentence into two:
... by PCR./NN We have investigated ...

... by PCR/NN # ./.! #
We have investigated ...
To concatenate two sentences into one:
... 0.23/CD vs/NNS ./.
3.7/CD +/-/CC ...

... 0.23/CD vs./CC! = 3.7/CD +/-/CC ...

2.2 Periods

Sentence-final periods should be separated from the preceding words.

E.g.:
Nakazawa/NNP met/VBD Yatabe./NNP Nakazawa/NNP met/VBD Yatabe/NNP # ./.

Other periods should be concatenated with the preceding words.

E.g.:
Dr/NNP ./. Charcot/NNP Dr./NNP = Charcot/NNP

Note that when an acronym ending in a period (e.g. L.I.S.) occurs at the sentence-final position, the last period should also be separated from the word.

E.g.:
They/PRP are/VBP members/NNS of/I N L.I.S./NNP ... L.I.S/NN P # ./.

Segmentations of initials, even if they are inconsistent, should be left as they are. (E.g., "J.M . Folon" should not be fixed as "J.M.Folon"or"J . M . Folon". Likewise, "J . M . Folon" should not be fixed as "J.M . Folon" or whatever.) However, in cases like "R.and" where it is clear that the initial is mistakenly concatenated with the next word, the two words should be separated as in "R . and".

Note that periods are used for various purposes other than marking sentence boundaries, such as decimal points, part of EC numbers (e.g. EC.1.2.3.4) and part of substance names (e.g. PU.1). If you are uncertain whether or not to cut a sentence at a given period, refer to the original .txt file and keep the segmentation consistent with it.

When a sentence is incorrectly separated into two in the original .tx t file, the two sentences should be concatenated into one but the word with a period (e.g. E . in the following example) should not be concatenated with the next word in the corrected POS-tagged file.

E.g.:
E/FW ! ./. histolytica/FW !

E./FW ! # = histolytica/FW ! (not E.histolytica/FW)

2.3 Parentheses and commas

Parentheses and commas appearing inside substance names should be concatenated with preceding and following words.

E.g.:

Parentheses and commas appearing inside numerical expressions or formulas should also be concatenated with preceding and following words.

E.g.:

Other parentheses and commas should be separated from preceding and following words. Note in particular that parentheses enclosing abbreviations of substance names should be separated from preceding and following words.

E.g.:
Interleukin/NN (/( IL/NN )/) -6/CD

Tokens like "10(-4)/bp" should be regarded as a single expression designating a unit and hence should be segmented as a single word and tagged as NN. ("10(-4)" means 104)

2.4 Quotations and apostrophes

The rules in this section, except the first one, are tacitly observed in the Penn Treebank corpus, although they are not explicitly stipulated in the PTB manual.

2.5 Hyphens and minuses

Hyphens used as dashes should be separated from preceding and following words.

E.g.:
This-also profound-blah blah blah This # - # also profound # - # blah blah blah .

Words separated at hyphens inside them (e.g. "Hughling-Jackson") should be concatenated unless the fragment with the hyphen constitutes a conjunct (e.g. "up-"in"up- and downregulation"). Some examples of conjunctions follow:

Note that there are words ending with a minus. Examples of words ending with a minus are:

"-" used in expressions indication ranges or locations in genes or used as parts of substance names should all be concatenated with preceding and following words.

Expressions of the form "Substance Name (Abbreviation)-Number" should be segmented and tagged as "Substance Name/NN (/( Abbreviation/NN )/)# -Number/CD".

E.g.:
interleukin (IL)- 6 interleukin/NN (/ ( IL/NN )/) # -6/CD
Similar example:
Nuclear Factor (NF)-kapp a B Nuclear/JJ Factor/NN (/ ( NF/NN )/) # -kappa/NN B/NN

When "-" follows ")" in expressions of the form "Substance1 (XXX)-and Substance2 (YYY) -XXXed", (where "XXX" and "YYY" supply some information related to Substance1 and Substance2, respectively) the hyphen between ")" and "and" should be separated from preceding and following words and should be tagged as ":".

E.g.:
lipopolysaccharide/NN (/( LPS/NN )/) -/: and/CC N-formyl-methionyl-leucyl-phenylalanine/NN (/( FMLP/NN )/) -stimulated/JJ

A space appearing inside a part of a substance name enclosed by parentheses should be converted to an under bar ("_"). For example, "1-(5-Isoquinoliny l sulfonyl)-2-methylpiperazine" should be converted to "1-(5-Isoquinolinyl_sulfonyl)-2-methylpiperazine". Spaces outside parentheses should be left as they are unless they are immediately preceded or followed by a hyphen.

2.6 CANNOT

"cannot" should be split into "can" and "not" in accordance with the PTB Manual and should be tagged as "can/MD not/RB".

2.7 Ratios

"Substance Name:Substance Name" should be segmented as a single word and tagged as NN.

E.g.:
colombus/NN :/: milk/NN =/JJ 1:3/CD colombus:milk/NN = =/JJ 1:3/CD
"Number:Number" should be segmented as a single word and tagged as CD.
E.g.:
1/CD :/ : 3/CD 1:3/CD =

2.8 Slashes

Parentheses and slashes in expressions of the form "Substance1 (XXX)/Substance2 (YYY)"(where "XXX" and "YYY" supply some information related to Substance1 and Substance2, respectively) should all be separated from preceding and following words. In this case, "/" should be tagged as ":".

E.g.:
cAMP response element binding protein ( CREB ) //: activating transcription factor- 1 ( ATF- 1 )

2.9 Bibliographic information

Parentheses and semicolons appearing as part of bibliographic information that are concatenated with preceding and following words in the original .txt file should be separated from preceding and following words.

E.g.:
Immunity 1999 Mar;10(3):39 9 ... Mar/NNP ;/: 10/CD (/ ( 3/CD )/) :/: 399/CD

However, expressions indicating ranges of pages should be treated as a single unit and should be tagged as CD (e.g. "399-420/CD"). This is in accordance with the rule that ranges should be segmented as a single word (section 2.5). Pointers to references like "[1]" and "(1)" appearing at the end of a sentence should be segmented and tagged like "[/ ( 1/C D ]/)".

3 Tagging POS

Technical terms, Latin words and idiomatic expressions that frequently appear are automatically assigned correct tags according to the Term List. Hence, if you are uncertain whether to fix the tags of such words, you should rather leave them as they are. When you are uncertain about the tags of ordinary words, try to determine them by the following procedure:

  1. Follow the guidelines in this document. If it doesn't help,
  2. follow the guidelines in the PTB Manual. If it doesn't help,
  3. consult the Longman Dictionary of Contemporary English (LDOCE), 3rd edition. If it doesn't help,
  4. consult Kenkyusha's English-Japanese Dictionary for the General Reader, 2nd edition. If you are still uncertain,
  5. refer to previous corpus. Bear in mind that you can refer to previous corpus only for the purpose of confirming some rule or agreement that you have become unsure of. If you want to refer to previous corpus for other purposes, consult other annotators and persons concerned beforehand. If you are still uncertain,
  6. send an email to other annotators and persons concerned. The WSJ Corpus might be consulted at this stage if considered appropriate.
  7. Discuss with other annotators and persons concerned to establish an agreement.

3.1 Symbols for aiding manual POS-fixing

Various symbols are inserted to the POS-tagged file for the purpose of aiding the manual POS- fixing procedure. The symbols can be classified into two broad categories: (i) those marking words whose POS tags don't have to be checked and (ii) those marking words whose tags should be determined by specific principles.

This section describes in detail what these symbols mean and what they do not mean. (Note that correct POS-fixing is impossible when you are unsure how to read such symbols. Since some of the symbols are inevitably quite confusing and misleading, even an experienced annotator should pay close attention not to misunderstand the meanings of these symbols.)

3.1.1 "!" assigned by the tagger

POS tags already followed by "!" (e.g. "NF-kappaB/NN!", "interleukin-1/NN!") are assigned by the POS tagger according to the Term List. Hence, you don't usually have to fix such tags. However, if you find any apparent errors with these tags, correct them and put the mark "!!" (if you are confident about your correction) or "!?" (if you are not confident). If the error in question is a simple careless mistake that can be easily fixed, report it immediately by email. Also, when you find words that should be registered in the PREPOS list but is not yet registered, report it by email.  Special attention should be paid to the following cases:

  1. past participles tagged uniquely as VBN by the POS tagger1
    A past participle tagged uniquely as VBN by the tagger should nevertheless be tagged as JJ when it appears between nominal expressions, and should be tagged as VBD when it is used as a past main verb.
    E.g.:
    We/PRP investigated/VBD how/WRB these/DT stimuli/NNS affect/VBP mitogen/NN activated/JJ!! protein/NN (/( MAP/NN )/) kinases/NNS ./.
    Thus/RB ,/, we/PRP demonstrated/VBD that/IN IL-5/NN activated/VBD!! the/DT Jak/NN 2-STAT/NN 1/CD signaling/NN pathway/NN in/IN eosinophils/NNS ./.
  2. the POS tags of `s'
    `s' is tagged as NNS by the POS tagger (since it should be so tagged in vast majority of cases). However, it should be tagged as VBZ when it is used as a third person singular present suffix, and as NN when it abbreviates for "second" (unit).

3.1.2 Automatically inserted alternatives

POSes and comments Letters followed by a tilde `~' or a comment [...], inserted immediately after the POS tag by the script tilde.pl, calls attention to correct POS tags. Comments and signs inserted by tilde.p l are all removed by the script rmtilde.p l after the manual POS-tagging process is completed.

3.1.2.1 Alternative POSes (word/POS1.pos2)

Some words should be assigned different POS tags in accordance with the contexts in which they appear. In such cases, a candidate alternative POS pos2 is sometimes inserted in the lower case preceded by a tilde after the original POS POS1. In other words, word/POS1.pos2 means that although word was originally tagged as POS1 by the tagger, it might as well be pos2. The correct POS tag should be determined by the annotator according to the context.

E.g.:
upstream/RB~jj This means that the POS of `upstream', which was originally tagged as RB, might be JJ.
3.1.2.2 Automatically inserted comments (word/POS[comment])

When a comment [comment] is inserted immediately after the POS tag, fix the POS according to it.

3.1.3 Asterisks for marking technical terms

Some words are surrounded by concatenations of two asterisks (e.g. **TNFRI**/NN). As for such words, you don't have to care about whether the POS is correct or not.

Asterisks are attached to words which do not appear in ordinary dictionaries. As a result, many of them are technical terms. However, asterisks are eliminated from words which are composed of combinations of more than one elements of 1-3 (e.g. E1A, PCOS, LTalpha), because they might be NNS. Instead, the comment [NNS?] is attached to such words.2

  1. a capital letter
  2. an English equivalent of a Greek alphabet
  3. a numeral
Asterisks will not be attached to simple English equivalents of Greek alphabets such as `alpha', either.

3.2 Indicators of fixed tags

Each POS tag should be followed by one of the following marks:

unchanged 
(with confidence) [no mark]
unchanged (without confidence) @
fixed (with confidence)  !
fixed (without confidence) ?
pending
^

Note that

3.3 JJ or NN

3.3.1 Words to be tagged as JJ

3.3.2 Words to be tagged as JJ or NN depending on the context (except -ing words)

3.3.3 Words to be tagged as NN

3.4 JJ or VBN

Apply the following rules with priority given to higher items over lower ones.3
  1. When a given past participle modifies a nominal expression and is preceded by a nominal expression modifying it or is hyphenated, it should be tagged as JJ. (From PTB Manual)
    E.g.:
    • 10/CD genetic/JJ probe-based/JJ tests/NNS
    • a/DT government-approved/JJ safety/NN seat/NN
    • airline/NN leveraged/JJ buy-outs/NNS
    • Thomson/NN consolidated/JJ earnings/NNS
  2. When a given past participle is a complement of the be-verb or modifies a preceding nominal expression and is followed by a by-phrase, it should be tagged as VBN. (From PTB Manual)
  3. If the past participle is followed by "state", "form", "pattern" or "fashion", it should be tagged as JJ.
  4. Expressions of the form "nonXXX", "unXXX" or "inXXX" with the meaning "opposite of XXX" or "not XXX" should be tagged as JJ.
  5. Expressions of the form "preXXX" (in which the prefix "pre" carries the meaning "before") should be tagged as JJ.
  6. Refer to the PREPOS list.
  7. Refer to the ED list that is appended to this document. The POS of a word that doesn't appear in the ED list should be determined by the following criteria. When the POS is determined, the word will be added to the ED list if it is considered to be appropriate. (As usual, among the following criteria, priority should be given to higher items over lower ones.)
    1. Words that should be assigned different tags for different meanings:
      • disturbed, depressed: when used to refer to a certain human mental condition, should be tagged as JJ; otherwise, should be tagged as VBN.
      • cripped, inflamed: when used to refer to a condition of (the body of) a living thing, should be tagged as JJ; otherwise, should be tagged as VBN.
    2. -ed words whose base forms are unambiguously intransitive should be tagged as JJ.
    3. -ed words that has no corresponding base verb form (e.g. threaded) should be tagged as JJ.
    4. If the past participle is an expression that refers to a position or place (e.g. located, positioned), it should be tagged as JJ.
    5. If the past participle refers to a result of a chemical reaction (e.g. phosphorylated, ionized; activated, inhibited), it should be tagged as VBN.
    6. -ed words with adjective entries in LDOCE should be tagged as JJ as long as the meanings are not significantly different from the dictionary definitions.
    7. If the past participle refers to an event it should be tagged as VBN; if it refers to a state, it should be tagged as JJ. (From PTB Manual)
    8. -ed words with prefixes should generally be tagged in the same way as those without prefixes. (E.g.: coactivated and colocated should be tagged in the same way as activated and located, respectively; the former should generally be tagged as VBN and the latter should in general be tagged as JJ. Note, however, that there are some exceptions to this generalization, e.g., related/J J vs. correlated/VBN.)
  8. If "XXXed NN" can be paraphrased as "NN that is (are) XXXed", "XXXed" should be tagged as VBN. If not, it should be tagged as JJ.
  9. 9. If you feel uncertain or if you want to refer to the WSJ Corpus, report the case by email. Decisions will be made by discussion among annotators and persons concerned.

3.5 JJ, NN or VBG

3.5.1 "Binding"

The POS of "binding" should be determined by the following procedure. (Priority should be given to higher rules over lower ones.) If you are uncertain about your decision, put the mark "^" immediately after the POS tag as in "NN^". (If you cannot determine the POS of a given instance of "binding" uniquely, you should specify the (logically and contextually) possible alternatives by the vertical bar notation as in "NN|VBG" and put "^" after the POS tag. If you cannot even specify tags like "NN|VBG", just put "^" on the word. If you can determine the tag uniquely with confidence, "^" is not necessary.)

  1. Follow the ING list and the PREPOS list.
  2. When "binding" is hyphenated, apply the guidelines about hyphenated words in this document.
    E.g.:
    To gain a better understanding of the mechanism of binding to the human mineralocorticoid receptor (hMR), we developed a new monoclonal antibody (mAb) raised against the hormone-binding/JJ domain (HBD).
  3. When "binding" is in the head-of-subject position without any preceding determiner, possessive pronoun ("its"), adjective or adverb, it should be tagged as NN.
    E.g.:
    this region is also involved in NFAT binding to activated calcineurin, since binding/NN is inhibited by an NFAT peptide spanning the calcineurin docking site on NFAT.
  4. When "binding" has a dependent determiner or adjective as a preceding word, it should be tagged as NN.
    E.g.:
    In gel retardation analysis, FK506 had little effect on the binding/NN to the AP-1 site of PMA/ionomycin-induced nuclear factors, which were recognized with anti-JunD or c-Fos antibody.
  5. When "binding" is immediately followed by an of -phrase, it should be tagged as NN.
    E.g.:
    When we analyzed the nature of STAT proteins capable of binding to IL-2Ralpha, pim-1, and IRF-1 GAS elements after cytokine stimulation, we observed binding/NN of STAT1, STAT3, and STAT4, but not STAT5 to all of these elements.
  6. When "binding" functions as an object of an immediately preceding preposition, it should be tagged as NNVBG. However, if "binding" appears in the expression "capable of binding", it should be tagged as VBG.
    E.g.:
    We propose that HIV-1 tat may inhibit MIP-1 alpha expression by inducing MNP-1 expression in T-cells, probably by either competing with MNP-2 for binding/NN|VBG^ to the MIP-1 alpha promoter or by sequestering it into inactive forms.
  7. When "binding" is immediately preceded by an NN(S),

3.5.2 -ing words other than "binding"

Apply the following rules with priority given to higher items over lower ones.
  1. When a given present participle modifies a nominal expression and is hyphenated, it should be tagged as JJ. (From PTB Manual)4 Note that when the present participle in question is used in apposition to the following noun as in "DNA-binding/NN activity"5, it should not be regarded as a prenominal modifier, and hence, this rule should not be applied. 
  2. When "NN XXing" is a prenominal modifier or behaves as a nominal expression, "XXing" should be tagged as NN. (From PTB Manual)
  3. Refer to the PREPOS list.
  4. In some cases, the POS is determined by the following noun. . -ing/NN {activity, affinity, capacity, characteristics, effect, event, property, function, mechanism}
    When the -ing word is in apposition to the following noun, it should be tagged as NN. -ing words are often used appositively when followed by one of the nouns listed above. However, we have found one instance of "-ing event" in which the -ing word is not in apposition to the following noun (i.e. event). Hence, be careful to consider the context when applying this rule. . -ing/JJ {form, pattern, fashion, state} (the same as "-ed/JJ form", etc.)
  5. Refer to the ING list that is attached to this document.
  6. -ing words with adjective entries in LDOCE should be tagged as JJ as long as the meanings are not significantly different from the dictionary definitions.
  7. Expressions of the form "preXXXing" in the prenominal position should be tagged as JJ. It is not likely to appear as a complement of the be-verb.
  8. If "NN1 XXing NN2" can be paraphrased as "NN1 that XXs NN2", "XXXing" should be tagged as VBG. (From PTB Manual)
  9. If "XXing NN" can be paraphrased as "NN that XX(es) (something)", "XXXing" should be tagged as VBG. (From PTB Manual)
  10. If you encounter words that you have not seen before and feel uncertain about its POS, report the case by email. Decisions will be made by discussion among annotators and persons concerned.

3.6 Symbols

SYM will not be used. Symbols should be assigned ordinary tags according to their grammatical functions. (In most cases, it should be tagged as NN or NNS.)

3.6.1 Symbols to be tagged as NN(S)

Units composed of alphabets (e.g. nM, microM). The grammatical number should agree with the number of the numerator if the whole unit is in the form of a fraction. . English equivalences of Greek letters (e.g. alpha, beta).

3.6.2 Symbols not to be tagged as NN(S)

3.7 NN or NNP

Nominal expressions should be tagged as NN unless they are clearly proper names. More specifically, all nominal expressions should be tagged as NN except those listed below: Note that names of diseases and the like that are taken from names of persons (e.g. "Down" in "Down syndrome") should be tagged as NN. "Person Name-Person Name" should be treated in the same way as NN-NN (e.g. "Epstein-Barr/JJ virus"). "Southern", "Northern", "Western" and "Northwestern" in the position of X in "X blotting" (experiment names) should be tagged as NN. It should be noted that words that begin with capital letters are not necessarily proper names (e.g. Copyright/NN 1997/CD Academic/NNP Press/NNP ./.).

3.8 Idioms

If you find a new idiomatic expression, report it by email. The tag will be determined according to the WSJ Corpus.

3.9 Latin words, etc.

The tags of Latin words are also determined for each item. In principle, the tag of a given Latin word is determined by the following criteria:

3.10 Other foreign words

Tags of other foreign words are basically determined for each specific word.

3.11 Suffixes

Hyphens should in principle be concatenated with preceding and following words but should not be concatenated with conjunctions and parentheses enclosing abbreviations.
E.g.:
In this case, the word beginning or ending with the hyphen (i.e. "XXX-" or "-YYY" in the above examples) should be tagged as if it were not prefixed or suffixed with the hyphen.
E.g.:
However, "-XXXed" in expressions like: "Noun1/NN (/( Abbreviation of Noun1/NN)/)-XXXed/?? Noun2/NN" should be tagged as JJ even if "-XXXed" is a word to be tagged as VBN by default. (This is to keep consistent with cases where the abbreviation enclosed by the parentheses does not appear. In such cases, the segmentation and tags goes like "Noun1-XXXed/JJ Noun2/NN".)
E.g.:
lipopolysaccharide/NN (/( LPS/NN )/) -induced/JJ TF/NN expression/NN
Also, plural suffixes that stand alone as single words in expressions like "factor/NN (/( s/NNS )/)" ("factor(s)" in the original text) should be tagged as NNS. Third person singular present suffixes should be tagged as VBZ as in "play/VBP (/( s/VBZ )/)".

3.12 Acronyms

The POS of an acronym should conform to the POS of the original form spelt out in the same abstract if there is any.
E.g.:
TI - Regulation of fas-ligand expression
"fas-ligand" is a hyphenated prenominal word and is tagged as JJ. Thus, "FasL", the abbreviation thereof, should also be tagged as JJ. Acronyms whose original forms cannot be found in the same abstract should by default be tagged as NN.
For the distinction between NN or NNS for the POS of an abbreviation of a nominal expression, refer to section 3.13.

3.13 NN(P) or NN(P)S

3.13.1 Acronyms and abbreviations of units

Both acronyms and abbreviations of units which retain their original plural suffixes should be tagged as NNS (e.g. EBVs/NNS, hrs/NNS).

Acronyms and abbreviations of units which do not have plural suffixes should be tagged according to different criteria described below.

Acronyms
It is difficult to determine the grammatical number of acronyms of substance names and the like. But they generally fall under one of the categories listed below. Be careful to judge which of the following rules can be applied to specific cases you are dealing with. If none of the following rules are applicable to the case you are dealing with, report it by email.
Abbreviations of units
In contrast to acronyms, abbreviations of units which have lost their original plural suffixes should simply be tagged as NN even if their original forms can be clearly detected to be plural.
E.g.:
base pairs ( bp/NN )
Even when preceded by numbers larger than 1, abbreviations of units that do not have plural suffixes should be tagged as NN (e.g. 2/CD hr/NN).

3.13.2 Other cases

3.14 Slashes

Tentatively, slashes should not be separated from preceding and following words except cases in which it is explicitly stated that they should stand alone as independent words. Words concatenated by slashes should be tagged according to the following guidelines: The treatment of slashes will be examined later and the segmentation and tags of words containing slashes will be fixed according to the principles that will be established then.

3.15 Mistakes in the original text

Sentences with mistakes that are already found in the original .txt file should be prefixed with "@@". Some common errors are listed below:

3.16 Misc


Notes

  1. This is done so for -ed words whose POS tags are almost always VBN in order to reduce the amount of manual POS-fixing.
  2. However, for a word that can never be NNS, asterinsks should be left as they are even if the word in question matches the pattern described here (e.g. **NF-kappaB**/NN). If you find asterisks missing from such words, send an email to other annotators and persons concerned.
  3. Strictly speaking, not all of the following rules conflict with one another. So, it is not necessary to specify the order among them as rigorously as is done below. For example, the rules 1, 2 and 3 cover distinct cases, which means that the ordering among them is completely irrelevant. In general, the order among them should be something like "[1,2,3] > [4,5] > 6 > 7 > 8". In what follows, however, all the rules are ordered just to maintain the format consistent.
  4. This rule overrides the next one. Hence, when "NN XX-YYing" in "NN XX-YYing NN" is a prenominal modifier, "XX-YYing" should be tagged as JJ.
  5. Incidentally, "DNA-binding" should be tagged as JJ in vast majority of other cases. E.g.: DNA-binding/JJ {region, sites, protein, complex}
  6. Note that not all instances of "I" are roman numerals.
  7. When followed by a closing parenthesis like "1)", they should be segmented and tagged like "1/LS )/)".
  8. Incidentally, note that although the number of the denominator is plural in this example, which is considered to be a mistake on the part of the author, this is entirely irrelevant to the number of the whole expression.

-ed

a-e f-j k-o p-t u-z

Prenominal After "be" without "by-"
(co)cultured VBN VBN
(co)localized JJ JJ
abolished VBN VBN
accepted VBN VBN
accumulated VBN VBN
acetylated VBN VBN
achieved VBN VBN
acquired VBN VBN
activated VBN VBN
added VBN VBN
administered VBN VBN
admitted VBN VBN
advanced JJ ??
affected VBN VBN
aged JJ JJ
altered JJ? VBN?
amplified VBN VBN
analysed VBN VBN
analyzed VBN VBN
anchored VBN VBN
anergized VBN VBN
anucleated JJ JJ
applied VBN VBN
arranged VBN VBN
arrested VBN VBN
ascribed VBN VBN
assessed VBN VBN
assigned VBN VBN
associated VBN VBN
assumed VBN VBN
attenuated VBN VBN
augmented JJ JJ
balanced JJ JJ
barbed JJ JJ
believed VBN VBN
biotinylated VBN VBN
blocked VBN VBN
blunted VBN VBN
born JJ JJ
bound VBN VBN
butylated VBN VBN
called VBN VBN
capped JJ ??
changed VBN VBN
characterized VBN VBN
charged JJ JJ
classified VBN VBN
cloned VBN VBN
co-transfected VBN VBN
coexpressed VBN VBN
coimmunoadsorbed VBN VBN
coinfected VBN VBN
combined JJ VBN
commited VBN VBN
compared VBN VBN
compensated VBN VBN
composed VBN VBN
comprised VBN VBN
computerized VBN VBN
concluded VBN VBN
conditioned JJ JJ
conferred VBN VBN
confined VBN VBN
confirmed VBN VBN
conserved VBN VBN
considered VBN VBN
consisted VBN VBN
constructed VBN VBN
contained VBN VBN
continued JJ VBN
controlled JJ VBN
converted VBN VBN
coordinated VBN VBN
correlated VBN VBN
coselected VBN VBN
cotransfected VBN VBN
coupled JJ VBN
created VBN VBN
crosslinked VBN VBN
cultured VBN VBN
debulked VBN VBN
decreased VBN VBN
defined VBN VBN
degraded VBN VBN
delayed VBN VBN
deleted JJ VBN
delimited VBN VBN
demonstrated VBN VBN
dephosphorylated VBN VBN
derived VBN VBN
described VBN VBN
deseased JJ JJ
designated VBN VBN
designed VBN VBN
detailed JJ VBN
detected VBN VBN
determined VBN VBN
developed VBN VBN
differentiated VBN VBN
diminished VBN VBN
directed VBN VBN
discussed VBN VBN
disordered VBN VBN
displayed VBN VBN
disrupted VBN VBN
dissociated VBN VBN
diverged JJ JJ
documented VBN VBN
downregulated ?? VBN
duplicated JJ JJ
effected VBN VBN
elaborated VBN VBN
elevated JJ JJ
eliminated VBN VBN
elucidated VBN VBN
elutriated VBN VBN
embedded JJ JJ
employed VBN VBN
encoded VBN VBN
endocytosed VBN VBN chemical reaction
endowed JJ JJ
engineered VBN VBN
enhanced VBN VBN
enhanced VBN VBN
enriched JJ JJ
errected JJ H
established JJ VBN
esterified VBN VBN
evaluated VBN VBN
examined VBN VBN
excised VBN VBN
excluded VBN VBN
exerted VBN VBN
expanded VBN VBN
exposed VBN VBN
expressed VBN VBN
extended VBN VBN
extracted VBN VBN
grown JJ VBN
localized JJ JJ

Prenominal After "be" without "by-"
fixed JJ ?
footprinted VBN VBN
forced VBN VBN
formed VBN VBN
formylated VBN VBN
formylated VBN VBN
found VBN VBN
fragmented JJ JJ
generalized VBN VBN
generated VBN VBN
given JJ VBN Given/IN XXX (conditional)
grown JJ VBN
hypopigmented JJ JJ
hypothesized VBN VBN
identified VBN VBN
immobilized VBN VBN
immortalized VBN VBN
immunostained VBN VBN
impaired JJ JJ
implicated VBN VBN
increased VBN VBN
incubated VBN VBN
induced VBN VBN
infected JJ VBN
infected JJ VBN
inflamed JJ JJ when it modifies tissues or organs
inhaled VBN JJ
inherited VBN VBN
inhibited VBN VBN
initiated VBN VBN
injected VBN VBN
inoculated VBN VBN
integrated VBN VBN
interconnected JJ JJ
inverted JJ|VBN? VBN
investigated VBN VBN
involved VBN VBN
isolated VBN VBN
juxtaposed VBN VBN

Prenominal After "be" without "by-"
known JJ VBN
labelled VBN VBN
limited JJ JJ
linked VBN VBN
loaded VBN VBN
localized JJ JJ
located JJ JJ
lowered JJ JJ
maintained JJ JJ
mapped VBN VBN
marked JJ JJ
measured VBN VBN
mediated VBN VBN
microdissected VBN VBN
mixed JJ VBN
modified VBN VBN
modulated VBN VBN
mutagenized VBN VBN
mutated VBN VBN
myristoylated VBN VBN
named VBN VBN
needed VBN VBN
neutralized VBN VBN
noted VBN VBN
noticed VBN VBN
nucleated JJ JJ
observed VBN VBN
obtained VBN VBN
opsonized VBN VBN
optimized VBN VBN
overexpressed VBN VBN
oxidized VBN VBN

Prenominal After "be" without "by-"
paired JJ JJ
paralleled JJ JJ
parasitized VBN VBN
performed VBN VBN
perturbed VBN VBN
phosphorylated VBN VBN
plated VBN VBN
poised VBN VBN
polarized VBN VBN
polyadenylated VBN VBN
polyubiqitinated VBN VBN
potentiated JJ JJ
predicted VBN VBN
prepared VBN VBN
presented VBN VBN
preserved JJ JJ
presumed VBN VBN
primed VBN VBN
processed VBN VBN
produced VBN VBN
programmed VBN VBN
prolonged JJ JJ
pronounced JJ JJ
proposed JJ VBN?
protected JJ VBN
published VBN VBN
pulsed JJ JJ?
purified VBN VBN
radiolabeled VBN VBN
reactivated VBN VBN
reared VBN VBN
rearranged VBN VBN
reasoned VBN VBN
recognized VBN VBN
reconstituted VBN VBN
reconstructed VBN VBN
recovered VBN VBN
recruited VBN VBN
reduced VBN VBN
referred VBN VBN
regulated VBN VBN
related JJ JJ
released VBN VBN
rendered VBN VBN
repeated JJ VBN
replaced VBN VBN
replicated VBN VBN
reported VBN VBN
required JJ VBN
rescued VBN VBN
resected JJ VBN
restricted JJ JJ
resulted VBN VBN
retained VBN VBN
retarded VBN VBN
retrodiffentiated VBN VBN
returned VBN VBN
revealed VBN VBN
reversed VBN VBN
revisited VBN VBN
ruptured VBN VBN
saturated JJ JJ
secreted VBN VBN
seeded VBN VBN
selected VBN VBN
sensitized VBN VBN
sequestered VBN VBN
seroconverted VBN VBN
shared JJ VBN
skewed JJ VBN
slipped JJ JJ
sorted VBN VBN
speckled JJ JJ
spliced VBN VBN
stimulated VBN VBN
stranded JJ JJ
stratified ɂ
stressed JJ JJ
studied VBN VBN
submitted VBN VBN
substituted VBN VBN
subtracted VBN VBN
suggested VBN VBN
supercoiled JJ JJ
suppressed VBN VBN
surveyed VBN VBN
sustained JJ JJ
synchronized VBN VBN
synthesized VBN VBN
targeted VBN VBN
targetted VBN VBN
termed VBN VBN
tested VBN VBN
transactivated VBN VBN
transcribed VBN VBN
transduced VBN VBN
transfected VBN VBN
transferred VBN VBN
transformed VBN VBN
translated VBN VBN
translocated VBN VBN
transplanted VBN VBN
treated VBN VBN
truncated VBN VBN

Prenominal After "be" without "by-"
ubiquitinated VBN VBN
ubiquitinylated VBN VBN
underlined VBN VBN
underphosphorylated VBN VBN
undertaken VBN VBN
used VBN VBN
utilized VBN VBN
washed VBN VBN
XXXmerized JJ JJ

-ing

JJ (those marked with * has an adjective entry/entries in LDOCE)

NN

VBG


PREPOS data

The following expressions are technical terms and other expressions that are assigned pre-fixed POSs independent of the context they appear by "prepos.pl" before mechanical POS-tagging..

Foreign Terms

Sicentific Names of Organisms(treated as Latin)

B. burgdorferi => B./FW burgdorferi/FW
Borrelia burgdorferi => Borrelia/FW burgdorferi/FW
C. Sub. => C./JJ Sub./NN
C. albicans => C./FW albicans/FW
C. neoformans => C./FW neoformans/FW
Candida albicans => Candida/FW albicans/FW
Catalytic Subunit => Catalytic/JJ Subunit/NN
Cryptococcus neoformans => Cryptococcus/FW neoformans/FW
D. melanogaster => D./FW melanogaster/FW
Drosophila melanogaster => Drosophila/FW melanogaster/FW
E. coli => E./FW coli/FW
Escherichia coli => Escherichia/FW coli/FW
L. donovani => L./FW donovani/FW
L. monocytogenes => L./FW monocytogenes/FW
L. pneumophila => L./FW pneumophila/FW
Legionella pneumophila => Legionella/FW pneumophila/FW
Leishmania donovani => Leishmania/FW donovani/FW
Listeria monocytogenes => Listeria/FW monocytogenes/FW
P. aeruginosa => P./FW aeruginosa/FW
P. falciparum => P./FW falciparum/FW
Pseudomonas aeruginosa => Pseudomonas/FW aeruginosa/FW
Plasmodium falciparum => Plasmodium/FW falciparum/FW
S. aureus => S./FW aureus/FW
S. cerevisiae => S./FW cerevisiae/FW
S. oedipus => S./FW oedipus/FW
S. typhimurium => S./FW typhimurium/FW
Saccharomyces cerevisiae => Saccharomyces/FW cerevisiae/FW
Saguinus oedipus => Saguinus/FW oedipus/FW
Salmonella typhimurium => Salmonella/FW typhimurium/FW
Staphylococcus aureus => Staphylococcus/FW aureus/FW
T. annulata => T./FW annulata/FW
T. pallidum => T./FW pallidum/FW
Theileria annulata => Theileria/FW annulata/FW
Treponema pallidum => Treponema/FW pallidum/FW

Other Terms of Latin Origin

bona fide => bona/FW fide/FW
in vitro => in/FW vitro/FW
In vitro => In/FW vitro/FW
in vivo => in/FW vivo/FW
In vivo => In/FW vivo/FW
ex vivo => ex/FW vivo/FW
Ex vivo => Ex/FW vivo/FW
de novo => de/FW novo/FW
De novo => De/FW novo/FW
in situ => in/FW situ/FW
In situ => In/FW situ/FW
in-situ => in-situ/FW
In-situ => In-situ/FW
per se => per/FW se/FW
et al => et/FW al/FW
et al. => et/FW al./FW
etc => etc/FW
etc. => etc./FW
ie. => i.e./FW
i.e. => i.e./FW
i. e. => i./FW e./FW
e.g. => e.g./FW
e. g. => e./FW g./FW
ca. => ca./FW
versus. => vs./CC
via => via/IN
vs. => vs./CC
per => per/IN

Greek Letters

alpha => alpha/NN
beta => beta/NN
gamma => gamma/NN
kappa => kappa/NN
Alpha => Alpha/NN
Beta => Beta/NN
Gamma => Gamma/NN
Kappa => Kappa/NN

Other terms of foreign origin

Liuwei Dihuang Pills => Liuwei/FW Dihuang/FW Pills/NNS
bacille Calmette-Guerin vaccine => bacille/FW Calmette-Guerin/FW vaccine/NN

Confusing Singular/Plural Forms

anlage => anlage/NN
foci => foci/NNS
focus => focus/NN
mice => mice/NNS
nuclei => nuclei/NNS
nucleus => nucleus/NN
pancreas => pancreas/NN
pancreata => pancreata/NNS
sera => sera/NNS
serum => serum/NN
women => women/NNS

Multi-word prepositions and other fixed expressions

According to => According/VBG to/IN
As well as => As/RB well/RB as/IN
Due to => Due/JJ to/TO
In general => In/IN general/JJ
In particular => In/IN particular/JJ
Instead of => Instead/RB of/IN
That is, => That/DT is/VBZ ,/,
Whereas => Whereas/IN
a means => a/DT means/NN
according to => according/VBG to/IN
as well => as/RB well/RB
as well as => as/RB well/RB as/IN
by means of => by/IN means/NN of/IN
due to => due/JJ to/TO
each other => each/DT other/JJ
in general => in/IN general/JJ
in particular => in/IN particular/JJ
instead of => instead/RB of/IN
so called => so/RB called/VBN
such as => such/JJ as/IN
that is, => that/DT is/VBZ ,/,
whereas => whereas/IN
at least => at/IN least/JJS

Numeric Expressions

Units: treat as nouns

Tag prenominal "number-unit" (number nyphnated with unit) expressons as adjectives.
microM => microM/NN
fmol/10(7) => fmol/10(7)/NN
bp => bp/NN

Symbols

[ => [/(
] => ]/)
+/- => +/-/CC
+ => +/CC
- => -/CC
> => >/JJR (May be RBR, depending on context)
< => </JJR (May be RBR, depending on context)
= => =/JJ (May be RB, depending on context)

Phreses Treated as Adjectives

Note:

Abbreviations

CD1+RelB+ => CD1+RelB+/JJ
CD14(+) => CD14(+)/JJ
CD14+ => CD14+/JJ
CD34(+) => CD34(+)/JJ
CD4(+) => CD4(+)/JJ
CD4(-) => CD4(-)/JJ
CD4+ => CD4+/JJ
CD4- => CD4-/JJ
CD68(+) => CD68(+)/JJ
CD68(-) => CD68(-)/JJ
CD68+ => CD68+/JJ
CD68- => CD68-/JJ
CD8(+) => CD8(+)/JJ
CD8(-) => CD8(-)/JJ
CD8+ => CD8+/JJ
CD8- => CD8-/JJ
FCS+ => FCS+/JJ
FCS- => FCS-/JJ
GD3+ => GD3+/JJ
GD3- => GD3-/JJ
HTLV-I(+) => HTLV-I(+)/JJ
p53-/- => p53-/-/JJ

Other Adjectives

(AOP)-RANTES-triggered => (AOP)-RANTES-triggered/JJ
3' => 3'/JJ
3-hydroxy-3-methylglutaryl => 3-hydroxy-3-methylglutaryl/JJ
5' => 5'/JJ
5-Isoquinolinyl => 5-Isoquinolinyl/JJ
A-independent => A-independent/JJ
B-lymphoblastoid => B-lymphoblastoid/JJ
Ca(2+)-dependent => Ca(2+)-dependent/JJ
FCS => FCS/JJ
Human => Human/JJ
Molecular => Molecular/JJ
PKC-dependent => PKC-dependent/JJ
Stat1-dependent => Stat1-dependent/JJ
T-lymphoblastoid => T-lymphoblastoid/JJ
adult => adult/JJ
anti-Ig => anti-Ig/JJ|NN ♦anti- : depends on context, if prenominal it's JJ but if it's nominal head it's NN
bHLH => bHLH/JJ ♦basic helix-loop-helix
basal => basal/JJ
bovine => bovine/JJ
chromosomal => chromosomal/JJ
chronic => chronic/JJ
cis => cis/JJ
dependent => dependent/JJ
dibutyryl => dibutyryl/JJ
dificient => dificient/JJ
dominant-negative => dominant-negative/JJ
elevated => elevated/JJ
endocrine => endocrine/JJ
erythroid => erythroid/JJ
granulocyte-macrophage => granulocyte-macrophage/JJ
helix-loop-helix => helix-loop-helix/JJ
heterozygous => heterozygous/JJ
heterozygous => heterozygous/JJ
homozygous => homozygous/JJ
human => human/JJ
immune => immune/JJ
inflammatory => inflammatory/JJ
kinase-dependent => kinase-dependent/JJ
located => located/JJ
lymphoid => lymphoid/JJ
mature => mature/JJ
mononuclear => mononuclear/JJ
murine => murine/JJ
nevoid => nevoid/JJ
novel => novel/JJ
novel => novel/JJ
parental => parental/JJ
present => present/JJ
present => present/JJ
related => related/JJ
related => related/JJ
resistant => resistant/JJ
rheumatoid => rheumatoid/JJ
sIgD-crosslinking => sIgD-crosslinking/JJ
sIgM-crosslinking => sIgM-crosslinking/JJ
semimature => semimature/JJ
synovial => synovial/JJ
terminal => terminal/JJ
terminal => terminal/JJ
tissue-specific => tissue-specific/JJ
tolerant => tolerant/JJ
tonsillar => tonsillar/JJ
tonsillar => tonsillar/JJ
trans-acting => trans-acting/JJ
trans => trans/JJ
transcription-independent/JJ => transcription-independent/JJ
transient => transient/JJ
umbilical => umbilical/JJ
unilineage => unilineage/JJ
zinc-finger => zinc-finger/JJ

Verbs

evaginate => evaginate/VBP|VB

Names with Hyphens: treated as nouns

(AOP)-RANTES => (AOP)-RANTES/NN
12-myristate => 12-myristate/NN
13-acetate => 13-acetate/NN
20-Epi => 20-Epi/NN
20-epi => 20-epi/NN
25-Dihydroxyvitamin => 25-Dihydroxyvitamin/NN
3'-UTR => 3'-UTR/NN
32Dcl3 => 32Dcl3/NN
6TG1.1 => 6TG1.1/NN
AIR-1 => AIR-1/NN
Academic Press => Academic/NNP Press/NNP
B*CaM-K => B*CaM-K/NN
BCL-2 => BCL-2/NN
CDR2-loop => CDR2-loop/NN
CTLA-4 => CTLA-4/NN
D1/CDR2-loop => D1/CDR2-loop/NN
D1/CDR3-loop => D1/CDR3-loop/NN
FGF-1 => FGF-1/NN
Fli-1 => Fli-1/NN
G-CSF => G-CSF/NN
GSK-3beta => GSK-3beta/NN
Glucocorticoid => Glucocorticoid/NN
HCD57-SREI => HCD57-SREI/NN ♦name of a cell line
HES-1 => HES-1/NN
HL60 => HL60/NN
HS-72 => HS-72/NN
HT-2 => HT-2/NN
HUVECs => HUVECs/NNS
Hepa-1 => Hepa-1/NN
ICAM-1 => ICAM-1/NN
IFNs => IFNs/NNS
IL-12R => IL-12R/NN
IL-1beta => IL-1beta/NN
IL-1beta => IL-1beta/NN
IL-2R => IL-2R/NN
IL- => IL-/NN
JNK => JNK/NN
LEF-1 => LEF-1/NN
LEF-1 => LEF-1/NN
LMP-1 => LMP-1/NN
LP => LP/NN
M4 => M4/NN
MAPK => MAPK/NN
MCP-1 => MCP-1/NN
ML-1 => ML-1/NN
MO7e => MO7e/NN ♦name of a cell line
NAD(P)H:quione => NAD(P)H:quione/NN
NF-AT- => NF-AT-/NN
NF-ATc => NF-ATc/NN
NF-ATc => NF-ATc/NN
NK => NK/NN
NSAIDs => NSAIDs/NNS
Notch1/TAN-1 => Notch1/TAN-1/NN
OH => OH/NN
PGG-Glucan => PGG-Glucan/NN
SHP-1 => SHP-1/NN
SOCS-3 => SOCS-3/NN
STAT-5 => STAT-5/NN
STAT5a => STAT5a/NN
Spi-1 => Spi-1/NN
T => T/NN
TNF => TNF/NN
TNF => TNF/NN
U-937 => U-937/NN
UF-1 => UF-1/NN
VCAM-1 => VCAM-1/NN
VCAM-1 => VCAM-1/NN
X-chromosome => X-chromosome/NN
activin => activin/NN
aminooxypentane => aminooxypentane/NN
anorexia => anorexia/NN
antigen => antigen/NN
bcl-xL => bcl-xL/NN
beta-catenin => beta-catenin/NN
beta-globin => beta-globin/NN
beta-hydroxysteroid => beta-hydroxysteroid/NN
breast => breast/NN
c-Myc => c-Myc/NN
c-myc => c-myc/NN
cytochalasin => cytochalasin/NN
dehydrastase/dimerization => dehydrastase/dimerization/NN
elatase => elatase/NN
factor-1 => factor-1/NN
fibroblast => fibroblast/NN
fos => fos/NN
glycosphingolipid => glycosphingolipid/NN
hIL-5 => hIL-5/NN
haemoglobin => haemoglobin/NN
hybrid => hybrid/NN
hydroxysteroid => hydroxysteroid/NN
immunoreceptor => immunoreceptor/NN
interleukin-10 => interleukin-10/NN
interleukin-1 => interleukin-1/NN
interleukin-3 => interleukin-3/NN
interleukin => interleukin/NN
isoforms => isoforms/NNS (sometimes VBZ)
lipid => lipid/NN
macrophage => macrophage/NN
mg-1 => mg-1/NN
mispairing => mispairing/NN
molecule-1 => molecule-1/NN
nucleotide => nucleotide/NN
p21 => p21/NN
p27 => p27/NN
p45 => p45/NN
p62 => p62/NN
p70(S6)-kinase => p70(S6)-kinase/NN
p70 => p70/NN
pRb => pRb/NN
phi => phi/NN
prednisolone => prednisolone/NN
pro-interleukin => pro-interleukin/NN
proIL-1beta => proIL-1beta/NN
proteasome => proteasome/NN
purine-box/antigen => purine-box/antigen/NN
rapamycin => rapamycin/NN
ras => ras/NN
s6k => s6k/NN
s => s/NNS
sera => sera/NNS (plural of serumj
serine/threonine => serine/threonine/NN
signal => signal/NN
stimulus => stimulus/NN
thiol/disulfide => thiol/disulfide/NN
thrombin => thrombin/NN
tumor => tumor/NN
1 alpha,25(OH)2D3 => 1/CD alpha,25(OH)2D3/NN
1 alpha,25-(OH)2D3 => 1/CD alpha,25-(OH)2D3/NN
1 alpha,25-dihydroxyvitamin D3 => 1/CD alpha,25-dihydroxyvitamin/NN D3/NN
1,25 dihydroxyvitamin D3 => 1,25/CD dihydroxyvitamin/NN D3/NN
1,25(OH)2D3 => 1,25(OH)2D3/NN
1,25-(OH)2D3 => 1,25-(OH)2D3/NN
1,25-dihydroxycholecalciferol => 1,25-dihydroxycholecalciferol/NN
1,25-dihydroxyvitamin D3 => 1,25-/CD dihydroxyvitamin/NN D3/NN
1,25-dihydroxyvitamin D3 => 1,25-dihydroxyvitamin/NN D3/NN
2,3,7,8-tetrachlorodibenzo-p-dioxin => 2,3,7,8-tetrachlorodibenzo-p-dioxin/NN
3-kinase => 3-kinase/NN
5'-(N-ethyl)-carboxamido adenosine => 5'-(N-ethyl)-carboxamido adenosine/NN
5'-flanking region => 5'-flanking/JJ region/NN
ADP-ribose => ADP-ribose/NN
AIDS => AIDS/NN
AIR-1 => AIR-1/NN
AP-1 element => AP-1/NN element/NN
AP-1 site => AP-1/NN site/NN
AP-1 => AP-1/NN name of transcription factor
AP-2 => AP-2/NN name of transcription factor
AP1 => AP1/NN ♦ name of transcription factor
APL => APL/NN ♦ leukemia
AT2.5 => AT2.5/NN ♦cell line
ATF-1 => ATF-1/NN
ATRA => ATRA/NN ♦ all-trans retinoic acid
Ab => Ab/NN ♦ antibody
Abs => Abs/NNS ♦antinodies
Ah-recepter => Ah-recepter/NN
Ah => Ah/NN
B cell => B/NN cell/NN
B cells => B/NN cells/NNS
B lymphocyte => B/NN lymphocyte/NN
B lymphocytes => B/NN lymphocytes/NNS
B lymphoma => B/NN lymphoma/NN
B*CaM-K => B*CaM-K/NN
B-cell => B-cell/NN
B-cells => B-cells/NNS
B104 => B104/NN ♦cell line
B95-8 => B95-8/NN
BALB/c 3T3 => BALB/c/NN 3T3/NN ♦cell line
BCL-6 => BCL-6/NN
BL2 => BL2/NN ♦cell line
BSAP => BSAP/NN
BW5147 => BW5147/NN ♦cell line
BZLF1 promoter => BZLF1/NN promoter/NN
BZLF1 => BZLF1/NN
Bcl-2 => Bcl-2/NN
CAT gene => CAT/NN gene/NN
CD14 => CD14/NN
CD40 => CD40/NN
CD4 => CD4/NN
CD8 => CD8/NN
CSF-1 => CSF-1/NN
Ca(2+)-ATPase => Ca(2+)-ATPase/NN
Ca(2+)-ionophore => Ca(2+)-ionophore/NN
Ca2+ => Ca2+/NN
Cp => Cp/NN
CsA => CsA/NN
DM => DM/NN
DN => DN/JJ ♦doninant-negative
DNA binding => DNA/NN binding/NN
DNA sequences => DNA/NN sequences/NNS
DNA DNA/NN
DNase-I => DNase-I/NN
E(gre) => E(gre)/NN
E1A => E1A/NN
E2F-1 => E2F-1/NN
E2F => E2F/NN
EBNA-2 => EBNA-2/NN
EBNA2 => EBNA2/NN
EBV => EBV/NN ♦Epstein-Barr Virus
EGR-1 => EGR-1/NN
ER => ER/NN
Egr-1 => Egr-1/NN
Elf-1 => Elf-1/NN
Epstein-Barr virus => Epstein-Barr/JJ virus/NN ♦Epstein, Barr : person
FDC => FDC/NN
FK506 => FK506/NN
Fos => Fos/NN
G(Anh)MTetra => G(Anh)MTetra/NN
GATA-1 gene => GATA-1/NN gene/NN
GATA-1 => GATA-1/NN
GC box => GC/NN box/NN
GCR => GCR/NN
GD3 => GD3/NN
GM-CSF gene => GM-CSF/NN gene/NN
GM-CSF => GM-CSF/NN
GP+E-86 => GP+E-86/NN ♦cell line
GR mRNA => GR/NN mRNA/NN
GR => GR/NN
H2O2 => H2O2/NN
HB24 => HB24/NN
HCD57-SREI => HCD57-SREI/NN
HES-1 => HES-1
HIV enhancer => HIV/NN enhancer/NN
HIV long terminal repeat => HIV/NN long/JJ terminal/JJ repeat/NN
HIV-1 LTR => HIV-1/NN LTR/NN
HIV-1 LTR => HIV-1/NN LTR/NN ♦human immunodeficiency virus long term repeat
HIV-1 => HIV-1/NN ♦human immunodeficiency virus I
HIV-TF1 => HIV-TF1/NN
HIV => HIV/NN ♦human immunodeficiency virus
HL-60 => HL-60/NN ♦cell line
HLA-DR and -DP => HLA-DR/NN and/CC -DP/NN
HLA-DRA and DQB => HLA-DRA/NN and/CC DQB/NN
HPB.ALL => HPB.ALL/NN ♦cell line
HTLV-I => HTLV-I/NN ♦human T-cell leukemia virus
HUT78 => HUT78/NN ♦cell line
HeLa => HeLa/NN ♦cell line
Hepa-1 => Hepa-1/NN
I epsilon exon => I/NN epsilon/NN exon/NN
I kappa B alpha => I/NN kappa/NN B/NN alpha/NN
I kappa B-alpha => I/NN kappa/NN B-alpha/NN
I kappa B => I/NN kappa/NN B/NN
ICAM-1 => ICAM-1/NN
ICAM-1 => ICAM-1/NN
IFN alpha => IFN/NN alpha/NN
IFN-MCP => IFN-MCP/NN
IFN-alpha => IFN-alpha/NN
IFN-gamma => IFN-gamma/NN
IL-1 beta => IL-1/NN beta/NN ♦interleukin
IL-10 => IL-10/NN ♦interleukin
IL-12 => IL-12/NN ♦interleukin
IL-13R => IL-13R/NN
IL-1 => IL-1/NN
IL-2 gene => IL-2/NN gene/NN
IL-2 gene => IL-2/NN gene/NN
IL-2 promoter => IL-2/NN promoter/NN
IL-2 promoter => IL-2/NN promoter/NN ♦interleukin
IL-2 => IL-2/NN ♦interleukin
IL-2R alpha => IL-2R/NN alpha/NN ♦interleukin
IL-2R => IL-2R/NN
IL-3 => IL-3/NN
IL-4 gene => IL-4/NN gene/NN
IL-4 => IL-4/NN ♦interleukin
IL-4R => IL-4R/NN
IL-5 => IL-5/NN
IL-5 => IL-5/NN
IL-5 => IL-5/NN ♦interleukin
IL-6 gene => IL-6/NN gene/NN
IL-6 => IL-6/NN ♦interleukin
IL-8 => IL-8/NN ♦interleukin
IRF-1 => IRF-1/NN
Id3 => Id3/NN
Ig heavy chain mu E1 site => Ig/NN heavy/JJ chain/NN mu/NN E1/NN site/NN
IgD => IgD/NN
IgE => IgE/NN
IgH => IgH/NN
IgM => IgM/NN
J.Jhan => J.Jhan/NN ♦cell line
JAK3 => JAK3/NN
JNKs => JNKs/NNS
Jun => Jun/NN
JunD => JunD/NN
Jurkat => Jurkat/NN ♦cell line
K-562 => K-562/NN ♦cell line
K562 => K562/NN ♦cell line
LAK => LAK/NN
LFA-1 => LFA-1/NN
LMP1(1-231) => LMP1(1-231)/NN
LMP1 => LMP1/NN
LPS => LPS/NN
LTB4 => LTB4/NN
LTR => LTR/NN
LTR => LTR/NN
LYSP100 => LYSP100/NN
M-CSF => M-CSF/NN
M-TAT => M-TAT/NN
M1 => M1/NN ♦cell line
M4 => M4/NN
MCP-1 => MCP-1/NN
MEL => MEL/NN
MO7e => MO7e/NN
MR => MR/NN
NAC => NAC/NN
NB4 => NB4/NN ♦cell line
NF kappa B => NF/NN kappa/NN B/NN
NF-AT => NF-AT/NN
NF-ATp => NF-ATp/NN
NF-kB => NF-kB/NN
NF-kappa B p65 => NF-kappa/NN B/NN p65/NN
NF-kappa B site => NF-kappa/NN B/NN site/NN
NF-kappa B-dependent genes => NF-kappa/NN B-dependent/JJ genes/NNS
NF-kappa B => NF-kappa/NN B/NN
NF-kappaB => NF-kappaB/NN
NFAT-1 => NFAT-1/NN
NFAT1 => NFAT1/NN
NFAT => NFAT/NN
NFATp => NFATp/NN
NK => NK/NN
NS-Meg => NS-Meg/NN
Na(+)-H(+)-antiport => Na(+)-H(+)-antiport/NN
Na(+)-H+ antiport => Na(+)-H+/JJ|NN antiport/NN
Nef => Nef/NN
Northern blot => Northern/NN blot/NN
Northern blotting => Northern/NN blotting/NN
OBF-1 => OBF-1/NN
OCA-B => OCA-B/NN
OCT-1 => OCT-1/NN
Oct-1 => Oct-1/NN
Oct-2 => Oct-2/NN
P3HR-1 => P3HR-1/NN
PBL => PBL/NN
PGE2 => PGE2/NN
PHA => PHA/NN
PIH => PIH/NN
PKC-zeta => PKC-zeta/NN
PKC => PKC/NN
PMA => PMA/NN
PML/RARalpha => PML/RARalpha/NN
PML => PML/NN
PU.1 => PU.1/NN
R24 => R24/NN
RA => RA/NN
Raji => Raji/NN ♦cell line
Rb => Rb/NN
RelA => RelA/NN
RelB => RelB/NN
S phase => S/NN phase
SK-N-Be(2) => SK-N-Be(2)/NN ♦cell line
SLP-76 => SLP-76/NN
SP1 binding site => SP1/NN binding/NN site/NN
SRY => SRY/NN
STAT1 alpha => STAT1/NN alpha/NN
STAT3 => STAT3/NN
STAT4 => STAT4/NN
STAT5 => STAT5/NN
SV40 => SV40/NN
Southern blotting => Southern/NN blotting/NN
Southern blot => Southern/NN blot/NN
Southwestern blot => Southwestern/NN blot/NN
Southwestern blotting => Southwestern/NN blotting/NN
Sp1 site => Sp1/NN site/NN
Sp1 => Sp1/NN
Spi-B => Spi-B/NN
Stat1 => Stat1/NN
Stat3 => Stat3/NN
SupT1 => SupT1/NN ♦cell line
T cell activation => T/NN cell/NN activation/NN
T cell => T/NN cell/NN
T cells => T/NN cells/NNS
T lymphocyte => T/NN lymphocyte/NN
T lymphocytes => T/NN lymphocytes/NNS
T lymphoma => T/NN lymphoma/NN
T-cell activation => T-cell/NN activation/NN
T-cell genes => T-cell/NN genes/NNS
T-cell => T-cell/NN
T-cells => T-cells/NNS
T3 => T3/NN
TAL-1 => TAL-1/NN
TATA box => TATA/NN box/NN
TCDD => TCDD/NN
TCF-1 => TCF-1/NN
TCR2 => TCR2/NN
TCR => TCR/NN
TGF-beta => TGF-beta/NN
THP-1 => THP-1/NN ♦cell line
THP1 => THP1/NN ♦cell line
TNF alpha => TNF/NN alpha/NN
TNF gene => TNF/NN gene/NN
TNF-alpha => TNF-alpha/NN
TNFR1 => TNFR1/NN
TPA => TPA/NN
TRAF3 => TRAF3/NN
TRP => TRP/NN ♦cell line
Tax1 => Tax1/NN
Tax => Tax/NN
Thymocyte => Thymocyte/NN
Thymocytes => Thymocytes/NNS
U937 => U937/NN ♦cell line
UT-7 => UT-7/NN ♦cell line
VCAM-1 => VCAM-1
VD => VD/NN
VDR => VDR/NN
VitD3 => VitD3/NN
WEHI-3B(D+) => WEHI-3B(D+)/NN ♦cell line
Western blot => Western/NN blot/NN
Western blotting => Western/NN blotting/NN
X chromosome => X/NN chromosome/NN
X-chromosome => X-chromosome/NN
YT-N17 => YT-N17/NN ♦cell line
[3H]calcitriol/VDR => [3H]calcitriol/VDR/NN
[3H]calcitriol => [3H]calcitriol/NN
[3H]dexamethasone => [3H]dexamethasone/NN
[3H]thymidine => [3H]thymidine/NN
activated T cells => activated/VBN T/NN cells/NNS
acute => acute/JJ
adhesion molecules => adhesion/NN molecules/NNS
aldosterone => aldosterone/NN
alpha-methyl-1 alpha,25-(OH)2D3 => alpha-methyl-1/NN alpha,25-(OH)2D3/NN
alpha-tcp => alpha-tcp/NN
alpha-tosyl-L-lysine => alpha-tosyl-L-lysine/NN
alpha-tosylphenylalanyl => alpha-tosylphenylalanyl/JJ
androgen receptor gene => androgen/NN receptor/NN gene/NN
apoptosis => apoptosis/NN
atopic => atopic/JJ
autosome => autosome/NN
bcl-2 => bcl-2/NN
beta-globin => beta-globin/NN
bp => bp/NN
breast cancer => breast/NN cancer/NN
bronchoalveolar => bronchoalveolar/JJ
burst-forming unit-erythroid ( BFU-E ) => burst-forming/JJ unit-erythroid/NN (/( BFU-E/NN )/)
c-Fos => c-Fos/NN
c-Jun => c-Jun/NN
c-Myc => c-Myc/NN
c-Rel => c-Rel/NN
c-fes => c-fes/NN
c-fos gene => c-fos/NN gene/NN
c-fos => c-fos/NN
c-jun => c-jun/NN
c-kit => c-kit/NN
c-myb => c-myb/NN
c-myc => c-myc/NN
cAMP => cAMP/NN
cDNAs => cDNAs/NNS
calcineurin => calcineurin/NN
calcium ionophore => calcium/NN ionophore/NN
calcium => calcium/NN
cell lines => cell/NN lines/NNS
cells => cells/NNS
cellular gene => cellular/JJ gene/NN
chromatin => chromatin/NN
corticosteroid => corticosteroid/NN
cortisol => cortisol/NN
cycloheximide => cycloheximide/NN
cyclosporin A => cyclosporin/NN A/NN
cysteine => cysteine/NN
cytokine genes => cytokine/NN genes/NNS
cytokine => cytokine/NN
cytokines => cytokines/NNS
cytoplasm => cytoplasm/NN
dexamethasone => dexamethasone/NN
differentiation => differentiation/NN
downstream sequence => downstream/JJ sequence/NN
egr-1 => egr-1/NN
electrophoretic mobility shift assays => electrophoretic/JJ mobility/NN shift/NN assays/NNS
endothelial cells => endothelial/JJ cells/NNS
enzyme => enzyme/NN
expression => expression/NN
factor-1 => factor-1/NN
fluoroquinolone => fluoroquinolone/NN
forskolin => forskolin/NN
gamma(c) => gamma(c)
gelatinase => gelatinase/NN
gene expression => gene/NN expression/NN
gene transcription => gene/NN transcription/NN
genes => genes/NNS
globin genes => globin/NN genes/NNS
globin => globin/NN
glucocorticoid receptor => glucocorticoid/NN receptor/NN
glucocorticoid receptors => glucocorticoid/NN receptors/NNS
glucocorticoid => glucocorticoid/NN
glucocorticoids => glucocorticoids/NNS
granulocyte DNA => granulocyte/NN DNA/NN
granulocytes => granulocytes/NNS
hGR => hGR/NN
hIL-5 => hIL-5/NN
hematopoietic cells => hematopoietic/JJ cells/NNS
hemin => hemin/NN
histone H5 => histone H5/NN
human A gamma-globin gene => human/JJ A/NN gamma-globin/NN gene/NN
human T cells => human/JJ T/NN cells/NNS
human immunodeficiency virus long terminal repeat => human/JJ immunodeficiency/NN virus/NN long/JJ terminal/JJ repeat/NN
human immunodeficiency virus type 1 enhancer => human/JJ immunodeficiency/NN virus/NN type/NN 1/CD enhancer/NN
human immunodeficiency virus type 1 => human/JJ immunodeficiency/NN virus/NN type/NN 1/CD
human immunodeficiency virus => human/JJ immunodeficiency/NN virus/NN
human monocytes => human/JJ monocytes/NNS
immediate early gene => immediate/JJ early/JJ gene/NN
immediate-early gene => immediate-early/JJ gene/NN
immune response => immune/JJ response/NN
immune system => immune/JJ system/NN
immunoglobulin genes => immunoglobulin/NN genes/NNS
immunoglobulin => immunoglobulin/NN
inflammatory genes => inflammatory/JJ genes/NNS
inositol 1,4,5-trisphosphate/calcium => inositol/NN 1,4,5-trisphosphate/calcium/NN
inositol phosphate => inositol phosphate/NN
inositol-1,4,5-trisphosphate => inositol-1,4,5-trisphosphate/NN
interferon => interferon/NN
interleukin-2 enhancer => interleukin-2/NN enhancer/NN
interleukin-2 gene => interleukin-2/NN gene/NN
interleukin-2 => interleukin-2/NN
interleukin-4 => interleukin-4/NN
interleukin-5 => interleukin-5/NN
interleukin-6 => interleukin-6/NN
interleukin-8 => interleukin-8/NN
islet => islet/NN
kappa B sequence => kappa/NN B/NN sequence/NN
leukocyte => leukocyte/NN
lining cell => lining/JJ cell
lining region => lining/JJ region
lipopolysaccharide => lipopolysaccharide/NN
long terminal repeat => long/JJ terminal/JJ repeat/NN
lymph-node => lymph-node/NN
lymph => lymph/NN
lymphocyte => lymphocyte/NN
lymphocytes => lymphocytes/NNS
lymphokine genes => lymphokine/NN genes/NNS
lyn => lyn/NN
mAb => mAb/NN
mAbs => mAbs/NNS
mRNA => mRNA/NN
macrophages => macrophages/NNS
mast cell => mast/NN cell/NN
megakaryocyte => megakaryocyte/NN
mg-1 => mg-1/NN
microM => microM/NN
mineralocorticoid => mineralocorticoid/NN
molecular => molecular/JJ
molecule-1 => molecule-1/NN
monocytes => monocytes/NNS
mononuclear leukocytes => mononuclear/JJ leukocytes/NNS
myb => myb/NN
necrosis => necrosis/NN
nef => nef/NN
neutrophils => neutrophils/NNS
nuclear extracts => nuclear/JJ extracts/NNS
nuclear factor kappa B => nuclear/JJ factor/NN kappa/NN B/NN
nuclear factor of activated T cells => nuclear/JJ factor/NN of/IN activated/VBN T/NN cells/NNS
nuclear factors => nuclear/JJ factors/NNS
nuclear translocation => nuclear/JJ translocation/NN
nucleotide => nucleotide/NN
octamer motif => octamer/NN motif/NN
octamer => octamer/NN
okadaic acid => okadaic/JJ acid/NN
oligoadenylate synthetase => oligoadenylate/NN synthetase/NN
oncoprotein => oncoprotein/NN
oriP => oriP/NN
p100 => p100/NN
p105 => p105/NN
p38 => p38/NN
p50 => p50/NN
p52 => p52/NN
p65 => p65/NN
patient => patient/NN
patients => patients/NNS
peptide => peptide/NN
peri-kappa B => peri-kappa/NN B/NN
peripheral blood => peripheral/JJ blood/NN
peroxisome => peroxisome/NN ♦sub-cell source
phagocyte => phagocyte/NN
phorbol ester => phorbol/NN ester/NN
phorbol esters => phorbol/NN esters/NNS
phosphate => phosphate/NN
phosphatidylinositol => phosphatidylinositol/NN
phytohemagglutinin => phytohemagglutinin/NN
plaque => plaque/NN
plasma cortisol => plasma/NN cortisol/NN
platelet-activating factor => platelet-activating/JJ factor/NN
pp56(lck) => pp56(lck)/NN
poly(ADP-ribose) polymerase => poly(ADP-ribose)/JJ polymerase/NN
prednisolone => prednisolone/NN
prodigiosin => prodigiosin/NN
proliferation => proliferation/NN
promoter => promoter/NN
promoter => promoter/NN
promoters => promoters/NNS
promoters => promoters/NNS
protein kinase C => protein/NN kinase/NN C/NN
protein-protein => protein-protein/JJ
proto-oncogenes => proto-oncogenes/NNS
protooncogene c-fos => protooncogene/NN c-fos/NN
rapamycin => rapamycin/NN
receptors => receptors/NNS
sIgD => sIgD/NN
sIgM => sIgM/NN
signaling pathway => signaling/NN pathway/NN
signaling pathways => signaling/NN pathways/NNS
signaling receptor => signaling/NN receptor
signalling pathway => signalling/NN pathway/NN
signalling pathways => signalling/NN pathways/NNS
sn-1,2 dioctanoyl glycerol => sn-1,2/NN dioctanoyl/JJ glycerol/NN
steroid => steroid/NN
substrates => substrates/NNS
tal-1 => tal-1/NN
therapy => therapy/NN
thrombin => thrombin/NN
thymocyte => thymocyte/NN
thymocytes => thymocytes/NNS
transcription factor => transcription/NN factor/NN
transcription factors => transcription/NN factors/NNS
transcription => transcription/NN
transcriptional activation => transcriptional/JJ activation/NN
transgenic mice => transgenic/JJ mice/NNS
tumor necrosis factor alpha => tumor/NN necrosis/NN factor/NN alpha/NN
tumor => tumor/NN
tumour => tumour/NN
tyrosine kinase => tyrosine/NN kinase/NN
tyrosine phosphorylation => tyrosine/NN phosphorylation/NN
tyrosine => tyrosine/NN
upstream sequence => upstream/JJ sequence/NN
v-abl => v-abl/NN
v-abl => v-abl/NN
vitamin => vitamin/NN

MEDLINE Database Flags

UI - => UI/LS -/:
TI - => TI/LS -/:
AB - => AB/LS -/: