3. The Rhetorical Structure Theory [25] and [26] framework provides a well defined way of expressing discourse-level rhetorical relationships between utterances. The textual realisation IWR-1 of RST relations is not domain-specific, therefore the specific generation rules can be applied equally for the generation of medical summaries as well as any other type of English text. The RST framework is particularly suited to our specific application since the relations between chronicle events map naturally to RST schemas
(e.g., we express facts such as inference (an event led to another) or causality (an event causes another)). Saying it: Starting from a plan distributing the content among paragraphs and sentences, with some linking phrases and formatting already specified, a template-based grammar generates the surface forms of the sentences, producing as output a complete specification of the text. In our example, a template would map the domain-specific relationship inferences(biopsy, cancer) The text generation system uses two types selleck products of grammar rules for realising the summaries. Firstly,
a large standard generative grammar for English phrases and sentences, which consists of generic rules such as: definite noun phrases = [definite article] + [determiner] + [noun] (for phrases) or causal relation = main clause + causal connector + subordinate clause (for sentences). This helps generating constructs such as “the clinical diagnosis” or “the patient underwent chemotherapy because of the cancer”. These rules are static and independent of any new information available to the generation system, therefore no effort is involved
in Aprepitant enhancing the rules when new data becomes available to the system. The second set of generation rules are specific to the medical domain and more restricted in size. They govern the way the system expresses connections between words in the vocabulary, for example, the fact that the correct way of expressing an event of type surgical procedure is “the patient underwent surgery”. These rules are partially static in that they do not require re-writing or enhancing if we see new, unknown words which belong to a category known by the system (e.g., the fact that “mastectomy” is a brand new word of type surgical procedure doesn’t require rewriting the rules for surgical procedure. However, if the type of events in the Chronicle changes (e.g., if the system were to be applied to a new, non-medical, domain), we would need to manually create generation rules for each new type of event. Can these automatically generated summaries perform a useful role in the clinical setting? We explored this question through a formal study with twenty-one clinicians at a teaching hospital.