A probabilistic framework for automatic term recognition

    Research output: Contribution to journalArticlepeer-review

    15 Citations (Scopus)

    Abstract

    Term recognition identifies domain-relevant terms which are essential for discovering domain concepts and for the construction of terminologies required by a wide range of natural language applications. Many techniques have been developed in an attempt to numerically determine or quantify termhood based on term characteristics. Some of the apparent shortcomings of existing techniques are the ad-hoc combination of termhood evidence, mathematically-unfounded derivation of scores and implicit assumptions concerning term characteristics. We propose a probabilistic framework for formalising and combining qualitative evidence based on explicitly defined term characteristics to produce a new termhood measure. Our qualitative and quantitative evaluations demonstrate consistently better precision, recall and accuracy compared to three other existing ad-hoc measures.
    Original languageEnglish
    Pages (from-to)499-539
    JournalIntelligent Data Analysis
    Volume13
    Issue number4
    DOIs
    Publication statusPublished - 2009

    Fingerprint

    Dive into the research topics of 'A probabilistic framework for automatic term recognition'. Together they form a unique fingerprint.

    Cite this