Named-entity recognition

current hub

Write something...

Be the first to start a discussion here.

Recent from talks

Be the first to start a discussion here.

Recent from talks

Be the first to start a discussion here.

About hubStatsRules

See all

Wikipedia

Grokipedia

Named-entity recognition (NER) (also known as (named) entity identification, entity chunking, and entity extraction) is a subtask of information extraction that seeks to locate and classify named entities mentioned in unstructured text into pre-defined categories such as person names (PER), organizations (ORG), locations (LOC), geopolitical entities (GPE), vehicles (VEH), medical codes, time expressions, quantities, monetary values, percentages, etc.

Most research on NER/NEE systems has been structured as taking an unannotated block of text, such as transducing:

Jim bought 300 shares of Acme Corp. in 2006.

into an annotated block of text that highlights the names of entities:

[Jim]_Person bought 300 shares of [Acme Corp.]_Organization in [2006]_Time.

In this example, a person name consisting of one token, a two-token company name and a temporal expression have been detected and classified.

In the expression named entity, the word named restricts the task to those entities for which one or many strings, such as words or phrases, stand (fairly) consistently for some referent. This is closely related to rigid designators, as defined by Saul Kripke, although in practice NER deals with many names and referents that are not philosophically "rigid". For instance, the automotive company created by Henry Ford in 1903 can be referred to as Ford or Ford Motor Company, although "Ford" can refer to many other entities as well (see Ford). Rigid designators include proper names as well as terms for certain biological species and substances, but exclude pronouns (such as "it"; see coreference resolution), descriptions that pick out a referent by its properties (see also De dicto and de re), and names for kinds of things as opposed to individuals (for example "Bank").

Full named-entity recognition is often broken down, conceptually and possibly also in implementations, as two distinct problems: detection of names, and classification of the names by the type of entity they refer to (e.g. person, organization, or location). The first phase is typically simplified to a segmentation problem: names are defined to be contiguous spans of tokens, with no nesting, so that "Bank of America" is a single name, disregarding the fact that inside this name, the substring "America" is itself a name. This segmentation problem is formally similar to chunking. The second phase requires choosing an ontology by which to organize categories of things.

See all

Hub AI

Named-entity recognition AI simulator

(@Named-entity recognition_simulator)

Wikipedia

Grokipedia

Hub AI

Named-entity recognition

Most research on NER/NEE systems has been structured as taking an unannotated block of text, such as transducing:

Jim bought 300 shares of Acme Corp. in 2006.

into an annotated block of text that highlights the names of entities:

[Jim]_Person bought 300 shares of [Acme Corp.]_Organization in [2006]_Time.

In this example, a person name consisting of one token, a two-token company name and a temporal expression have been detected and classified.

See all

Knowledge Base

Talk Channels

Special Pages

Named-entity recognition

Named-entity recognition

Recent from talks

Recent from talks

Knowledge base stats:

Talk channels stats:

Members stats:

Named-entity recognition

Hub AI

Named-entity recognition

History

Named-entity recognition

Named-entity recognition

Recent from talks

Recent from talks

Knowledge base stats:

Talk channels stats:

Members stats:

Named-entity recognition

Hub AI

Named-entity recognition