5.9.1. Correct Spelling (High-Speed clip0090 action)

<< Click to Display Table of Contents >>

Navigation:  5. Detailed description of the Actions > 5.9. Text Mining >

5.9.1. Correct Spelling (High-Speed clip0090 action)

 
Icon: ANATEL~3_img694

 
Function: CorrectSpelling
 

Property window:

 

clip0233

 

Short description:

Correct Spelling-Mistakes in text fields.

 

Long Description:

Anatella include an operator that checks & corrects the spelling mistakes in any text field. For example, let’s assume that your database contains a field named “City of Birth”. This field will usually contains many different orthography (i.e. spelling error) of the same city. The ANATEL~3_img694 CorrectSpelling Action will detect and correct these errors automatically. It’s typically used to “clean” the database to get better reports, better predictive models, etc.

 

ANATEL~2_img8

For example, the city "RIO DE JANEIRO" can be mis-spelled in a number of different ways (this is a real-world example):

 

RIO DXE JANEIRO, RIO DE JAEIRO, RIOP DE JANEIRO, RIO NDE JANEIRO, RIO DEJANEIRO, `RIO DE JANEIRO, RIO DE JANIRO, RIO DE JANEI RO, RIO DE JANEIRIO, RI0 DE JANEIRO, RIO DE JNEIRO, RIO DE JANEEIRO, RIO DE JANEIROO, RIO DE JANAEIRO, RIO DE JANEIROR, RIO DE JANEIRO RJ

 
 

This action can operate in two different modes:
 

1.You don’t have any reference table.

2.You have a reference table (For example, you have a table that contains the exact orthography of all the possible city names).

 

Here are the parameters of this Action:

ANATEL~3_img693