Learn Before
Concept

Noisy channel algorithm for real-word correction

The algorithm takes the input sentence X={x1,x2,...,xk,...,xn}X=\{x_{1}, x_{2},..., x_{k},..., x_{n}\} and generates a set of candidate words C(xi)C(x_{i}) for each word xix_{i}. The candidates include every word with a small edit distance from xix_{i}. Then we pick the sentence with the highest language model probability. Each sentence is scored by the noisy channel model as follows:

Image 0

0

1

Updated 2026-06-15

Tags

Data Science