I read some presentations and papers regarding smoothing techniques.
- Smoothing N-gram Language models
- An Empirical Study of Smoothing Techniques for Language Modeling
- N-gram models
- Improved Smoothing for N-gram Language Models Based on Ordinary Counts
- Smoothing Language Models
- NLP Lunch Tutorial: Smoothing
- Language models
I want to apply Smoothing on a data, containing zero values. Which one should be the best?
This is just an example:
Pathway1 Pathway2 Pathway3 Pathway4 Calcium ions 0 3 1 0 ATP 2 1 0 7