Lip Features Automatic Extraction
Abstract
This paper presents an algorithm for segmenting a speaker's lips and extracting lip features. A color video sequence of the speaker's face is acquired under natural lighting conditions and without any particular make-up. First, a logarithmic color transform is applied to convert each frame from the RGB color space to the HI (hue, intensity) color space. Second, a statistical approach based on Markov random field modeling identifies the regions where red hue prevails and where motion occurs within a spatiotemporal neighborhood. Third, the final label field is used to extract the region of interest (ROI) and the geometrical features of the lips.
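The three steps can be illustrated with a minimal sketch. It is not the method of the paper: the abstract gives no formulas, so the logarithmic hue below (log-ratio of red to green) and the intensity (mean of the log channels) are assumptions, the Markov random field labeling is replaced by a plain per-pixel hue threshold, and motion is ignored. All function names, the threshold value, and the choice of width and height as geometrical features are hypothetical.

```python
import math


def log_hue_intensity(r, g, b):
    """Map one 8-bit RGB pixel to a (hue, intensity) pair using logarithms.

    Assumption: hue is the log-ratio of red to green and intensity is the
    mean of the log channels. The paper's actual transform may differ.
    """
    lr, lg, lb = (math.log(c + 1.0) for c in (r, g, b))  # +1 avoids log(0)
    hue = lr - lg
    intensity = (lr + lg + lb) / 3.0
    return hue, intensity


def label_red_pixels(image, hue_threshold=0.5):
    """Build a binary label field: 1 where the assumed log-hue is high.

    This per-pixel threshold is a stand-in for the Markov random field
    labeling; it uses no spatial or temporal neighborhood.
    """
    return [
        [1 if log_hue_intensity(*px)[0] > hue_threshold else 0 for px in row]
        for row in image
    ]


def roi_and_features(labels):
    """Return the bounding box (ROI) and its width and height in pixels.

    Returns None when no pixel is labeled.
    """
    coords = [
        (y, x)
        for y, row in enumerate(labels)
        for x, value in enumerate(row)
        if value
    ]
    if not coords:
        return None
    ys = [y for y, _ in coords]
    xs = [x for _, x in coords]
    top, bottom, left, right = min(ys), max(ys), min(xs), max(xs)
    return {
        "roi": (top, left, bottom, right),  # inclusive pixel coordinates
        "width": right - left + 1,
        "height": bottom - top + 1,
    }
```

Here an image is a list of rows, each row a list of (R, G, B) tuples. Calling label_red_pixels on a frame and passing the result to roi_and_features yields a bounding box around the red-dominant pixels. A real implementation would regularize the label field over neighboring pixels and consecutive frames, as the Markov random field model does.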