關(guān)于條件隨機(jī)場的問題和理解

將一下三塊內(nèi)容結(jié)合起來看比較好:

條件隨機(jī)場的精彩總結(jié)性論述,更多詳細(xì)信息可以參閱連接文章:http://homepages.inf.ed.ac.uk/csutton/publications/crftut-fnt.pdf?
????Much work in learning with graphical models, especially in statistical natural-language processing, has focused on generative models that explicitly attempt to model a joint probability distribution p(y,x) over inputs and outputs. Although this approach has advantages, it also has important limitations. Not only can the dimensionality of x be very large, but the features may have complex dependencies, so constructing a probability distribution over them is difficult. Modeling the dependencies among inputs can lead to intractable models, but ignoring them can lead to reduced performance.

????A solution to this problem is a discriminative approach, similar to that taken in classifiers such as logistic regression. Here we model the conditional distribution p(y|x) directly, which is all that is needed for classification. This is the approach taken by conditional random fields (CRFs). CRFs are essentially a way of combining the advantages of discriminative classification and graphical modeling, combining the ability to compactly model multivariate outputs y with the ability to leverage a large number of input features x for prediction. The advantage to a conditional model is that dependencies that involve only variables in x play no role in the conditional model, so that an accurate conditional

李航-統(tǒng)計(jì)學(xué)習(xí)-線性鏈條件隨機(jī)場
李航的書中對線性鏈條件隨機(jī)場定義為如下所示,一開始難以理解,后來終于搞懂,式子中,索引i代表輸出序列的位置索引,k代表兩個輸出序列之間的轉(zhuǎn)移特征索引,即這種轉(zhuǎn)移可能有多個特征,l代表每個輸出序列位置對應(yīng)的特征索引,即序列中每個節(jié)點(diǎn)也可能有多個特征。t和s只取1或0表征的是此特征是否存在,前面的lamda和mu表征的是每個特征的權(quán)重,這就和下面的神經(jīng)網(wǎng)絡(luò)聯(lián)系在了一起。k個特征表征的是輸出維度,每個維度的標(biāo)量數(shù)值通過此處的權(quán)重來表征。


youtue --?https://www.youtube.com/watch?v=PGBlyKtfB74&list=PL6Xpj9I5qXYEcOhn7TqghAJ6NAPrNmUBH&index=19
下圖為上面視頻中的截圖,利用神經(jīng)網(wǎng)絡(luò)來獲取節(jié)點(diǎn)特征向量,利用矩陣V表征狀態(tài)間的轉(zhuǎn)移特征,李航的書中也有類似的矩陣表示,但是書中的矩陣與此處的矩陣有所不同,具體可以參考原書。

最后CRF是判別模型,概率計(jì)算采用前向后向算法,訓(xùn)練采用極大似然估計(jì)訓(xùn)練,預(yù)測采用維特比算法。目前對書中算法的細(xì)節(jié)沒有仔細(xì)研讀,等有時間再細(xì)看。

最后編輯于
?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請聯(lián)系作者
【社區(qū)內(nèi)容提示】社區(qū)部分內(nèi)容疑似由AI輔助生成,瀏覽時請結(jié)合常識與多方信息審慎甄別。
平臺聲明:文章內(nèi)容(如有圖片或視頻亦包括在內(nèi))由作者上傳并發(fā)布,文章內(nèi)容僅代表作者本人觀點(diǎn),簡書系信息發(fā)布平臺,僅提供信息存儲服務(wù)。

友情鏈接更多精彩內(nèi)容