Neural Dependency Parsing

寫在前面

  1. 關(guān)于parsing,目前有兩種主流的方法,一種是Constituency Parsing,另一種是Dependency Parsing,在本文中我們主要討論Dependency Parsing
  2. 知識點(diǎn)參見: 阿衡學(xué)姐的筆記 Dependency Parsing and Treebank
  3. Stanford cs224N講義: Dependency Parsing
  4. Basic論文: A Fast and Accurate Dependency Parser using Neural Networks,基于greedy transition-based parsing,結(jié)合了word_embeddingneural networks
  5. Advanced論文
    Globally Normalized Transition-Based Neural Networks
    Universal Dependencies: A cross-linguistic typology
    Incrementality in Deterministic Dependency Parsing
    主要改進(jìn)點(diǎn)有: 使用更深層次的神經(jīng)網(wǎng)絡(luò),處理non-projectivity問題,使用graph-based parsing
  6. 基礎(chǔ)項(xiàng)目: Neural Dependency Parsing

Neural transition-based Dependency Parsing

Conventional transition-based Dependency Parsing

  1. Structure: Stack s, Buffer b and A(record dependencies)
  2. At each move, use a discriminative classifier(such as SVM) to decide what kind of operation next time
transition_based parsing.png

Feature Extract

  1. In conventional ways, we represent word|tag|label as one-hot vectors, which is sparse and costs much time
  2. In Neural transition-based dependency parsing model, we represent word|tag|label as pre-trained word_embeddings
feature_extracted.png

Choices of featurs

The choice of Sw, St, Sl
Following (Zhang and Nivre, 2011), we pick a rich set of elements for our final parser. In de- tail, Sw contains nw = 18 elements: (1) The top 3 words on the stack and buffer: s1, s2, s3, b1, b2, b3; (2) The first and second leftmost / rightmost children of the top two words on the stack: lc1(si), rc1(si), lc2(si), rc2(si), i = 1, 2. (3) The leftmost of leftmost / rightmost of right- most children of the top two words on the stack: lc1(lc1(si)), rc1(rc1(si)), i = 1, 2.
We use the corresponding POS tags for St (nt = 18), and the corresponding arc labels of words excluding those 6 words on the stack/buffer for Sl (nl = 12). A good advantage of our parser is that we can add a rich set of elements cheaply, instead of hand-crafting many more indicator fea- tures.

Neural Networks

after word_embedding,we feed the input to hidden-layer, which has a novel activation function(f(x) = x^3),then use a softmax-layer to predict next transition


neural_model.png
最后編輯于
?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請聯(lián)系作者
【社區(qū)內(nèi)容提示】社區(qū)部分內(nèi)容疑似由AI輔助生成,瀏覽時(shí)請結(jié)合常識與多方信息審慎甄別。
平臺(tái)聲明:文章內(nèi)容(如有圖片或視頻亦包括在內(nèi))由作者上傳并發(fā)布,文章內(nèi)容僅代表作者本人觀點(diǎn),簡書系信息發(fā)布平臺(tái),僅提供信息存儲(chǔ)服務(wù)。

相關(guān)閱讀更多精彩內(nèi)容

  • 2018.5.19 今天周六,想要睡到自然醒,六點(diǎn)半閨女就跑我床前...
    張婉婷a閱讀 279評論 0 0
  • 轉(zhuǎn)眼間,高考已過境四月有余。依稀記得六月時(shí)的我們,吹著夏天的風(fēng)流著離別的淚說著不舍的...
    7號橘子橘了閱讀 322評論 0 1
  • 4月9日晨讀感悟(No.20) 昨天跟社群的一位朋友聊天,她想知道怎么能夠更有效的在社群里學(xué)習(xí),我覺得今天的晨讀給...
    楚丹丹閱讀 308評論 1 5
  • 歸鳥頡頏忽不見,眼眸盡處是南山。 熱霞襲卷天光去,冷日飛奪若木還。
    南山野客閱讀 255評論 0 2

友情鏈接更多精彩內(nèi)容