18. Applicaton example: Photo OCR

Applicaton example: Photo OCR

Problem description and pipeline

Photo OCR pipeline:

  1. Text detection
  2. Character segmentation
  3. Character classification

Sliding windows

Text detection:

sliding window detection:

  • step-size/stride

Character segmentation:

1D Sliding window for character segmentation

Getting lots of data: Artificial data synthesis

  • Create new data.
  • Synthesizing data by introducing distortions
    • Distortion introduced should be representation of the type of noise/distortions in the test set.
    • Usually does not help to add purely random/meaningless noise to your data.

Ceiling analysis: What part of the pipeline to work on next

Estimating the errors due to each compoent

What part of the pipeline should you spend the most time trying to improve?

?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請聯(lián)系作者
【社區(qū)內(nèi)容提示】社區(qū)部分內(nèi)容疑似由AI輔助生成,瀏覽時請結(jié)合常識與多方信息審慎甄別。
平臺聲明:文章內(nèi)容(如有圖片或視頻亦包括在內(nèi))由作者上傳并發(fā)布,文章內(nèi)容僅代表作者本人觀點,簡書系信息發(fā)布平臺,僅提供信息存儲服務(wù)。

相關(guān)閱讀更多精彩內(nèi)容

  • 今天感恩節(jié)哎,感謝一直在我身邊的親朋好友。感恩相遇!感恩不離不棄。 中午開了第一次的黨會,身份的轉(zhuǎn)變要...
    余生動聽閱讀 10,798評論 0 11
  • 彩排完,天已黑
    劉凱書法閱讀 4,452評論 1 3
  • 表情是什么,我認為表情就是表現(xiàn)出來的情緒。表情可以傳達很多信息。高興了當(dāng)然就笑了,難過就哭了。兩者是相互影響密不可...
    Persistenc_6aea閱讀 129,413評論 2 7

友情鏈接更多精彩內(nèi)容