2018-06-06

數(shù)據(jù)挖掘技術(shù)在醫(yī)學(xué)數(shù)據(jù)中的應(yīng)用
中文摘要
隨著大數(shù)據(jù)技術(shù)與人工智能技術(shù)的發(fā)展,數(shù)據(jù)挖掘技術(shù)被應(yīng)用在越來越多的領(lǐng)域之中,其中不乏金融、教育、醫(yī)療等行業(yè)。其中,在醫(yī)療行業(yè)的應(yīng)用上又包括精準(zhǔn)醫(yī)療、基因工程、基因測序等學(xué)科前沿領(lǐng)域中。本文則是以數(shù)據(jù)挖掘的模型算法在醫(yī)學(xué)臨床數(shù)據(jù)和醫(yī)院信息系統(tǒng)數(shù)據(jù)中所發(fā)揮的作用進行了論述。
數(shù)據(jù)挖掘技術(shù)在醫(yī)學(xué)數(shù)據(jù)中應(yīng)用的目的是從大量的醫(yī)學(xué)數(shù)據(jù)中挖掘出潛在的且與致病有關(guān)的因素,并且在此過程中獲取到更多的信息、模型、關(guān)聯(lián)規(guī)則等,將這些挖掘出的成果應(yīng)用于臨床,從而能夠幫助醫(yī)生進行更快更準(zhǔn)的疾病判斷。本文的主要工作如下:
首先,本文第二章詳細(xì)闡述了醫(yī)學(xué)數(shù)據(jù)的特點以及常用的數(shù)據(jù)挖掘算法的理論基礎(chǔ),方法結(jié)構(gòu)。還介紹了各種數(shù)據(jù)挖掘模型的簡單解釋。
其次,本文主要通過一個乳腺癌相關(guān)的醫(yī)學(xué)數(shù)據(jù)集,探索了數(shù)據(jù)挖掘中的logistic回歸分析預(yù)測和隨機森林(決策樹)分類預(yù)測技術(shù)在醫(yī)學(xué)數(shù)據(jù)上的分類功能。并在分類結(jié)果上取得較好的分類精確度。之后可以作為輔助醫(yī)生的一種診斷方案,對被預(yù)測得乳腺癌概率較高的患者可以重點觀察,重點診斷。
最后,本文對兩個數(shù)據(jù)集中所得出的分類和預(yù)測結(jié)果進行解釋說明,并提出相關(guān)的對策和改進意見。并在文末提出了關(guān)于本文的不足與將來進行改進的方向。

關(guān)鍵詞:數(shù)據(jù)挖掘;回歸分析;決策樹;乳腺癌

The application of data mining technology in medical data.
Abstract in Chinese
The application of data mining has become a hot topic with the development of big data technology and Artificial Intelligence Technology, and it has been applied in a great many fields, such as financial industry, educational industry, healthcare industry and other industries. Among them, the application of healthcare industry covers precision medicine, gene engineering,gene sequencing and other frontier fields . This article fully discusses the role of model algorithm of data mining in medical clinical data and hospital information system data.
The purpose of data mining technology applied in the medical data is to dig out the potential factors that are related to the disease from a large number of medical data, and to get more information, models, association rules and so on from the process. the excavated achievements are used for clinical medicine ,which can help doctors to judge disease faster and more accurate . The main work of this article is as follows:
First of all, the second chapter ot this article elaborates the characteristics of medical data and common theoretical basis and method structure of data mining algorithms. A brief explanation of various data mining models is also introduced.
Secondly, this article mainly explores the classificatory function of the logistic regression analysis and random forest (decision tree) in data mining ,through a breast cancer related medical data sets . Moreover, the classification results acquireed better classification accuracy. It can be used as a diagnostic program to assist doctors to concentrate on observating patients with a higher probability of breast cancer.
Finally, this article makes an explaination for the classification and prediction results of two data sets, and puts forward relevant countermeasures and suggestions. At the end of the article, the author comes up with the deficiency and the direction of the future improvement.

Key words: Data mining; Regression analysis; Decision tree; Breast cancer

?著作權(quán)歸作者所有,轉(zhuǎn)載或內(nèi)容合作請聯(lián)系作者
【社區(qū)內(nèi)容提示】社區(qū)部分內(nèi)容疑似由AI輔助生成,瀏覽時請結(jié)合常識與多方信息審慎甄別。
平臺聲明:文章內(nèi)容(如有圖片或視頻亦包括在內(nèi))由作者上傳并發(fā)布,文章內(nèi)容僅代表作者本人觀點,簡書系信息發(fā)布平臺,僅提供信息存儲服務(wù)。

相關(guān)閱讀更多精彩內(nèi)容

  • rljs by sennchi Timeline of History Part One The Cognitiv...
    sennchi閱讀 7,854評論 0 10
  • 你還愛我嗎? 愛是什么? 是忘人憂憐的悲切 還是魂斷藍橋的無聲
    孤獨的浪者閱讀 237評論 0 2
  • 世界,是自然界和人類社會的一切事物的總和。 我想,也包括上帝。 在最后的審判到來之前,眾多的死者只能靠睡覺或打牌打...
    8b0bf5e2fc28閱讀 6,997評論 0 3
  • 單詞15 每天半夜熱的腦袋癢,但是也不至于吹電扇空調(diào) 大早起來就想吃一碗辣辣的肉粉,最后走到公司了也沒有,只好在7...
    是魔王大人閱讀 128評論 3 0
  • 流水潺潺輕聲響, 幽做茶海靜觀賞, 遙似伊人迎面來, 淡淡脂粉撲鼻香。
    范春龍閱讀 178評論 0 1

友情鏈接更多精彩內(nèi)容