Seminars in Arthritis and Rheumatism
Volume 40, Issue 5 , Pages 413-420 , April 2011

Validation of Psoriatic Arthritis Diagnoses in Electronic Medical Records Using Natural Language Processing

  • Thorvardur Jon Love, MD

      Affiliations

    • Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts
    • Corresponding Author InformationAddress reprint requests to Thorvardur Jon Love, MD, Brigham and Women's Hospital, Harvard Medical School Boston, MA 02115
  • ,
  • Tianxi Cai, ScD

      Affiliations

    • Harvard School of Public Health, Boston, Massachusetts
  • ,
  • Elizabeth W. Karlson, MD

      Affiliations

    • Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts

References 

  1. Shbeeb M, Uramoto KM, Gibson LE, O'Fallon WM, Gabriel SE. The epidemiology of psoriatic arthritis in Olmsted County, Minnesota, USA, 1982-1991. J Rheumatol. 2000;27(5):1247–1250
  2. Madland TM, Apalset EM, Johannessen AE, Rossebo B, Brun JG. Prevalence, disease manifestations, and treatment of psoriatic arthritis in Western Norway. J Rheumatol. 2005;32(10):1918–1922
  3. Love TJ, Gudbjornsson B, Gudjonsson JE, Valdimarsson H. Psoriatic arthritis in Reykjavik, Iceland: prevalence, demographics, and disease course. J Rheumatol. 2007;34(10):2082–2088
  4. Singh JA, Holmgren AR, Noorbaloochi S. Accuracy of Veterans Administration databases for a diagnosis of rheumatoid arthritis. Arthritis Rheum. 2004;51(6):952–957
  5. Thomas SL, Edwards CJ, Smeeth L, Cooper C, Hall AJ. How accurate are diagnoses for rheumatoid arthritis and juvenile idiopathic arthritis in the general practice research database?. Arthritis Rheum. 2008;59(9):1314–1321
  6. Gabriel SE, Crowson CS, O'Fallon WM. A mathematical model that improves the validity of osteoarthritis diagnoses obtained from a computerized diagnostic database. J Clinical Epidemiol. 1996;49(9):1025–1029
  7. Arbuckle MR, McClain MT, Rubertone MV, Scofield RH, Dennis GJ, James JA, et al. Development of autoantibodies before the clinical onset of systemic lupus erythematosus. N Engl J Med. 2003;349(16):1526–1533
  8. Singh JA, Holmgren AR, Krug H, Noorbaloochi S. Accuracy of the diagnoses of spondylarthritides in veterans affairs medical center databases. Arthritis Rheum. 2007;57(4):648–655
  9. Terrin N, Schmid CH, Griffith JL, D'Agostino RB, Selker HP. External validity of predictive models: a comparison of logistic regression, classification trees, and neural networks. J Clin Epidemiol. 2003;56(8):721–729
  10. Breiman L. Random Forests. Machine Learning 2001. 2001;45(1):27
  11. Taylor W, Gladman D, Helliwell P, Marchesoni A, Mease P, Mielants H. Classification criteria for psoriatic arthritis: development of new criteria from a large international study. Arthritis Rheum. 2006;54(8):2665–2673
  12. McGonagle D, Tan AL. Are the classification criteria for psoriatic arthritis better than existing criteria for diagnosing psoriatic arthritis? Comment on the article by Taylor et al. Arthritis Rheum. 2007;56(2):699–700[author reply]
  13. Chandran V, Schentag CT, Gladman DD. Sensitivity of the classification of psoriatic arthritis criteria in early psoriatic arthritis. Arthritis Rheum. 2007;57(8):1560–1563
  14. D'Angelo S, Mennillo GA, Cutro MS, Leccese P, Nigro A, Padula A, et al. Sensitivity of the classification of psoriatic arthritis criteria in early psoriatic arthritis. J Rheumatol. 2009;36(2):368–370
  15. Katz JN, Barrett J, Liang MH, Bacon AM, Kaplan H, Kieval RI, et al. Sensitivity and positive predictive value of Medicare Part B physician claims for rheumatologic diagnoses and procedures. Arthritis Rheum. 1997;40(9):1594–1600
  16. Love TJ. Medical Record Browser. 0.4 ed. 2008.
  17. Embarcadero. Turbo Delphi 2006 Explorer. 2006 ed. Embarcadero 2006: RAD Environment.
  18. Bureau A, Dupuis J, Falls K, Lunetta KL, Hayward B, Keith TP, et al. Identifying SNPs predictive of phenotype using random forests. Gen Epidemiol. 2005;28(2):171–182
  19. Bureau A, Dupuis J, Hayward B, Falls K, Van Eerdewegh P. Mapping complex traits using Random Forests. BMC Gen. 2003;4(Suppl 1):S64
  20. Perdiguero-Alonso D, Montero FE, Kostadinova A, Raga JA, Barrett J. Random forests, a novel approach for discrimination of fish populations using parasites as biological tags. Int J Parasitol. 2008;38(12):1425–1434
  21. Sikic M, Tomic S, Vlahovicek K. Prediction of protein-protein interaction sites in sequences and 3D structures by random forests. PLoS Comput Biol. 2009;5(1):e1000278
  22. Sun YV, Cai Z, Desai K, Lawrance R, Leff R, Jawaid A, et al. Classification of rheumatoid arthritis status with candidate gene and genome-wide single-nucleotide polymorphisms using random forests. BMC Proc. 2007;1(Suppl 1):S62
  23. Ward MM, Pajevic S, Dreyfuss J, Malley JD. Short-term prediction of mortality in patients with systemic lupus erythematosus: classification of outcomes using random forests. Arthritis Rheum. 2006;55(1):74–80
  24. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44(3):837–845
  25. Lindbäck J. rocTest. 0.3-0 ed. Uppsala 2009.
  26. Copeland KT, Checkoway H, McMichael AJ, Holbrook RH. Bias due to misclassification in the estimation of relative risk. Am J Epidemiol. 1977;105(5):488–495
  27. Kelly H, Bull A, Russo P, McBryde ES. Estimating sensitivity and specificity from positive predictive value, negative predictive value and prevalence: application to surveillance systems for hospital-acquired infections. J Hosp Infect. 2008;69(2):164–168
  28. Team RDC. R: A language and environment for statistical computing. Vienna, Austria; 2008.
  29. Liaw A, Wiener M. Classification and regression by randomForest. R News. 2002;2(3):18–22
  30. Harrell FE. Harrell miscellaneous. In: Dupont C, ed. 3.6-0 ed. 2009.

 The authors have no conflicts of interest to disclose.

PII: S0049-0172(10)00075-2

doi: 10.1016/j.semarthrit.2010.05.002

Seminars in Arthritis and Rheumatism
Volume 40, Issue 5 , Pages 413-420 , April 2011