Large
cohort study gained its popularity in
biomedical research and demonstrated its application in exploring
disease etiology and pathogenesis,improving the
prognosis of
disease,as well as reducing the
burden of diseases.
Data science is an interdisciplinary field that uses scientific
methods from
computer science and
statistics to extract insights or
knowledge from data in a specific domain.The results from the combination of the two would provide new evidence for developing the
strategies and
measures on
disease prevention and control.This
review included a brief introduction of
data science,descriptions on characteristics of large cohort data according to the development of the study design,and application of
data science at each stage of a large
cohort study,as well as prospected the application of
data science in the
future large
cohort studies.