About me

Xianlong

Hello! My name is Xianlong Wang.

Email: xianlong.wang@gmail.com



Professional summary:

Seasoned data scientist with 6+ years of hands-on experience in machine learning (both supervised and unsupervised), data mining, and big data processing and modeling. Solid experience with predictive models, linear models, time series, nonparametric methods, SVM, random forests, boosting, and bagging. Well versed in programming with Python, R, SQL, C/C++, PHP, and JavaScript; SAS Certified Base and Advanced Programmer for SAS 9. Has successfully developed, both independently and collaboratively, Windows, Unix/Linux, and web-based software applications.

Education:

Doctor of Philosophy, Statistics, Oregon State University, 2009
Dissertation: Classification for Longitudinal Data
Master of Science, Applied Mathematics & Computer Science, Oregon State University, 2008
Thesis: Chen-Stein Method and Its Applications in DNA Sequence Analysis
Master of Science, Statistics, Oregon State University, 2005
Thesis: Joint Modeling Clustered Binary & Continuous Responses with Probit Models
Bachelor of Science, Material Science & Engineering, Northeast Forestry University, China, with honors, 1999
Thesis: Statistical Comparison of Properties of Birch from Different Varieties

Work experience:

07/15 – present Data Scientist — Marketing Analytics, Nordstrom Inc.
04/09 – 07/15 Staff Scientist — Biostat & Biomath program, Division of Public Health Sciences, Fred Hutchinson Cancer Research Center
09/05 – 04/09 Statistical Consultant — Statistics Department, Oregon State University
12/06 – 04/09 Research Assistant — Statistics Department, Oregon State University
06/05 – 09/06 Computer Administrator — Statistics Department, Oregon State University
09/04 – 04/09 Teaching Assistant — Statistics Department, Oregon State University
03/01 – 09/05 Research Assistant — Department of Wood Science & Engineering, Oregon State University

Selected publications:

L K Teixeira, X Wang, Y Li, S E Reed, X Wu, P Wang, S I Reed. 2015. “Cyclin E Deregulation Promotes Loss of Specific Genomic Regions.” Current Biology
B Zhang, et al. 2014. “Proteogenomic characterization of human colon and rectal cancer.” Nature
J Hu, X Wang, P Wang. 2014. “Testing Gene-Gene Interactions in Genome Wide Association Studies.” Genetic Epidemiology
J Kennedy, et al. 2013. “Demonstrating the feasibility of large-scale development of standardized assays to quantify human proteins.” Nature Methods
X Wang, L Qin, H Zhang, Y Zhang, L Hsu, P Wang. 2013. “A regularized multivariate regression approach for eQTL analysis.” Statistics in Biosciences
X Wang, A Qu. 2013. “Efficient classification for longitudinal data.” Computational Statistics and Data Analysis

Skill set:

  • R, C/C++, Java, Perl, Python, SQL, JavaScript, HTML, PHP, Android
  • Windows, Microsoft Office, Linux
  • SAS Certified Base Programmer for SAS 9
  • SAS Certified Advanced Programmer for SAS 9

R packages:

groupRemMap
Implements regularized multivariate regression for identifying master predictors using the GroupRemMap penalty.
remMap
Implements regularized multivariate regression for identifying master predictors using the remMap penalty.