Cooperative Institute for Research in Environmental Sciences at the University of Colorado Boulder

Cryospheric and Polar Processes Seminar

Applications of Machine Learning and Natural Language Processing in the Polar Deep Insights Project by Dr. Siri Jodha Khalsa, NSIDC research scientist.  

ABSTRACT:  Polar Deep Insights, an NSF-Funded EarthCube project that is building an end-to-end system to collect, analyze and make interactive the wealth of polar-related textual and scientific data mined from the Web. Researchers apply their domain knowledge to train machine-learning (ML) models that power an intuitive search engine to be utilized for polar research. The system includes an interface to analyze and visualize results using Banana and D3.js, giving researchers a better understanding of the relationship of the discovered content within the Polar data ecosystem.  This talk will provide a brief introduction to the tools and techniques of information retrieval and data science that we are being developed and applied: Apache Sparkler, Tika, and Solr; ML models (SVM, MLP ANN, Random Forest and Naive Bayes classifiers); Gensim and spaCy libraries for Natural Language Processing; and a concept editor that allows users to build an 'ontology-of-interest' to derive insights using the Insight Visualizer.

From a computer:  
Or iPhone one-tap :
    US: +16465588656,,5409618610#  
Or Telephone:
    Dial(for higher quality, dial a number based on your current location): 
        US: +1 646 558 8656  
    Meeting ID: 540 961 8610


Wednesday, October 3, 2018
11:00 am to 12:00 pm




  • CIRES employees
  • CU Boulder employees
  • General Public
  • NOAA employees
  • Science collaborators
  • Open to Public


Mistia Zuckerman


NSIDC, RL-2, Room 155/153