• Home
  • About
    • Sihan (Mint) Min photo

      Sihan (Mint) Min

      A Personal Website

    • Learn More
    • Email
    • LinkedIn
    • Instagram
    • Github
  • Academic Projects
  • DSLR
  • Instant Films
  • Dance
  • Alcohol

Machine Learning

09 May 2022

Node Embedding for Large-Scaled Knowledge Graph

  • Feb 2020 – Apr 2020
  • Discovered a better embedding method for nodes from a large-scale, real-world Knowledge Graph named YAGO, and completed various downstream tasks such as link prediction and node classification using PyTorch framework.
  • Initialized nodes with TransE embedding, sampled balanced subgraph based on edge-type and neighborhood, and fed subgraphs into mini batches to train a Relational Graph Convolutional Network (RGCN) with embedding output.

Skull Stripping Using Semi-Supervised Deep Learning

  • Apr 2019 - June 2019
  • Developed an automatic segmentation solution for brain MRI to keep the essential tissue with deep learning models.
  • Employed OSVOS model with Tensorflow using fully convolutional neural network (FCN): pre-trained parent network for basic foreground segmentation; fine-tuned the network on ground truth image pairs to minimize pixel-wise cross entropy loss.
  • Used less than 20 sets of segmented MRI, and got over 90% accuracy in pixel-wise comparison on testing sets.

Movie Recommendation by Rating Prediction

  • Jan 2019 - Mar 2019
  • Built personalized recommendation system using user movie ratings with multiple machine learning methods in Python.
  • After experiments on different model combinations, used ensemble of two best models to predict user rating: SVD model on user movie pairs’ ratings and user-based Linear Regression with movie tags information.
  • Achieved root MSE of 0.822 on over 4,000,000 user-movie pairs in final rating prediction.

Political Sentiments Analysis on Reddit Text

  • Apr 2018 - June 2018
  • Aggregated people’s attitudes towards the two Parties and Donald Trump by NLP on Reddit posts and comments.
  • Fit tokenized and lemmatized sentences from Reddit text into Machine Learning model (Logistic Regression) in Python, which learns to label sentiments of positive/negative towards two parties and Donald Trump.
  • Combined queries to MySQL database, and visualized clear political sentiments fluctuation over states in time series graph with R.


machine learning Share Tweet +1