Research memo

  • Mistakes in the model construction with Random Forest
    • Checking whether a Jaccard similarity exists should be the first step
      • There is no way to put a weight on a feature, so the data can only be split manually
      • The threshold above which every ANI has a corresponding Jaccard similarity is 0.9, but we can still set it to 0.95 to be conservative
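
A minimal sketch of the manual split described in the notes above, assuming each record is a pair with an ANI value and an optional Jaccard similarity. The names `Pair`, `split_pairs`, and `ANI_THRESHOLD`, as well as the sample values, are hypothetical and only illustrate the idea; they are not taken from the actual pipeline.

```python
from typing import List, NamedTuple, Optional, Tuple


class Pair(NamedTuple):
    """Hypothetical record: one comparison with its two similarity measures."""
    ani: float                 # ANI expressed as a fraction (0.0 to 1.0)
    jaccard: Optional[float]   # None when no Jaccard similarity exists


# Conservative cutoff from the memo (0.95 rather than the observed 0.9).
ANI_THRESHOLD = 0.95


def split_pairs(
    pairs: List[Pair], ani_threshold: float = ANI_THRESHOLD
) -> Tuple[List[Pair], List[Pair]]:
    """Split records before any model is trained.

    The first group holds pairs that have a Jaccard similarity and an ANI
    at or above the threshold; everything else goes to the second group.
    """
    with_jaccard: List[Pair] = []
    rest: List[Pair] = []
    for pair in pairs:
        if pair.jaccard is not None and pair.ani >= ani_threshold:
            with_jaccard.append(pair)
        else:
            rest.append(pair)
    return with_jaccard, rest


# Made-up example values, for illustration only.
pairs = [Pair(0.97, 0.82), Pair(0.93, 0.40), Pair(0.85, None)]
high, rest = split_pairs(pairs)
```

Each group could then be passed to its own Random Forest, so the presence of a Jaccard similarity acts as the first decision without relying on the forest to pick it.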
Written on September 27, 2017