Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
nlp:text-mining [2015/05/21 22:30]
plf1
nlp:text-mining [2015/05/21 22:43] (current)
plf1
Line 4: Line 4:
 == Publications == == Publications ==
  
-^ http://​link.springer.com/​chapter/​10.1007/​978-3-642-40722-2_7| Probabilistic Explicit Topic Modeling Using Wikipedia ​  ​^^ ​+[http://​link.springer.com/​chapter/​10.1007/​978-3-642-40722-2_7| Probabilistic Explicit Topic Modeling Using Wikipedia  ^^ 
 | [[media:​nlp:​120px-explicit-topics-wikipedia.png]] | Joshua Hansen, Eric Ringger, Kevin Seppi   ​| ​ | [[media:​nlp:​120px-explicit-topics-wikipedia.png]] | Joshua Hansen, Eric Ringger, Kevin Seppi   ​| ​
 | :::                             | '''​ In Proceedings of the International Conference of the German Society for Computational Linguistics and Language Technology (GSCL 2013) ''' ​                                       |  | :::                             | '''​ In Proceedings of the International Conference of the German Society for Computational Linguistics and Language Technology (GSCL 2013) ''' ​                                       | 
Line 12: Line 12:
  
  
-^ http://​darci.cs.byu.edu/​dheath/​pubs/​icsc_2013.pdf| Semantic Models as a Combination of Free Association Norms and Corpus-based Correlations ​  ​^^ ​+[http://​darci.cs.byu.edu/​dheath/​pubs/​icsc_2013.pdf| Semantic Models as a Combination of Free Association Norms and Corpus-based Correlations  ^^ 
 | [[media:​nlp:​120px-semantic-models.png]] | Derrall Heath, David Norton, Eric Ringger, Dan Ventura ​  ​| ​ | [[media:​nlp:​120px-semantic-models.png]] | Derrall Heath, David Norton, Eric Ringger, Dan Ventura ​  ​| ​
 | :::                             | '''​ In Proceedings of the Seventh IEEE International Conference on Semantic Computing (ICSC 2013) ''' ​                                       |  | :::                             | '''​ In Proceedings of the Seventh IEEE International Conference on Semantic Computing (ICSC 2013) ''' ​                                       | 
Line 19: Line 19:
  
  
-^ http://​proceedings.spiedigitallibrary.org/​proceeding.aspx?​articleid=1568659| Evaluating Supervised Topic Models in the Presence of OCR Errors ​  ​^^ ​+[http://​proceedings.spiedigitallibrary.org/​proceeding.aspx?​articleid=1568659| Evaluating Supervised Topic Models in the Presence of OCR Errors  ^^ 
 | [[media:​nlp:​120px-supervised-noisy-tm.png]] | Daniel Walker; Eric Ringger; Kevin Seppi   ​| ​ | [[media:​nlp:​120px-supervised-noisy-tm.png]] | Daniel Walker; Eric Ringger; Kevin Seppi   ​| ​
 | :::                             | '''​ The Conference on Document Recognition and Retrieval XX (DRR 2013) ''' ​                                       |  | :::                             | '''​ The Conference on Document Recognition and Retrieval XX (DRR 2013) ''' ​                                       | 
Line 27: Line 27:
  
  
-^ http://​www.abnms.org/​uai2012-apps-workshop/​papers/​WalkerEtal.pdf| Topics Over Nonparametric Time: A Supervised Topic Model Using Bayesian Nonparametric Density Estimation ​  ​^^ ​+[http://​www.abnms.org/​uai2012-apps-workshop/​papers/​WalkerEtal.pdf| Topics Over Nonparametric Time: A Supervised Topic Model Using Bayesian Nonparametric Density Estimation  ^^ 
 | [[media:​nlp:​120px-supervised-tonpt.png]] | Daniel Walker; Eric Ringger; Kevin Seppi   ​| ​ | [[media:​nlp:​120px-supervised-tonpt.png]] | Daniel Walker; Eric Ringger; Kevin Seppi   ​| ​
 | :::                             | '''​ Proceedings of the 9th Bayesian Modeling Applications Workshop (UAI 2012) ''' ​                                       |  | :::                             | '''​ Proceedings of the 9th Bayesian Modeling Applications Workshop (UAI 2012) ''' ​                                       | 
-| :::                             | We introduce a new supervised topic model that uses a nonparametric density estimator to model the distribution of real-valued +| :::                             | We introduce a new supervised topic model that uses a nonparametric density estimator to model the distribution of real-valued metadata given a topic. The model is similar to Topics Over Time, but replaces the beta distributions used in that model with a Dirichlet process mixture of normals. The use of a nonparametric density estimator allows for the fitting of a greater class of metadata densities. We compare our model with existing supervised topic models in terms of prediction and show that it is capable of discovering complex metadata distributions in both synthetic and real data.  | 
-metadata given a topic. The model is similar to Topics Over Time, but replaces the beta distributions used in that model with a +
-Dirichlet process mixture of normals. The use of a nonparametric density estimator allows for the fitting of a greater class of +
-metadata densities. We compare our model with existing supervised topic models in terms of prediction and show that it is capable of discovering complex metadata distributions in both synthetic and real data.  | +
  
  
  
-^ http://​flosshub.org/​sites/​flosshub.org/​files/​MacLean2011a.pdf| Knowledge Homogeneity and Specialization in the Apache HTTP Server Project ​  ​^^ ​+[http://​flosshub.org/​sites/​flosshub.org/​files/​MacLean2011a.pdf| Knowledge Homogeneity and Specialization in the Apache HTTP Server Project  ^^ 
 | [[media:​nlp:​120px-apache-knowledge.png]] | Alexander MacLean; Landon Pratt; Charles Knutson; Eric Ringger ​  ​| ​ | [[media:​nlp:​120px-apache-knowledge.png]] | Alexander MacLean; Landon Pratt; Charles Knutson; Eric Ringger ​  ​| ​
 | :::                             | '''​ Proceedings of the 7th International Conference on Open Source Systems (OSS 2011) ''' ​                                       |  | :::                             | '''​ Proceedings of the 7th International Conference on Open Source Systems (OSS 2011) ''' ​                                       | 
Line 45: Line 42:
  
  
-^ http://​sequoia.cs.byu.edu/​lab/​files/​pubs/​Pratt2011.pdf| Cliff Walls: An Analysis of Monolithic Commits Using Latent Dirichlet Allocation ​  ​^^ ​+[http://​sequoia.cs.byu.edu/​lab/​files/​pubs/​Pratt2011.pdf| Cliff Walls: An Analysis of Monolithic Commits Using Latent Dirichlet Allocation  ^^ 
 | [[media:​nlp:​120px-comment-analysis-lda.png]] | Landon Pratt; Alexander MacLean; Charles Knutson; Eric Ringger ​  ​| ​ | [[media:​nlp:​120px-comment-analysis-lda.png]] | Landon Pratt; Alexander MacLean; Charles Knutson; Eric Ringger ​  ​| ​
 | :::                             | '''​ Proceedings of the 7th International Conference on Open Source Systems (OSS 2011) ''' ​                                       |  | :::                             | '''​ Proceedings of the 7th International Conference on Open Source Systems (OSS 2011) ''' ​                                       | 
Line 52: Line 49:
  
  
-^ http://​cseweb.ucsd.edu/​~lvdmaaten/​workshops/​nips2010/​papers/​gardner.pdf| The Topic Browser: An Interactive Tool for Browsing Topic Models ​  ​^^ ​+[http://​cseweb.ucsd.edu/​~lvdmaaten/​workshops/​nips2010/​papers/​gardner.pdf| The Topic Browser: An Interactive Tool for Browsing Topic Models  ^^ 
 | [[media:​nlp:​120px-topic-browser.png]] | Matthew Gardner; Joshua Lutes; Jeff Lund; Josh Hansen; Dan Walker; Eric Ringger; Kevin Seppi   ​| ​ | [[media:​nlp:​120px-topic-browser.png]] | Matthew Gardner; Joshua Lutes; Jeff Lund; Josh Hansen; Dan Walker; Eric Ringger; Kevin Seppi   ​| ​
 | :::                             | '''​ Proceedings of the Workshop on Challenges of Data Visualization (NIPS 2010) ''' ​                                       |  | :::                             | '''​ Proceedings of the Workshop on Challenges of Data Visualization (NIPS 2010) ''' ​                                       | 
Line 61: Line 58:
  
  
-^ http://​nlp.cs.byu.edu/​~dan/​papers/​emnlp_2010.pdf| Evaluating Models of Latent Document Semantics in the Presence of OCR Errors ​  ​^^ ​+[http://​nlp.cs.byu.edu/​~dan/​papers/​emnlp_2010.pdf| Evaluating Models of Latent Document Semantics in the Presence of OCR Errors  ^^ 
 | [[media:​nlp:​120px-noisyocr-lds.png]] | Dan Walker; Bill Lund; Eric Ringger ​  ​| ​ | [[media:​nlp:​120px-noisyocr-lds.png]] | Dan Walker; Bill Lund; Eric Ringger ​  ​| ​
 | :::                             | '''​ EMNLP 2010 ''' ​                                       |  | :::                             | '''​ EMNLP 2010 ''' ​                                       | 
Line 67: Line 64:
  
  
-^ http://​contentdm.lib.byu.edu/​cdm/​singleitem/​collection/​ETD/​id/​1964/​rec/​1| Bisecting Document Clustering Using Model-Based Methods ​  ​^^ +[http://​contentdm.lib.byu.edu/​cdm/​singleitem/​collection/​ETD/​id/​1964/​rec/​1| Bisecting Document Clustering Using Model-Based Methods]  ​^^ 
 | [[media:​nlp:​140px-bisecting-clustering.png]] | Aaron Davis   ​| ​ | [[media:​nlp:​140px-bisecting-clustering.png]] | Aaron Davis   ​| ​
 | :::                             | '''​ Master'​s Thesis. ​ Advised by Eric Ringger. ''' ​                                       |  | :::                             | '''​ Master'​s Thesis. ​ Advised by Eric Ringger. ''' ​                                       | 
Line 74: Line 71:
  
  
-^ http://​faculty.cs.byu.edu/​~ringger/​CS601R/​papers/​WalkerRingger-Gibbs-kdd2008.pdf| Model-Based Document Clustering with a Collapsed Gibbs Sampler ​  ​^^ ​+[http://​faculty.cs.byu.edu/​~ringger/​CS601R/​papers/​WalkerRingger-Gibbs-kdd2008.pdf| Model-Based Document Clustering with a Collapsed Gibbs Sampler  ^^ 
 | [[media:​nlp:​120px-document-clustering.png]] | Daniel Walker; Eric Ringger ​  ​| ​ | [[media:​nlp:​120px-document-clustering.png]] | Daniel Walker; Eric Ringger ​  ​| ​
 | :::                             | '''​ In Proceedings of the Conference on Knowledge Discovery and Data Mining (KDD 2008) ''' ​                                       |  | :::                             | '''​ In Proceedings of the Conference on Knowledge Discovery and Data Mining (KDD 2008) ''' ​                                       | 
Line 81: Line 78:
  
  
-^ http://​synapse.cs.byu.edu/​papers/​drv.icsc2008.pdf| Sentiment Regression: Using Real-Valued Scores to Summarize Overall Document Sentiment ​  ​^^ ​+[http://​synapse.cs.byu.edu/​papers/​drv.icsc2008.pdf| Sentiment Regression: Using Real-Valued Scores to Summarize Overall Document Sentiment  ^^ 
 | [[media:​nlp:​150px-sentiment-regression.png]] | Adam Drake; Eric Ringger; Dan Ventura ​  ​| ​ | [[media:​nlp:​150px-sentiment-regression.png]] | Adam Drake; Eric Ringger; Dan Ventura ​  ​| ​
 | :::                             | '''​ In Proceedings of the Second IEEE International Conference on Semantic Computing (ICSC 2008) ''' ​                                       |  | :::                             | '''​ In Proceedings of the Second IEEE International Conference on Semantic Computing (ICSC 2008) ''' ​                                       | 
nlp/text-mining.1432247432.txt.gz · Last modified: 2015/05/21 22:30 by plf1
Back to top
CC Attribution-Share Alike 4.0 International
chimeric.de = chi`s home Valid CSS Driven by DokuWiki do yourself a favour and use a real browser - get firefox!! Recent changes RSS feed Valid XHTML 1.0