m for various of semantic relationships Overview Models...

Improving Context-Aware Semantic Relationships in Sparse Mobile DatasetsPeter Hansel, Nik Marda, William Yin

{pwhansel, nmarda, wyin}@stanford.edu

Overview➢ Cutting-edge NLP techniques often

fail to capture semantic context➢ Microblogging (and many other types

of mobile datasets) have inputs other than text

➢ How do we make relationships between sentences more semantically salient using multimodal data?

Data and Features➢ Politician Tweets Dataset [1]

○ Tweets associated with user locations

○ Coordinates collected using GeoPy Nominatim API

○ Date/time encoded as cyclical continuous feature

○ Data stripped of URLs and NLTK toolkit stopwords

➢ Tweet similarity data labeled by political science students and averaged

Existing Methods➢ Doc2Vec generates a sentence

embedding space allowing for comparison [2]

➢ CoSal uses contextually significant words in weighted BoW embeddings [3]

➢ Neither incorporates non-textual data

Models

Iterative Minimization

Discussion

Future Directions

References

➢ Test on tweets from local politicians and see if they differ from national politicians (controlling for location)

➢ Distort the word embedding to directly incorporate information from multimodal features

➢ Beyond Twitter and microblogging: other extended data

B. Kay. “Politician Tweets.” data.world. 2018.Q. Le and T. Mikolov. "Distributed representations of sentences and documents." ICML. 2014.E. Zelikman. "Context is Everything: Finding Meaning Statistically in Semantic Spaces." arXiv (2018).L. Maaten and G. Hinton. "Visualizing data using t-SNE." JMLR. 2008.

➢ Iterative Minimization - Given embeddings a, b, and multimodal features ma,i , mb,i , iteratively optimized various distance functions di for various multimodal features:

➢ PCA for dimensionality reduction of sentence embedding space➢ t-Distributed Stochastic Neighbor Embedding (t-SNE) for constructing

visualizations and determining relative similarity [4]

➢ Manually-annotated comparisons

➢ Distance function

➢ Iteratively optimizing objective○ Discrete ranking system

means no continuous gradient○ Minimizing this function:

➢ Scaling outputs of distance functions / integrating into 𝑓 above

PCA and t-SNE

“RT @CantorPress: House Republicans Unveil Debt Plan via @NROCorner #tcot #GOP #2Futures” - Peoria, AZ, 2011-07-25 18:58:42“I'll be going on @foxnews at 11:20 (ET) to discuss the current negotiations of the #debtceiling. Check it out!” - Arizona, 2011-07-13 14:29:17

➢ Multimodal data improves recognition of semantic relationships

➢ Especially valuable when tweets are about the same event but lack textual similarity

➢ Iterative Minimization has an upper bound on performance

Including Multimodal FeaturesExcluding Multimodal Features

Green: “Good news- The House passed a bill to exempt those who lost coverage due to the failure of #Obamacare's co-ops from the individual mandate.” - Janesville, WI, 2016-10-03 20:39:43Blue: “RT @DrPhilRoe: Bottom line: Obamacare is NOT working, especially not in Tennessee. Tennesseans deserve a #BetterWay.” - Jefferson, LA, 2016-10-06 19:34:56Red: “It is too soon to rule out impacts to Florida. Please visit so that you and your family can get prepared.” - The Sunshine State, 2016-10-01 21:16:00

Labeled data Rankings

Iterative Minimization Rankings

m for various of semantic relationships Overview Models...

Documents

Fine-Grained Sentiment Analysis of Restaurant Customer …cs229.stanford.edu/proj2018/report/195.pdf · 2019-01-06 · Recent years have seen a substantial progress in NLP tasks with

How Real is Real? Quantitative and Qualitative comparison ...cs229.stanford.edu/proj2018/report/102.pdfThe fully connected NN takes in the input image with di-mensions 1 x 28 x 28,

SNE SIMULATION NOTES EUROPE...SNE E DITORIAL - C ONTENT - I NFORMATION SNE Editorial Board SNE - Simulation Notes Europe is advised and supervised by an international scientific editorial

A “generative” model for computing electromagnetic ﬁeld …cs229.stanford.edu/proj2018/report/233.pdfA “generative” model for computing electromagnetic ﬁeld solutions Ben

CS 229 Project Final Report A Method for Modifying Facial ...cs229.stanford.edu/proj2018/report/42.pdf · when faces are intelligently selected from the images. Cropping out faces

· -!JdBd pun as-Inme.nxa sne Bpsnl uoasoq ueur os uop so pun ' sne Jne -JOSSOq goop upspld sne pun opuoqoplsne uonpuos -08 nz" pun oywos sne pof sne pun soleqzanusne JnB

CS229 Final Project Report - Machine learningcs229.stanford.edu/proj2011/CS229 Final Project Report.pdf1 CS229 Final Project Report A Multi-Task Feature Learning Approach to Human

SNE vacancies (May 2019) Ref. Deadline Cost-free SNE ... · SNE vacancies (May 2019) Ref. Deadline Cost-free SNE Comment CLIMA-C-3 25.07.2019 COMP-D-6 25.06.2019 shortened deadline

Improving Product Categorization from Label …cs229.stanford.edu/proj2018/report/258.pdfNode2Vec allows for random walks to be selected ”between“ Depth-First Search and Breadth-First

doodles Input (28x28x1) Models produced scores that were ...cs229.stanford.edu/proj2018/poster/98.pdf · Quick, Draw! Doodle Recognition Kristine Guo (kguo98@stanford.edu), James

Convolutional Neural Network Approach CS 229 Machine ...cs229.stanford.edu/proj2018/poster/58.pdf · the imagehas been resized in 256 x 256 x 3 using the cv2 packagein python. Table

CS229 Supervised Learning - Computer science

CS229 - Probability Theory Review

CS229 MACHINE LEARNING, STANFORD UNIVERSITY, …cs229.stanford.edu/proj2016/report/FegelisHebert...CS229 MACHINE LEARNING, STANFORD UNIVERSITY, DECEMBER 2016 3 t= f(k x t +h y t) (6)

HITPREDICT: PREDICTING HIT SONGS USING SPOTIFY DATA …cs229.stanford.edu/proj2018/report/16.pdf · 2019-01-06 · (hits) and negative (non-hits) examples, we removed two thirds of

Discussion & Future Work Models for Willingness Problem …cs229.stanford.edu/proj2018/poster/120.pdf · 2019. 1. 6. · 1. Logistic Regression (binary configuration with L 2 regu-

Quesreppol sne

Abstract - cs229.stanford.edu

SNE Thesis

MELJUN CORTES Cs229 electronics for_cs_switching_theory_updated_hours