Semi-supervised Learning with Weakly-Related Unlabeled Data: Towards Better Text Categorization

Advisor: Hsin-His ChenReporter: Chi-Hsin YuDate: 2009.09.24From NIPS 2008

Outlines

•Introduction•Related Work•Review SVM•SSLW (Semi-supervised Learning with Weakly-

Related Unlabeled Data)•Experiments•Conclusion

Introduction•Semi-supervised Learning (SSL)

▫takes advantage of a large amount of unlabeled data to enhance classification accuracy

•Cluster assumption▫puts the decision boundary in low density areas

without crossing the high density regions▫is only meaningful when the labeled and

unlabeled data are somehow closely related If they were weakly related, the labeled and

unlabeled data could be well separated

Introduction (conti.)

•This paper aiming to▫Identify a new data representation (in

feature space) By constructing a new kernel function

▫Advantages Informative to the target class(category) consistent with the feature coherence

patterns exhibiting in the weakly related unlabeled data

Related Work

•The two types of semi-supervised learning (SSL)▫Transductive SSL

labels only for the available unlabeled data▫Inductive SSL

also learns a classifier that can be used to predict labels for new data

SVM • Notations

▫ £ = {(x1, y1), . . . , (xl, yl)} Labeled documents

▫ U= {(xl+1, yl+1), . . . , (xn, yn)} unlabeled documents

▫ Document-word matrix D=(d1, d2, …, dn), di ∈ NV

V: the size of the vocabulary di: word-frequency vector for document i

▫ Word-Document matrix G=(g1, g2, …, gV) gi=(gi,1, gi,2,…,gi,n)

K=DTD, K ∈ Rnxn

Document pairwise similarity

α。 y=(α1y1, α2y2, …, αnyn) element-wise product

•K=DTD K=DTRD▫R ∈ RVxV : word-correlation matrix

•Two ways to construct the matrix RG=UW, W=(w1,w2,…wV)wi: internal representation o the i-th word R= WTW, T=UUT

the top p right eigenvectors of Gαi ≥0, ξ ≥0

SSLW (conti.)

•An Efficient Algorithm of SSLW

Experiments• Corpus

▫Reuters-21578 (9400 docs),▫WebKB (4518 docs)▫TREC AP88: an external information source for both

datasets (1000 documents, randomly selected)

Evaluation Methodology

•4 positive + 4 negative samples from each training set

•AUR (area under the ROC curve)•Averaging the AUR (ten times of each

experiment)

Conclusion

•SSLW ▫Significantly improves both the accuracy

and the reliability of text categorization, given a small training pool and the additional

unlabeled data that are weakly related to the test bed.

Thanks!!

Semi-supervised Learning with Weakly-Related Unlabeled Data: Towards Better Text Categorization

Documents

UntrimmedNets for Weakly Supervised Action … · UntrimmedNets for Weakly Supervised Action Recognition and Detection ... Weakly supervised action recognition and detection: during

Hazard Categorization

DeepAtlas: Joint Semi-Supervised Learning of Image ...Fig.1: DeepAtlas for joint learning of weakly supervised registration and semi-supervised segmentation. Unlabeled moving/target

Inferring semantics from lyrics using weakly annotated data · Advances in automatic categorization of song lyrics are therefore pertinent to MIR and Information Retrieval (IR) overall

Unlabeled Far-field Deeply Subwavelength Topological

Adversarial Knowledge Transfer from Unlabeled Data

YOUCAT : WEAKLY SUPERVISED YOUTUBE VIDEO …people.mpi-inf.mpg.de/~smukherjee/slides/youcat-coling2012.pdf · YOUCAT : WEAKLY SUPERVISED YOUTUBE VIDEO CATEGORIZATION SYSTEM FROM META

Combining labeled and unlabeled data for text categorization with a large number of categories

ARMOURED: ADVERSARIALLY ROBUST MODELS USING UNLABELED …

Combining labeled and unlabeled data for text categorization with a large number of categories Rayid Ghani KDD Lab Project

Text categorization

Techniques For Exploiting Unlabeled Data

Concepts & Categorization

Learning from Positive and Unlabeled Examples

Toward Self-Supervised Object Detection in Unlabeled Videos · While weakly supervised methods accept well curated and "clean" labels (correct labels for all the images in the train-set),

Stochastic Unsupervised Learning on Unlabeled Data

Set 1 Unlabeled

Bootstrapping Information Extraction with Unlabeled Data

Representation Learning Theory with Unlabeled Data

Classification of unlabeled data: