Coupled Bayesian sets algorithm for semi-supervised learning and information extraction

Verma, S.; Hruschka, Jr. E.R.

Coupled Bayesian sets algorithm for semi-supervised learning and information extraction

Verma, S.; Hruschka, Jr. E.R.

URI: http://localhost:8080/xmlui/handle/123456789/1779

Date: 2012

Abstract:

Our inspiration comes from Nell (Never Ending Language Learning), a computer program running at Carnegie Mellon University to extract structured information from unstructured web pages. We consider the problem of semi-supervised learning approach to extract category instances (e.g. country(USA), city(New York)) from web pages, starting with a handful of labeled training examples of each category or relation, plus hundreds of millions of unlabeled web documents. Semi-supervised approaches using a small number of labeled examples together with many unlabeled examples are often unreliable as they frequently produce an internally consistent, but nevertheless, incorrect set of extractions. We believe that this problem can be overcome by simultaneously learning independent classifiers in a new approach named Coupled Bayesian Sets algorithm, based on Bayesian Sets, for many different categories and relations (in the presence of an ontology defining constraints that couple the training of these classifiers). Experimental results show that simultaneously learning a coupled collection of classifiers for random 11 categories resulted in much more accurate extractions than training classifiers through original Bayesian Sets algorithm, Naive Bayes, BaS-all and Coupled Pattern Learner (the category extractor used in NELL).

Show full item record

Files in this item

Name: Coupled-Bayesian- ...

Size: 311.6Kb

Format: PDF

View/Open

This item appears in the following Collection(s)

Department of Electronics Engineering [4]
This papers published by students/Research scholar/ faculty

Coupled Bayesian sets algorithm for semi-supervised learning and information extraction

Coupled Bayesian sets algorithm for semi-supervised learning and information extraction

Abstract:

Files in this item

This item appears in the following Collection(s)

Search in IDR

Browse

All of IDR

This Collection

My Account