Efficient inference and learning in a large knowledge base

Reasoning with extracted information using a locally groundable first-order probabilistic logic

Verfasser / Beitragende:
[William Wang, Kathryn Mazaitis, Ni Lao, William Cohen]
Ort, Verlag, Jahr:
2015
Enthalten in:
Machine Learning, 100/1(2015-07-01), 101-126
Format:
Artikel (online)
ID: 605478198
LEADER caa a22 4500
001 605478198
003 CHVBK
005 20210128100404.0
007 cr unu---uuuuu
008 210128e20150701xx s 000 0 eng
024 7 0 |a 10.1007/s10994-015-5488-x  |2 doi 
035 |a (NATIONALLICENCE)springer-10.1007/s10994-015-5488-x 
245 0 0 |a Efficient inference and learning in a large knowledge base  |h [Elektronische Daten]  |b Reasoning with extracted information using a locally groundable first-order probabilistic logic  |c [William Wang, Kathryn Mazaitis, Ni Lao, William Cohen] 
520 3 |a One important challenge for probabilistic logics is reasoning with very large knowledge bases (KBs) of imperfect information, such as those produced by modern web-scale information extraction systems. One scalability problem shared by many probabilistic logics is that answering queries involves "grounding” the query—i.e., mapping it to a propositional representation—and the size of a "grounding” grows with database size. To address this bottleneck, we present a first-order probabilistic language called ProPPR in which approximate "local groundings” can be constructed in time independent of database size. Technically, ProPPR is an extension to stochastic logic programs that is biased towards short derivations; it is also closely related to an earlier relational learning algorithm called the path ranking algorithm. We show that the problem of constructing proofs for this logic is related to computation of personalized PageRank on a linearized version of the proof space, and based on this connection, we develop a provably-correct approximate grounding scheme, based on the PageRank-Nibble algorithm. Building on this, we develop a fast and easily-parallelized weight-learning algorithm for ProPPR. In our experiments, we show that learning for ProPPR is orders of magnitude faster than learning for Markov logic networks; that allowing mutual recursion (joint learning) in KB inference leads to improvements in performance; and that ProPPR can learn weights for a mutually recursive program with hundreds of clauses defining scores of interrelated predicates over a KB containing one million entities. 
540 |a The Author(s), 2015 
690 7 |a Probabilistic logic  |2 nationallicence 
690 7 |a Personalized PageRank  |2 nationallicence 
690 7 |a Scalable learning  |2 nationallicence 
700 1 |a Wang  |D William  |u School of Computer Science, Carnegie Mellon University, 5000 Forbes Ave., 15213, Pittsburgh, PA, USA  |4 aut 
700 1 |a Mazaitis  |D Kathryn  |u School of Computer Science, Carnegie Mellon University, 5000 Forbes Ave., 15213, Pittsburgh, PA, USA  |4 aut 
700 1 |a Lao  |D Ni  |u Google Inc., 1600 Amphitheatre Parkway, 94043, Mountain View, CA, USA  |4 aut 
700 1 |a Cohen  |D William  |u School of Computer Science, Carnegie Mellon University, 5000 Forbes Ave., 15213, Pittsburgh, PA, USA  |4 aut 
773 0 |t Machine Learning  |d Springer US; http://www.springer-ny.com  |g 100/1(2015-07-01), 101-126  |x 0885-6125  |q 100:1<101  |1 2015  |2 100  |o 10994 
856 4 0 |u https://doi.org/10.1007/s10994-015-5488-x  |q text/html  |z Onlinezugriff via DOI 
898 |a BK010053  |b XK010053  |c XK010000 
900 7 |a Metadata rights reserved  |b Springer special CC-BY-NC licence  |2 nationallicence 
908 |D 1  |a research-article  |2 jats 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-springer 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.1007/s10994-015-5488-x  |q text/html  |z Onlinezugriff via DOI 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Wang  |D William  |u School of Computer Science, Carnegie Mellon University, 5000 Forbes Ave., 15213, Pittsburgh, PA, USA  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Mazaitis  |D Kathryn  |u School of Computer Science, Carnegie Mellon University, 5000 Forbes Ave., 15213, Pittsburgh, PA, USA  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Lao  |D Ni  |u Google Inc., 1600 Amphitheatre Parkway, 94043, Mountain View, CA, USA  |4 aut 
950 |B NATIONALLICENCE  |P 700  |E 1-  |a Cohen  |D William  |u School of Computer Science, Carnegie Mellon University, 5000 Forbes Ave., 15213, Pittsburgh, PA, USA  |4 aut 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t Machine Learning  |d Springer US; http://www.springer-ny.com  |g 100/1(2015-07-01), 101-126  |x 0885-6125  |q 100:1<101  |1 2015  |2 100  |o 10994