ttss-upf/Multilingual-RDF-Verbalizer
Multilingual RDF verbalizer - GSoC/2020

This repository contains the code of the Google Summer of Code project.

Author - Marco Sobrevilla

Abstract:

This project aims to create a natural language generation framework that verbalizes RDF triples.

An RDF triple set is a collection of triples, each of the form < subject | predicate | object >. The model takes such a set as input and outputs the information it encodes in human-readable form.

For example, given the input < Marco_Sobrevilla | birthplace | Lima > < Marco_Sobrevilla | lives in | Brazil >, the output is: Marco Sobrevilla was born in Lima, and lives in Brazil. The model must be able to do the same in multiple languages, hence the name multilingual RDF verbalizer.
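To make the input format concrete, here is a minimal sketch of how a string in the `< subject | predicate | object >` notation could be parsed into structured triples and flattened into a token sequence for a sequence model. The names `Triple`, `parse_triples`, and `linearize`, and the `<S>`/`<P>`/`<O>` marker tokens, are illustrative assumptions, not part of this repository's code.

```python
import re
from typing import List, NamedTuple


class Triple(NamedTuple):
    """One RDF triple (hypothetical helper type, not from the repository)."""
    subject: str
    predicate: str
    obj: str


# Matches "< subject | predicate | object >", tolerating extra whitespace.
TRIPLE_PATTERN = re.compile(
    r"<\s*([^|<>]+?)\s*\|\s*([^|<>]+?)\s*\|\s*([^|<>]+?)\s*>"
)


def parse_triples(text: str) -> List[Triple]:
    """Extract every triple found in the input string, in order."""
    return [Triple(*m.groups()) for m in TRIPLE_PATTERN.finditer(text)]


def linearize(triples: List[Triple]) -> str:
    """Flatten triples into one string with assumed marker tokens."""
    return " ".join(
        f"<S> {t.subject} <P> {t.predicate} <O> {t.obj}" for t in triples
    )


example = (
    "< Marco_Sobrevilla | birthplace | Lima > "
    "< Marco_Sobrevilla | lives in | Brazil >"
)
triples = parse_triples(example)
linearized = linearize(triples)
```

A flat linearization like this is only one possible encoder input. A graph-based encoder would instead build nodes and edges from the parsed triples.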

In particular, this work is divided into two parts:

  • The first part explores integrating pre-trained node embeddings into the previous GSoC project, which uses a Graph Attention Network to encode the triple set and a Transformer to decode its surface realisation.
  • The second part explores different approaches to hierarchical decoding, i.e., executing each task (discourse ordering, text structuring, and lexicalisation) sequentially.
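The sequential idea behind hierarchical decoding can be sketched as a pipeline in which each stage consumes the previous stage's output. The rule-based stages below are toy placeholders standing in for learned models. The function names, the `PREDICATE_PRIORITY` table, and the template wording are all assumptions made for illustration, not the approach implemented in this repository.

```python
from typing import Dict, List, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

# Hypothetical ordering preference; a real system would learn this.
PREDICATE_PRIORITY: Dict[str, int] = {"was born in": 0, "lives in": 1}


def order_triples(triples: List[Triple]) -> List[Triple]:
    """Discourse ordering: decide the order in which facts are mentioned."""
    return sorted(
        triples,
        key=lambda t: PREDICATE_PRIORITY.get(t[1], len(PREDICATE_PRIORITY)),
    )


def structure_text(ordered: List[Triple]) -> List[List[Triple]]:
    """Text structuring: group consecutive same-subject triples per sentence."""
    sentences: List[List[Triple]] = []
    for triple in ordered:
        if sentences and sentences[-1][0][0] == triple[0]:
            sentences[-1].append(triple)
        else:
            sentences.append([triple])
    return sentences


def lexicalise(sentences: List[List[Triple]]) -> str:
    """Lexicalisation: turn each group of triples into a surface sentence."""
    output = []
    for group in sentences:
        subject = group[0][0].replace("_", " ")
        clauses = [f"{pred} {obj.replace('_', ' ')}" for _, pred, obj in group]
        output.append(f"{subject} " + " and ".join(clauses) + ".")
    return " ".join(output)


def verbalise(triples: List[Triple]) -> str:
    """Run the three stages one after another."""
    return lexicalise(structure_text(order_triples(triples)))


text = verbalise([
    ("Marco_Sobrevilla", "lives in", "Brazil"),
    ("Marco_Sobrevilla", "was born in", "Lima"),
])
```

Keeping the stages as separate functions means any one of them could be replaced by a trained component without changing the others.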

You can see my GSoC posts in this link.
