Skip to content

Commit

Permalink
RSBench
Browse files Browse the repository at this point in the history
  • Loading branch information
arranger1044 committed Aug 20, 2024
1 parent fe41277 commit eed535e
Show file tree
Hide file tree
Showing 3 changed files with 30 additions and 0 deletions.
7 changes: 7 additions & 0 deletions _news/bortolotti2024rsbench-preprint.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,7 @@
---
title: "RSBench"
collection: news
permalink: /news/rsbench-preprint
date: 2024-06-14
---
How to evaluate if neuro-symbolic systems are learning the right concepts or are falling prey of resoning shortcuts? Answer in <a href="https://arxiv.org/abs/2406.10368">this new preprint</a>!
23 changes: 23 additions & 0 deletions _publications/bortolotti2024rsbench.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,23 @@
---
collection: publications
ref: "bortolotti2024rsbench"
permalink: "publications/bortolotti2024rsbench"
title: "A Benchmark Suite for Systematically Evaluating Reasoning Shortcuts"
date: 2024-06-14 00:00
tags: nesy shortcuts reasoning
image: "/images/papers/bortolotti2024rsbench/rsbench.png"
authors: "Samuele Bortolotti, Emanuele Marconato, Tommaso Carraro, Paolo Morettin, Emile van Krieken, Antonio Vergari, Stefano Teso, Andrea Passerini"
paperurl: "https://unitn-sml.github.io/rsbench/"
pdf: "https://arxiv.org/pdf/2406.10368"
venue: "arXiv 2024"
code: "https://github.com/unitn-sml/rsbench-code"
excerpt: "How to evaluate if neuro-symbolic systems are learning the right concepts or are falling prey of resoning shortcuts?"
abstract: "The advent of powerful neural classifiers has increased interest in problems that require both learning and reasoning. These problems are critical for understanding important properties of models, such as trustworthiness, generalization, interpretability, and compliance to safety and structural constraints. However, recent research observed that tasks requiring both learning and reasoning on background knowledge often suffer from reasoning shortcuts (RSs): predictors can solve the downstream reasoning task without associating the correct concepts to the high-dimensional data. To address this issue, we introduce rsbench, a comprehensive benchmark suite designed to systematically evaluate the impact of RSs on models by providing easy access to highly customizable tasks affected by RSs. Furthermore, rsbench implements common metrics for evaluating concept quality and introduces novel formal verification procedures for assessing the presence of RSs in learning tasks. Using rsbench, we highlight that obtaining high quality concepts in both purely neural and neuro-symbolic models is a far-from-solved problem."
supplemental:
bibtex: "@article{bortolotti2024benchmark,
title={A Benchmark Suite for Systematically Evaluating Reasoning Shortcuts},
author={Bortolotti, Samuele and Marconato, Emanuele and Carraro, Tommaso and Morettin, Paolo and van Krieken, Emile and Vergari, Antonio and Teso, Stefano and Passerini, Andrea},
journal={arXiv preprint arXiv:2406.10368},
year={2024}
}"
---
Binary file added images/papers/bortolotti2024rsbench/rsbench.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit eed535e

Please sign in to comment.