Comparing changes

base repository: naourass/pawls
base: main
...
head repository: allenai/pawls
compare: main
  • 11 commits
  • 6 files changed
  • 5 contributors

Commits on Sep 8, 2022

  1. one bucket (allenai#187)

    geli-gel authored Sep 8, 2022
    b950c13

Commits on Oct 31, 2022

  1. Adding requirements which are needed to run ci (allenai#188)

    * Adding requirements which are needed to run ci
    
    * Tabs to spaces
    egork520 authored Oct 31, 2022
    87731d5

Commits on Dec 6, 2022

  1. 1225660

Commits on Feb 6, 2023

  1. Bump json5 from 1.0.1 to 1.0.2 in /ui (allenai#196)

    Bumps [json5](https://github.com/json5/json5) from 1.0.1 to 1.0.2.
    - [Release notes](https://github.com/json5/json5/releases)
    - [Changelog](https://github.com/json5/json5/blob/main/CHANGELOG.md)
    - [Commits](json5/json5@v1.0.1...v1.0.2)
    
    ---
    updated-dependencies:
    - dependency-name: json5
      dependency-type: indirect
    ...
    
    Signed-off-by: dependabot[bot] <support@github.com>
    Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
    dependabot[bot] authored Feb 6, 2023
    d422c53
  2. Bump eventsource from 1.1.0 to 1.1.1 in /ui (allenai#178)

    Bumps [eventsource](https://github.com/EventSource/eventsource) from 1.1.0 to 1.1.1.
    - [Release notes](https://github.com/EventSource/eventsource/releases)
    - [Changelog](https://github.com/EventSource/eventsource/blob/master/HISTORY.md)
    - [Commits](EventSource/eventsource@v1.1.0...v1.1.1)
    
    ---
    updated-dependencies:
    - dependency-name: eventsource
      dependency-type: indirect
    ...
    
    Signed-off-by: dependabot[bot] <support@github.com>
    Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
    dependabot[bot] authored Feb 6, 2023
    c215c4e
  3. Bump terser from 4.8.0 to 4.8.1 in /ui (allenai#183)

    Bumps [terser](https://github.com/terser/terser) from 4.8.0 to 4.8.1.
    - [Release notes](https://github.com/terser/terser/releases)
    - [Changelog](https://github.com/terser/terser/blob/master/CHANGELOG.md)
    - [Commits](https://github.com/terser/terser/commits)
    
    ---
    updated-dependencies:
    - dependency-name: terser
      dependency-type: indirect
    ...
    
    Signed-off-by: dependabot[bot] <support@github.com>
    Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
    dependabot[bot] authored Feb 6, 2023
    fc31b26
  4. Bump moment from 2.29.3 to 2.29.4 in /ui (allenai#189)

    Bumps [moment](https://github.com/moment/moment) from 2.29.3 to 2.29.4.
    - [Release notes](https://github.com/moment/moment/releases)
    - [Changelog](https://github.com/moment/moment/blob/develop/CHANGELOG.md)
    - [Commits](moment/moment@2.29.3...2.29.4)
    
    ---
    updated-dependencies:
    - dependency-name: moment
      dependency-type: indirect
    ...
    
    Signed-off-by: dependabot[bot] <support@github.com>
    Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
    dependabot[bot] authored Feb 6, 2023
    f170dbc
  5. Bump decode-uri-component from 0.2.0 to 0.2.2 in /ui (allenai#192)

    Bumps [decode-uri-component](https://github.com/SamVerschueren/decode-uri-component) from 0.2.0 to 0.2.2.
    - [Release notes](https://github.com/SamVerschueren/decode-uri-component/releases)
    - [Commits](SamVerschueren/decode-uri-component@v0.2.0...v0.2.2)
    
    ---
    updated-dependencies:
    - dependency-name: decode-uri-component
      dependency-type: indirect
    ...
    
    Signed-off-by: dependabot[bot] <support@github.com>
    Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
    dependabot[bot] authored Feb 6, 2023
    d4d6eac
  6. Bump qs from 6.5.2 to 6.5.3 in /ui (allenai#194)

    Bumps [qs](https://github.com/ljharb/qs) from 6.5.2 to 6.5.3.
    - [Release notes](https://github.com/ljharb/qs/releases)
    - [Changelog](https://github.com/ljharb/qs/blob/main/CHANGELOG.md)
    - [Commits](ljharb/qs@v6.5.2...v6.5.3)
    
    ---
    updated-dependencies:
    - dependency-name: qs
      dependency-type: indirect
    ...
    
    Signed-off-by: dependabot[bot] <support@github.com>
    Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
    dependabot[bot] authored Feb 6, 2023
    fa64c78
  7. Bump express from 4.17.1 to 4.18.2 in /ui (allenai#195)

    Bumps [express](https://github.com/expressjs/express) from 4.17.1 to 4.18.2.
    - [Release notes](https://github.com/expressjs/express/releases)
    - [Changelog](https://github.com/expressjs/express/blob/master/History.md)
    - [Commits](expressjs/express@4.17.1...4.18.2)
    
    ---
    updated-dependencies:
    - dependency-name: express
      dependency-type: indirect
    ...
    
    Signed-off-by: dependabot[bot] <support@github.com>
    Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
    dependabot[bot] authored Feb 6, 2023
    51b2a63
  8. Line 43 of cli.pawls.preprocessors.tesseract in extract_page_tokens()…

    … fails when the underlying text datatype is not actually text. I assume this is rare, but it depends on the original source PDF authoring tool. I have a PDF where one page only has a number on it, and it appears the data type extracted to the dataframe is float64. This fails with the extract_page_tokens() function as written. Added .astype(str) to line 43 to force conversion to string, which should cover these kinds of corner cases. Working for me, at least on the PDF that was crashing the parser. (allenai#199)
    JSv4 authored Feb 6, 2023
    57fc217
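
The failure described in the last commit can be reproduced in isolation. The sketch below is not the PAWLS source: the `ocr_data` frame and the `non_empty_tokens` helper are hypothetical stand-ins, and it assumes the OCR output arrives as a pandas DataFrame with a `text` column. When every value on a page is numeric, pandas infers float64 for that column, so the `.str` accessor raises; casting with `.astype(str)` first avoids that.

```python
import pandas as pd

# Hypothetical stand-in for the OCR output of a page that contains only a number.
# Because no value is a string, pandas infers float64 for the "text" column.
ocr_data = pd.DataFrame({"text": [42.0, None], "conf": [96, -1]})


def non_empty_tokens(data: pd.DataFrame) -> pd.DataFrame:
    """Keep rows whose text is present and not blank, whatever the column dtype."""
    text = data["text"]
    # astype(str) makes the .str accessor usable even on a float64 column.
    keep = text.notna() & (text.astype(str).str.strip() != "")
    return data[keep]


# Without the cast, the .str accessor rejects a non-string column.
try:
    ocr_data["text"].str.strip()
except AttributeError as err:
    print(f"without astype(str): {err}")

print(non_empty_tokens(ocr_data))
```

The `notna()` check is kept alongside the cast because `astype(str)` may turn missing values into the literal string "nan", which would otherwise pass the blank-text test.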