-
Notifications
You must be signed in to change notification settings - Fork 0
Comparing changes
Open a pull request
base repository: naourass/pawls
base: main
head repository: allenai/pawls
compare: main
- 11 commits
- 6 files changed
- 5 contributors
Commits on Sep 8, 2022
-
Configuration menu - View commit details
-
Copy full SHA for b950c13 - Browse repository at this point
Copy the full SHA b950c13View commit details
Commits on Oct 31, 2022
-
Adding requirements which are needed to run ci (allenai#188)
* Adding requirements which are needed to run ci * Tabs to spaces
Configuration menu - View commit details
-
Copy full SHA for 87731d5 - Browse repository at this point
Copy the full SHA 87731d5View commit details
Commits on Dec 6, 2022
-
Configuration menu - View commit details
-
Copy full SHA for 1225660 - Browse repository at this point
Copy the full SHA 1225660View commit details
Commits on Feb 6, 2023
-
Bump json5 from 1.0.1 to 1.0.2 in /ui (allenai#196)
Bumps [json5](https://github.com/json5/json5) from 1.0.1 to 1.0.2. - [Release notes](https://github.com/json5/json5/releases) - [Changelog](https://github.com/json5/json5/blob/main/CHANGELOG.md) - [Commits](json5/json5@v1.0.1...v1.0.2) --- updated-dependencies: - dependency-name: json5 dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for d422c53 - Browse repository at this point
Copy the full SHA d422c53View commit details -
Bump eventsource from 1.1.0 to 1.1.1 in /ui (allenai#178)
Bumps [eventsource](https://github.com/EventSource/eventsource) from 1.1.0 to 1.1.1. - [Release notes](https://github.com/EventSource/eventsource/releases) - [Changelog](https://github.com/EventSource/eventsource/blob/master/HISTORY.md) - [Commits](EventSource/eventsource@v1.1.0...v1.1.1) --- updated-dependencies: - dependency-name: eventsource dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for c215c4e - Browse repository at this point
Copy the full SHA c215c4eView commit details -
Bump terser from 4.8.0 to 4.8.1 in /ui (allenai#183)
Bumps [terser](https://github.com/terser/terser) from 4.8.0 to 4.8.1. - [Release notes](https://github.com/terser/terser/releases) - [Changelog](https://github.com/terser/terser/blob/master/CHANGELOG.md) - [Commits](https://github.com/terser/terser/commits) --- updated-dependencies: - dependency-name: terser dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for fc31b26 - Browse repository at this point
Copy the full SHA fc31b26View commit details -
Bump moment from 2.29.3 to 2.29.4 in /ui (allenai#189)
Bumps [moment](https://github.com/moment/moment) from 2.29.3 to 2.29.4. - [Release notes](https://github.com/moment/moment/releases) - [Changelog](https://github.com/moment/moment/blob/develop/CHANGELOG.md) - [Commits](moment/moment@2.29.3...2.29.4) --- updated-dependencies: - dependency-name: moment dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for f170dbc - Browse repository at this point
Copy the full SHA f170dbcView commit details -
Bump decode-uri-component from 0.2.0 to 0.2.2 in /ui (allenai#192)
Bumps [decode-uri-component](https://github.com/SamVerschueren/decode-uri-component) from 0.2.0 to 0.2.2. - [Release notes](https://github.com/SamVerschueren/decode-uri-component/releases) - [Commits](SamVerschueren/decode-uri-component@v0.2.0...v0.2.2) --- updated-dependencies: - dependency-name: decode-uri-component dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for d4d6eac - Browse repository at this point
Copy the full SHA d4d6eacView commit details -
Bump qs from 6.5.2 to 6.5.3 in /ui (allenai#194)
Bumps [qs](https://github.com/ljharb/qs) from 6.5.2 to 6.5.3. - [Release notes](https://github.com/ljharb/qs/releases) - [Changelog](https://github.com/ljharb/qs/blob/main/CHANGELOG.md) - [Commits](ljharb/qs@v6.5.2...v6.5.3) --- updated-dependencies: - dependency-name: qs dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for fa64c78 - Browse repository at this point
Copy the full SHA fa64c78View commit details -
Bump express from 4.17.1 to 4.18.2 in /ui (allenai#195)
Bumps [express](https://github.com/expressjs/express) from 4.17.1 to 4.18.2. - [Release notes](https://github.com/expressjs/express/releases) - [Changelog](https://github.com/expressjs/express/blob/master/History.md) - [Commits](expressjs/express@4.17.1...4.18.2) --- updated-dependencies: - dependency-name: express dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 51b2a63 - Browse repository at this point
Copy the full SHA 51b2a63View commit details -
Line 43 of cli.pawls.preprocessors.tesseract in extract_page_tokens()…
… fails when the underlying text datatype is not actually text. I assume this is rare but is dependent on the original source PDF authoring tool. I have a pdf where once page only has a number on it and it appears the data type that is extracted to the dataframe is float64. This fails with the extract_page_tokens() function as written. Added .astype(str) to line 43 to force conversion to string, which should cover these kinds of corner cases. Working for me at least on the pdf that was crashingt the parser. (allenai#199)
Configuration menu - View commit details
-
Copy full SHA for 57fc217 - Browse repository at this point
Copy the full SHA 57fc217View commit details
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff main...main