reproducible orca running image tests #3237

antoinerg · 2018-11-12T21:35:33Z

This is a WIP proof of principle showcasing that Orca can yield reproducible output when running a software implementation of OpenGL. In this branch, test images generated on the CI are compared with baselines generated on my laptop with a tolerance of zero.
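
For illustration, a tolerance of zero can be read as "the test image and the baseline must be identical". Below is a minimal sketch of such a check, assuming a byte-level comparison of the two files; `imagesMatchExactly` is a hypothetical helper, and the branch itself may compare decoded pixels with a different tool.

```javascript
// Hypothetical helper: with a tolerance of zero, two images match only if
// their contents are identical. This sketch compares raw bytes.
function imagesMatchExactly(testBuffer, baselineBuffer) {
    // Buffer#equals returns true only when both buffers hold the same bytes.
    return testBuffer.equals(baselineBuffer);
}

// Tiny in-memory stand-ins for image files (not real PNG data).
const baseline = Buffer.from([137, 80, 78, 71]);
const sameAsBaseline = Buffer.from([137, 80, 78, 71]);
const oneByteOff = Buffer.from([137, 80, 78, 70]);

console.log(imagesMatchExactly(sameAsBaseline, baseline)); // true
console.log(imagesMatchExactly(oneByteOff, baseline));     // false
```

A byte-level check is stricter than a pixel-level one: two files with identical pixels but different metadata would be reported as different.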

Because this is software OpenGL, a lot more CPU time is needed. On one core, running the tests took roughly 20-30 minutes. By making use of 8 parallel containers, this time is cut down to ~5 minutes, which is faster than the Jasmine tests we currently have.
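
As a rough sketch of how the mocks could be divided among parallel containers: shuffle the list with a fixed seed so every container computes the same order, then let each container take every N-th entry. The function names and the seed are hypothetical, and the branch currently does this in bash scripts. `CIRCLE_NODE_INDEX` and `CIRCLE_NODE_TOTAL` are the environment variables CircleCI sets on parallel containers.

```javascript
// Small seeded PRNG (mulberry32 style) so the shuffle is reproducible.
function seededRandom(seed) {
    let a = seed >>> 0;
    return function() {
        a = (a + 0x6D2B79F5) >>> 0;
        let t = a;
        t = Math.imul(t ^ (t >>> 15), t | 1);
        t ^= t + Math.imul(t ^ (t >>> 7), t | 61);
        return ((t ^ (t >>> 14)) >>> 0) / 4294967296;
    };
}

// Fisher-Yates shuffle driven by the seeded PRNG; returns a new array.
function shuffleDeterministic(items, seed) {
    const out = items.slice();
    const rand = seededRandom(seed);
    for (let i = out.length - 1; i > 0; i--) {
        const j = Math.floor(rand() * (i + 1));
        [out[i], out[j]] = [out[j], out[i]];
    }
    return out;
}

// Each node keeps the entries whose position modulo nodeTotal is its index.
function mocksForNode(mocks, nodeIndex, nodeTotal, seed) {
    // Sort first so the result does not depend on directory listing order.
    const shuffled = shuffleDeterministic(mocks.slice().sort(), seed);
    return shuffled.filter((_, i) => i % nodeTotal === nodeIndex);
}

// Usage: defaults make the script behave as a single node outside CircleCI.
const nodeIndex = parseInt(process.env.CIRCLE_NODE_INDEX || '0', 10);
const nodeTotal = parseInt(process.env.CIRCLE_NODE_TOTAL || '1', 10);
const mocks = ['gl2d_fill_trace_tozero_order', 'gl3d_cone_rossler', 'gl3d_chrisp-nan-1'];
console.log(mocksForNode(mocks, nodeIndex, nodeTotal, 1234));
```

Shuffling spreads the slow gl3d mocks across chunks instead of leaving them clustered by name, and the fixed seed guarantees that the chunks never overlap or leave a mock out.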

To do:

Move the Dockerfile for "antoinerg/orca-reproducible:latest" into this repo and build the image on CircleCI
Reuse the nice code from @etpinard's PR Image tests using orca #2615
Replace bash scripts with node.js code
Update the dev commands npm run (test-image|baseline) and npm run docker and make sure they run in parallel
Orca should use the plotly.js in the build folder instead of using latest.
Remove images from test_images_diff that are exactly the same.
Add mapbox support
Add mathjax support

Accuracy issue:

Improve font rendering which is far too different from what it used to be (ex.: https://github.com/plotly/plotly.js/pull/3237/files?short_path=e396be8#diff-e396be8508e78ecdf1f6fdf653810183)
Investigate why markers are rendered bigger than they should be in gl3d (ex.: gl2d_fill_trace_tozero_order)

alexcjohnson · 2018-11-13T16:51:55Z

Very nice work @antoinerg! Does the parallelism work locally, based on the number of cores on the local machine? 🙏

I see a step pushing artifacts, but I can't actually find them in the test results - are they there and I'm just not finding them?

There's a lot to look at in the image outputs, and we want to be extremely careful since it's updating every mock we have!

In the SVG mocks, the only substantial change I see is in the fonts: some have very different sizing and some don't seem to be recognized in orca at all. Here's onion skin halfway between the two:

I know fonts have always been a massive pain to get to match, but this seems farther off than is acceptable.

In WebGL the biggest thing I see is that all markers got substantially bigger (in both 2D and 3D). Onion skin again:

If I look at one of the mocks that puts GL2D and SVG side-by-side (for example gl2d_fill_trace_tozero_order), the old version seems more correct. Not really sure how to troubleshoot that though... some feature we're using for only marker sizing that behaves oddly in software OpenGL?

There's one issue in GL3D that seems to have been fixed here though! Some of our baselines don't correctly rotate tick labels. See for example gl3d_cone_rossler - on this branch and on my screen they're rotated, but in the master baseline they are not. Oddly enough some of the master baselines do have rotated labels, eg gl3d_chrisp-nan-1 which makes it even more puzzling, but if we can get the rest of these issues solved at least that one we can ignore 🎉

antoinerg · 2018-11-13T17:36:40Z

> Very nice work @antoinerg!

Thank you @alexcjohnson !

> Does the parallelism work locally, based on the number of cores on the local machine?

Although the script to do so is not here yet, yes, it would work by running one container per core in parallel! FYI, running a bunch of Orcas in parallel in the same Docker container was not reliable in my experience. I will add a script to do so if we end up adopting Orca. Note that some gl3d mocks require quite a bit of memory to render (maybe 1GB), so we may not want to run 8 at a time on a laptop.
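
A minimal sketch of how a local script might pick the number of containers, assuming one container per core capped by available memory. `containerCount` is a hypothetical helper, and the 1 GB per container figure is only the rough estimate mentioned above.

```javascript
const os = require('os');

// Hypothetical helper: one container per core, but never more than memory
// allows, and always at least one.
function containerCount(cores, totalMemBytes, memPerContainerBytes) {
    const byMemory = Math.floor(totalMemBytes / memPerContainerBytes);
    return Math.max(1, Math.min(cores, byMemory));
}

const GB = 1024 * 1024 * 1024;

// Assumption: budget about 1 GB per container for the heaviest gl3d mocks.
console.log(containerCount(os.cpus().length, os.totalmem(), 1 * GB));
```

Using total memory is optimistic on a laptop with other programs running; a more cautious script could use `os.freemem()` instead.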

> I see a step pushing artifacts, but I can't actually find them in the test results - are they there and I'm just not finding them?

They are in there: https://circleci.com/gh/plotly/plotly.js/18671#artifacts/containers/0

> In the SVG mocks, the only substantial change I see is in the fonts: some have very different sizing and some don't seem to be recognized in orca at all.

When I build the Docker image, I need to install all the fonts we would like to support. As of right now, I just copied the ones from Orca's Dockerfile: https://github.com/plotly/orca/blob/a0e7314a784802b4824b359aeff891e0d40be184/deployment/Dockerfile#L51-L77
Hopefully I am just missing a few, which would explain the difference. The next step would be to check the OS and Electron's font rendering options.

> If I look at one of the mocks that puts GL2D and SVG side-by-side (for example gl2d_fill_trace_tozero_order), the old version seems more correct. Not really sure how to troubleshoot that though... some feature we're using for only marker sizing that behaves oddly in software OpenGL?

Nice find 🔍 ! It's true that the markers are rendered bigger than they should be, as clearly shown in gl2d_fill_trace_tozero_order. I think you're right: the browser makes an OpenGL call that is rendered oddly by this software implementation of OpenGL. Either we're using an old, badly supported GL call (like we do here: gl-vis/gl-error3d#5) or the software renderer needs to be updated/improved.

Thanks for the review! I guess the interim conclusion is that the current solution is exactly reproducible but not yet accurate 🤔

antoinerg · 2018-11-14T21:51:38Z

About fonts: the Droid fonts do not get downloaded when building Orca's Dockerfile, so I opened an issue over there: plotly/orca#146. When building orca-reproducible I will instead copy the fonts we had in the old image-server.

antoinerg · 2018-11-14T23:15:54Z

I am hopeful we can fix the issue with fonts or at least get close enough. I think I'm only missing "Courier New" as you can see in this onion skin halfway between the two:

The updated baselines are in commit 9779c46!

antoinerg · 2018-11-15T17:05:54Z

@alexcjohnson

I found out the reason why the markers are rendered bigger in WebGL!

The size of markers in regl-scatter2d is set by this line:

gl_PointSize = 2. * size * pixelRatio;

The pixelRatio above is forced to be 2.5 in Orca on this line instead of its previous value of 2:

  plotGlPixelRatio: 2.5,

By forcing it to be 2, the markers are now correctly sized:

I will push a new set of baselines with this change!

alexcjohnson · 2018-11-15T18:22:39Z

> gl_PointSize = 2. * size * pixelRatio;

Oh wow, that's a problem, nice find! But the fix is not to force orca to use the default value. pixelRatio should not affect anything about the plot except the quality of antialiasing. We should be able to make a regular in-browser plot using plotGlPixelRatio: 5 or something and, other than potentially running out of memory, everything should look the same just smoother.

So was pixelRatio already included in size or something? Should we just change that line to gl_PointSize = 4. * size; or something like that?
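
To make the arithmetic concrete, here is a small sketch comparing the current shader expression with the constant-factor variant floated above. It only evaluates the two formulas in plain JavaScript; the function names and the marker size of 10 are arbitrary, and it says nothing about how the canvas is scaled afterwards.

```javascript
// Current expression in the shader: gl_PointSize = 2. * size * pixelRatio;
function pointSizeCurrent(size, pixelRatio) {
    return 2 * size * pixelRatio;
}

// Variant floated in this thread: gl_PointSize = 4. * size;
function pointSizeProposed(size) {
    return 4 * size;
}

const size = 10; // arbitrary marker size, for illustration only

[2, 2.5, 5].forEach(function(pixelRatio) {
    console.log(pixelRatio, pointSizeCurrent(size, pixelRatio), pointSizeProposed(size));
});
// 2   40  40
// 2.5 50  40
// 5   100 40
```

Going from a pixelRatio of 2 to 2.5 multiplies the current gl_PointSize by 1.25, while the constant-factor variant stays at the value the current formula gives for a pixelRatio of 2.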

antoinerg · 2018-11-15T18:30:24Z

> So was pixelRatio already included in size or something?

No, I don't think it's already included in the variable size.

> Should we just change that line to gl_PointSize = 4. * size; or something like that?

I am under the impression that we've always been rendering plots with pixelRatio=2, so yes, I would be tempted to change that line to gl_PointSize = 4. * size; as well as the few other lines that have pixelRatio in them.

> We should be able to make a regular in-browser plot using plotGlPixelRatio: 5 or something and, other than potentially running out of memory, everything should look the same just smoother.

Ok thanks for the explanation! I will test this out and open an issue if it doesn't behave as such.

> But the fix is not to force orca to use the default value.

I did even worse in my last commit: I forced the value to be 2 in plotly.js itself. This was just for testing, however. We should fix the aforementioned issue! At the very least, this PR will have allowed us to uncover a bug 🪲 🔍!

Thank you @alexcjohnson

archmoj · 2019-04-10T12:09:58Z

> gl_PointSize = 2. * size * pixelRatio;
>
> Oh wow, that's a problem, nice find! But the fix is not to force orca to use the default value. pixelRatio should not affect anything about the plot except the quality of antialiasing. We should be able to make a regular in-browser plot using plotGlPixelRatio: 5 or something and, other than potentially running out of memory, everything should look the same just smoother.
>
> So was pixelRatio already included in size or something? Should we just change that line to gl_PointSize = 4. * size; or something like that?

@etpinard That's exactly what I still think we should do here: gl-vis/regl-scatter2d#20.

archmoj · 2020-08-21T14:41:51Z

@antoinerg the gl2d marker sizes are fixed in #5093. FYI - I simply checked the item in the PR description.

archmoj · 2021-06-25T14:05:40Z

Closing now that #5724 merged.

antoinerg added 26 commits November 8, 2018 18:10

test software rendering on CI

dfcb720

add test-image-orca to workflow

f79008a

do no checkout source for test-image-orca

b0e4789

do checkout source for test-image-orca

75a9fcd

baselines from local dev machine

822a7df

compare test images with baselines

3cb8631

shuffle list of mocks to even out time of calculation per chunk

362769d

remove previous test-image jobs

eb4cde3

shuffle in a deterministic fashion

6bc3549

print out level of parallelism

02abebb

fix missing whitespace

c360099

replace env variable CI with CIRCLECI

ca939fd

split work across nodes

a0f3d38

add parallelism

3aa1244

bump parallelism to 8

d6543bf

reduce no_output_timeout to detect when orca freezes

a21d0e7

support 3 digit parallelism number

d23f8ed

bump parallelism to 12

e178fd1

update a few baselines with reproducible ones

cb52e30

set parallelism back to 8

4a44a98

retry if orca hangs

b248309

fix executable path

b9ea11a

fix retry logic

18ba1a3

orca-build-verify.sh accepts mock's name as command line argument

dff6773

double-check images that fail

d05fed0

comment runner code

5a88f9d

antoinerg added type: maintenance labels Nov 12, 2018

fix image paths

d86fa93

antoinerg added 5 commits November 14, 2018 13:41

use plotly.js from build folder

bb0a91c

update baseline using current branch's plotly.js

8dfe096

update another using currently built plotly.js

835e588

use orca with MathJax support

ba050a0

delete diff images that perfectly match

799d64a

antoinerg force-pushed the orca-reproducible branch from 80730ed to 799d64a Compare November 14, 2018 20:37

update baselines containing MathJax now that it's enabled

b3b56c3

antoinerg force-pushed the orca-reproducible branch from cd76e85 to b3b56c3 Compare November 14, 2018 21:47

update baselines after installing missing fonts

9779c46

antoinerg force-pushed the orca-reproducible branch from d15b171 to 9779c46 Compare November 15, 2018 17:41

update baselines with pixelRatio forced to 2

70ea751

antoinerg mentioned this pull request Nov 15, 2018

config option plotGlPixelRatio changes size of markers #3246

Closed

antoinerg removed the status: discussion needed label Nov 20, 2018

antoinerg mentioned this pull request Mar 15, 2019

Preserve marker size in scattergl traces with different glPixelRatio values #3637

Closed

etpinard mentioned this pull request Mar 29, 2019

Image tests using orca #2615

Closed

archmoj added status: on hold and removed status: in progress labels Mar 17, 2020

archmoj mentioned this pull request Jun 10, 2021

Simplify the process of creating baselines using Kaleido and improve image & other export test systems #5724

Merged

2 tasks

archmoj closed this Jun 25, 2021

archmoj deleted the orca-reproducible branch June 25, 2021 14:05
