
Overhaul of ResNet API #174

Merged
merged 67 commits into FluxML:master from resnet-plus on Aug 2, 2022

Conversation

theabhirath
Member

@theabhirath theabhirath commented Jun 21, 2022

This PR completely rewrites the current ResNet API to make it more powerful and more extensible, and to reduce code duplication.

Why this PR?

While making ResNet more fully-featured, this PR will also:

  1. Add support for DropBlock to ResNet, a type of regularisation used in place of Dropout in some networks
  2. Add support for an optional deeper stem up front
  3. Add support for attention layers
  4. Add support for multiple pooling options in the classifier head
  5. Allow for more ResNet block variants (such as those from the Bag of Tricks paper)

Things to do

  • Abstract out pooling and classifier heads to provide more options
  • Add attention layer
  • Specific constructors for the ResNet paper models
  • Documentation, Documentation and more documentation
    • DropBlock and its behaviour
    • DropPath behaviour in detail (permissible values, calculations)
    • Higher level ResNet interface vs lower level resnet interface
  • Rewrite ResNeXt to use the new resnet API
  • Write more tests
  • Benchmark to make sure there are no regressions
    • Forward pass
    • Backward pass
    • TTFG (time to first gradient)
  • Figure out a way to port pre-trained weights

Other PRs to land before this one

Miscellaneous fixes

  • Adds a type argument to densenet for nblocks to avoid hitting integer edge cases

@theabhirath
Member Author

theabhirath commented Jun 23, 2022

Some perks of the new API:

0.7.2:

julia> model = ResNet(50);

julia> @benchmark Zygote.gradient(p -> sum($model(p)), $x)
BenchmarkTools.Trial: 1 sample with 1 evaluation.
 Single result which took 6.698 s (87.06% GC) to evaluate,
 with a memory estimate of 2.46 GiB, over 47810 allocations.

julia> model = ResNet(18);

julia> @benchmark Zygote.gradient(p -> sum($model(p)), $x)
BenchmarkTools.Trial: 2 samples with 1 evaluation.
 Range (min … max):  2.576 s …  2.580 s  ┊ GC (min … max): 87.60% … 87.65%
 Time  (median):     2.578 s             ┊ GC (median):    87.63%
 Time  (mean ± σ):   2.578 s ± 2.770 ms  ┊ GC (mean ± σ):  87.63% ±  0.03%

  █                                                      █
  █▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁█ ▁
  2.58 s        Histogram: frequency by time        2.58 s <

 Memory estimate: 1.01 GiB, allocs estimate: 19594.

This PR:

julia> model = ResNet(50);

julia> @benchmark Zygote.gradient(p -> sum($model(p)), $x)
BenchmarkTools.Trial: 1 sample with 1 evaluation.
 Single result which took 5.644 s (85.62% GC) to evaluate,
 with a memory estimate of 2.50 GiB, over 45095 allocations.

julia> model = ResNet(18);

julia> @benchmark Zygote.gradient(p -> sum($model(p)), $x)
BenchmarkTools.Trial: 13 samples with 1 evaluation. 
 Range (min … max):  338.901 ms … 612.421 ms  ┊ GC (min … max):  4.01% … 46.50%
 Time  (median):     345.959 ms               ┊ GC (median):     5.21%
 Time  (mean ± σ):   416.913 ms ±  90.533 ms  ┊ GC (mean ± σ):  21.52% ± 16.19%

  █▄                            ▁
  ██▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁█▆▆▆▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▆ ▁
  339 ms           Histogram: frequency by time          612 ms <

 Memory estimate: 1.03 GiB, allocs estimate: 17275.

Julia version info:

julia> versioninfo()
Julia Version 1.9.0-DEV.840
Commit 68d62ab3d3 (2022-06-22 21:39 UTC)
Platform Info:
  OS: macOS (arm64-apple-darwin21.5.0)
  CPU: 8 × Apple M1
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-14.0.5 (ORCJIT, apple-m1)
  Threads: 4 on 4 virtual cores
Environment:
  JULIA_NUM_THREADS = 4

@theabhirath
Member Author

How is it that Zygote seems to be getting worse with each passing Julia version, though? I could've sworn it wasn't this bad a couple of weeks ago, and today it seems to be struggling to even calculate a ResNet-50 gradient?

@darsnack
Member

I'm confused how the new API contributes to better gradient times (at least for ResNet where there are no new layers added, right)?

A couple of high-level comments as you work on this:

  • Let's try to make the API more like "pass in what you want" than keywords. What I mean by this is that keywords corresponding to the stem can be eliminated by just having a single stem argument that the user passes in. Then, similar to block, we provide the useful defaults. So someone would pass in resnet_stem(64, :deep) (and we get to allow stuff beyond the defaults...with a "use at your own risk" warning). See the sketch after this list for the rough shape.
  • Try to consolidate as many keyword arguments into the blocks themselves. So if cardinality is just directly passed to the block, then we don't need to explicitly declare it in resnet. In terms of documentation, the default blocks' docstrings should detail these arguments and resnet defers to those.
  • You can submit a PR to the HuggingFace repo when you are ready to port the weights.
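A minimal sketch of the stem idea above, assuming a hypothetical resnet_stem builder (the names, the :deep option, and the layer choices are illustrative rather than existing Metalhead API; norm layers are omitted for brevity):

using Flux

# Hypothetical stem builder: returns the stem as an ordinary Flux model, so
# `resnet` can simply accept whatever stem it is handed.
function resnet_stem(inchannels::Integer = 3; mode::Symbol = :default)
    if mode === :deep
        # a deeper, 3×3-based stem in the spirit of the "Bag of Tricks" paper
        return Chain(Conv((3, 3), inchannels => 32, relu; stride = 2, pad = 1),
                     Conv((3, 3), 32 => 32, relu; pad = 1),
                     Conv((3, 3), 32 => 64, relu; pad = 1),
                     MaxPool((3, 3); stride = 2, pad = 1))
    else
        # the classic 7×7 stem from the original ResNet paper
        return Chain(Conv((7, 7), inchannels => 64, relu; stride = 2, pad = 3),
                     MaxPool((3, 3); stride = 2, pad = 1))
    end
end

# the user passes in the stem they want, so `resnet` needs no stem keywords:
# model = resnet(basicblock, [2, 2, 2, 2]; stem = resnet_stem(mode = :deep))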

@theabhirath
Member Author

theabhirath commented Jun 23, 2022

I'm confused how the new API contributes to better gradient times (at least for ResNet where there are no new layers added, right)?

Well, to be completely honest, I'm not sure, but I have some theories that mostly revolve around how the Chains are nested in the two models. But I can't say for sure since I haven't really tried any of that out.

  • Let's try to make the API more like "pass in what you want" than keywords. What I mean by this is that keywords corresponding to the stem can be eliminated by just having a single stem argument that the user passes in. Then, similar to block, we provide the useful defaults.

This makes sense. We could make these NamedTuples or Dicts, maybe? It's a lot easier to keep track of names instead of the order of arguments to be passed in, and it's annoying to have to pass in five irrelevant arguments just to get to the last relevant one.

  • Try to consolidate as many keyword arguments into the blocks themselves. So if cardinality is just directly passed to the block, then we don't need to explicitly declare it in resnet. In terms of documentation, the default blocks' docstrings should detail these arguments and resnet defers to those.

👍🏽

  • You can submit a PR to the HuggingFace repo when you are ready to port the weights.

This...might take time. The major issue is in terms of getting the model structures to overlap. DropBlock and DropPath add some functionality, but at the cost of having some extra identity layers in the model for the default cases, which is why I was looking at FluxML/Flux.jl#2004. I'll try and see what I can do, though.

@darsnack
Member

darsnack commented Jun 23, 2022

We could make these NamedTuples or Dicts, maybe?

I was thinking even more declarative. Just have a function called resnet_stem or whatever you think is appropriate. So like resnet(stem = resnet_stem(mode = :tiered), ...) where resnet_stem actually returns the stem just like basicblock actually returns the block. This would allow for the same functionality as the keywords or named tuple, while also allowing the stem to be any model (so as flexible as possible).

I find that these kinds of declarative interfaces are more flexible and easier to keep track of mentally. But they usually take more typing. Possibly we can merge your idea with this and allow a named tuple or Dict too. Then that dispatch should pass the pairs of the named tuple into resnet_stem by default. You would have some intermediate _make_stem(::NamedTuple) / _make_stem(x) that is called by the builder so that you can dispatch on the type.
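That intermediate dispatch could look roughly like this (building on the hypothetical resnet_stem sketch earlier in this thread; _make_stem is the name suggested above, everything else is illustrative):

# Illustrative only: dispatch on what the user passed for the stem.
# A NamedTuple is treated as keyword arguments for the default stem builder;
# anything else is assumed to already be a usable stem model.
_make_stem(stem_args::NamedTuple) = resnet_stem(; stem_args...)
_make_stem(stem) = stem

# both forms would then route through the same builder:
# resnet(basicblock, [2, 2, 2, 2]; stem = (; mode = :deep))            # NamedTuple path
# resnet(basicblock, [2, 2, 2, 2]; stem = resnet_stem(mode = :deep))   # explicit model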

It's a lot easier to keep track of names instead of the order of arguments to be passed in, and it's annoying to have to pass in five irrelevant arguments just to get to the last relevant one.

Yeah, I think the resnet_stem function does not need to be positional. It can accept keywords for this purpose.

The major issue is in terms of getting the model structures to overlap. DropBlock and DropPath add some functionality but at the cost of having some extra identitys in the model for the default cases, which is why I was looking at FluxML/Flux.jl#2004.

No hurry on this. Also, the script linked in the HuggingFace model cards doesn't depend on structure. It turns the Flux model into a state dict-like dictionary then just iterates the keys together with the PyTorch state dict. It might just work for your model since the DropBlock stuff does not contain parameters that would affect the state dict.

@theabhirath
Member Author

theabhirath commented Jun 25, 2022

  • Try to consolidate as many keyword arguments into the blocks themselves. So if cardinality is just directly passed to the block, then we don't need to explicitly declare it in resnet. In terms of documentation, the default blocks' docstrings should detail these arguments and resnet defers to those.

I was trying to come up with a more declarative API, but one of the problems that we might face is in terms of documentation. Since these blocks have a lot of arguments, directing end-users to refer to the documentation for these blocks might cause some confusion. I'm a little uncertain if that's desirable. Maybe we keep the declarative API but document the kwargs one level higher anyways?

This also causes quite a bit of argument hiding (i.e. builder functions aren't explicitly accepting the arguments to be passed to the lower level ones, but instead a NamedTuple or Dict), which I'm not sure is the right way to go. The API becomes a little less clear for both end users and package developers.

@darsnack
Member

builder functions aren't explicitly accepting the arguments to be passed to the lower level ones but instead a NamedTuple or Dict

Don't they just accept something like block_args...?

Maybe we keep the declarative API but document the kwargs one level higher anyways?

That's okay

@theabhirath
Member Author

Don't they just accept something like block_args...

I could, but this has the same problem - the function doesn't clearly "see" the kwargs being passed in, which means we're essentially banking on users of this function to play nice. Which is fine, but it feels kinda wrong to have too many functions where the inputs aren't regulated

@darsnack
Member

This is a natural conflict between designing something to be flexible vs. safe. In general, Julia code tries to be more permissive, especially at the lower level API. This is what makes it possible to smash together two totally separate packages and get a useful result without too much hacking. This approach definitely requires more care, and I find the best way to work through this is to just try and be permissive until you hit a roadblock. Usually that experience is most informative about the design space. Let me try and walk through some of that process below.

it feels kinda wrong to have too many functions where the inputs aren't regulated

They are regulated, just not by resnet. For example, take the cardinality keyword and suppose it does not have a default. Then the user is expected to specify it.

  1. If resnet explicitly has this keyword, then the user who fails to specify it will get a MethodError for the resnet method with the cardinality keyword highlighted as missing.
  2. If resnet does not have this keyword, then the user who fails to specify it will get a MethodError for the basicblock (or whatever block_fn) method with the cardinality keyword highlighted as missing.

Either way, the user gets the same error. A similar outcome will happen for invalid keywords (with a slightly more informative error too) or for positional arguments. I would argue that getting the error for (2) is more informative because it signals that it is specifically the block_fn and the keyword that is incompatible. For (1), if we choose to restrict block_fn, then we can validate the arguments and provide even more informative errors, but this means that the entire API design is less flexible. If arguments are meant to be in specified ranges, etc. then that kind of assertion should happen inside block_fn where the stack trace itself is as informative as possible about the location of the error.
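A self-contained toy illustrating where the error surfaces in case (2). These are stand-ins, not the Metalhead functions; also note that in recent Julia a missing keyword without a default actually raises an UndefKeywordError rather than a MethodError (an unsupported keyword raises a MethodError), but the point about where the error is raised is the same:

# stand-in block with a `cardinality` keyword that has no default
toy_block(inplanes, outplanes; stride, cardinality) = (inplanes, outplanes, stride, cardinality)

# stand-in builder: never mentions `cardinality`, just forwards keywords to the block
function toy_resnet(block_fn, layers; block_kwargs...)
    return [block_fn(64, 64; stride = 1, block_kwargs...) for _ in 1:sum(layers)]
end

toy_resnet(toy_block, [2, 2])                    # UndefKeywordError for `cardinality`, raised at the block call
toy_resnet(toy_block, [2, 2]; cardinality = 32)  # works; the keyword was forwarded untouched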

Maybe there is a specific kind of error that you are expecting that isn't covered well here? We should discuss that case in more detail then. Also, remember that this is a fairly low-level portion of the API. There is an expectation that the user can read Julia errors here (i.e. not the same level as ResNet where we want to be very beginner friendly). I recommend reading oxinabox's post on Julia anti-patterns and specifically the section on over-specifying argument types for an intro to this "letting the error get thrown eventually" philosophy (relatedly, this post is also good here). Where we want to intercept errors early are cases where we expect the average user to be a beginner, or where the resulting default error is misleading or cryptic.


Documenting the interface is a related but different concern. The interface should appear intuitive by itself.

I would argue that having many specified but restricted keywords is not intuitive. It requires reading the docstring to understand the behavior and how each one is used / when each is ignored. conv_bn is the perfect example of a poor keyword interface (which we've let go because it is very internal).

On the other hand, saying "arguments passed to block_fn" is unspecified but it clearly follows from your understanding of block_fn. If you are customizing the block_fn, then you must understand these keywords and their usage in block_fn no matter what the design is. But if you don't care about block_fn and want the default, then there is nothing for you to read and the interface naturally lets you ignore the keywords. Similarly, if you customize block_fn, then you can feel assured that the extraneous keywords for another block are irrelevant (and you are not forced to implement block_fn in a way that adheres to an interface that doesn't apply to you).

Here's an attempt at the docstring. Let me know what you think (and feel free to push back!). Of course, this would also require similar changes to _make_blocks to be more declarative too. This means basically factoring out code that _make_blocks does automatically into separate functions that can be passed in (e.g. the downsampling function doesn't need to be instantiated in _make_blocks and could be constructed and passed in as a single downsampler argument).

"""
    resnet(block, layers, stem = somedefault(); nclasses = 1000, inchannels = 3, output_stride = 32,
           reduce_first = 1, activation = relu,
           norm_layer = BatchNorm, drop_rate = 0.0,
           block_kwargs...)
Creates the layers of a ResNe(X)t model. If you are an end-user, you should probably use
[ResNet](@ref) instead and pass in the parameters you want to modify as optional parameters
there.
# Arguments:
  - `block` / `block_kwargs`: The residual block to use in the model and the keyword arguments for it. See [basicblock](@ref) and [bottleneck](@ref) for
    example. This is called like `block(inplanes, outplanes; stride, block_kwargs...)`.
  - `layers`: A list of integers representing the number of blocks in each stage.
  - `stem`: The initial stage that operates on the input before the residual blocks. This can be any model that accepts the input and is compatible with the blocks stage. Defaults to [`somedefault`](#).
  - `nclasses`: The number of output classes. The default value is 1000.
  - `inchannels`: The number of input channels to the model. The default value is 3.
  - `output_stride`: The net stride of the model. Must be one of [8, 16, 32]. The default value is 32.
  - `reduce_first`: Reduction factor for first convolution output width of residual blocks.
    Default is 1 for all architectures except SE-Nets, where it is 2.
  - `activation`: The activation function to use. The default value is `relu`.
  - `norm_layer`: The normalization layer to use. The default value is `BatchNorm`.
  - `drop_rate`: The rate to use for `Dropout` before the fully-connected classifier stage. The default value is 0.0.
If you are an end-user trying to tweak the ResNet model, note that there is no guarantee that
all combinations of parameters will work. In particular, tweaking `block_kwargs` is not
advised unless you know what you are doing.
"""

I think the line: "This is called like block(inplanes, outplanes; stride, block_kwargs...)" is clear about the usage of block_kwargs. It's unspecified, but it is validated (when block itself is called) and it is clear that I need to know what keywords block accepts to understand this. At a higher level, like ResNet, I might think about also including another section "# Block arguments" that explains the standard arguments for basicblock and bottleneck.
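For concreteness, a call against that proposed signature might read roughly like this (illustrative only; whether bottleneck ends up taking cardinality as a keyword is an assumption here, not settled API):

# a ResNeXt-flavoured model through the proposed `resnet` builder:
# `cardinality` is forwarded untouched via `block_kwargs...`, so `resnet`
# itself never has to declare it
model = resnet(bottleneck, [3, 4, 6, 3]; nclasses = 1000, inchannels = 3,
               cardinality = 32)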

@theabhirath
Member Author

theabhirath commented Jun 25, 2022

Thank you for that writeup, it does clear some stuff up! I might need to do some homework before I get back with a response, but the two blog posts in particular might be good starting points in terms of understanding programming patterns in Julia a little better. I think most of my worry revolves around making ResNet safe - if someone is using resnet, I'm reasonably certain they know what they're doing. Right now ResNet is just a thin wrapper around resnet, though, so the documentation and the interaction between the two are something I am trying to get a cleaner picture of as this PR shapes up.

@theabhirath
Member Author

Okay, I've just pushed what I think is a more declarative interface (and it does look cleaner from the user's POV). This mostly revolves around exposing two arguments at the resnet level (and lower level builder functions as well): a *_fn and a *_args for the downsample block, the model stem and the main block. The *_fn is to allow flexibility around what choice to use, and the *_args is a NamedTuple for passing in arguments to the *_fn.
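Roughly, the call shape being described would read something like this (all of the names and argument values here are illustrative placeholders, not the final API):

# each component gets a `*_fn` choosing the builder and a `*_args` NamedTuple
# of arguments that is forwarded to that builder
model = resnet([3, 4, 6, 3];
               stem_fn = resnet_stem, stem_args = (; stem_type = :deep),
               block_fn = bottleneck, block_args = (; cardinality = 32),
               downsample_fn = downsample_conv, downsample_args = (;))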

I'm planning to rigorously document the choices of *_fn and *_args at the resnet level. So in this scheme, there are three "levels" we are catering to:

  1. End users who don't really care about experimental choices and just want to be able to instantiate a ResNet quickly without having to sift through a lot of complicated documentation. Currently, this is easy enough because ResNet will not have a lot of documentation around it. It will redirect the advanced user to consider resnet instead.
  2. Advanced users and writers of packages that depend on Metalhead.jl - for this level, the documentation surrounding resnet should be enough to try various experimental options without breaking stuff (of course, we will not guarantee this 😄 ).
  3. Metalhead.jl devs - unfortunately enough, we really do need to know exactly how every function works 😂. At this level, contributors and devs can read through comments and docstrings that I will populate for all these functions explaining practically everything so that there's no confusion on all the possible options.

The docs for this are missing because I wanna make sure that this interface is something that can be agreed upon before I proceed to write it up 😅 Any feedback is welcome!

1. Less keywords for the user to worry about
2. Delete `ResNeXt` just for now
@theabhirath
Member Author

Oh no. Did I manage to kill CI altogether somehow?

@theabhirath theabhirath reopened this Jun 29, 2022
Member

@darsnack darsnack left a comment

The design looks good, mostly minor changes here and there. I've been holding off on doing a full pass through all the other non-ResNet code, so I just did that and most of my comments are in those sections.

src/convnets/convmixer.jl (outdated, resolved)
src/convnets/resnets/core.jl (resolved)
src/convnets/resnets/core.jl (outdated, resolved)
Comment on lines 226 to 227
# inplanes increases by expansion after each block
inplanes = planes * expansion
Member

Suggested change
# inplanes increases by expansion after each block
inplanes = planes * expansion

Member Author

We need this, though. This is calculating the change in inplanes across blocks

Member

Maybe I am missing something but I don't see where the output of this calculation goes? It seems unused...unless it is modifying a global which is very bad.

Member Author

We were before, unfortunately. I've pushed a change. This makes resnet_planes return a vector instead of being a stage_idx-based callback - the reason we need this is that inplanes needs the planes from the previous block, not the current one, so we need to have access to that information.
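A rough sketch of that idea (not the actual code in this PR; the base width of 64 and the doubling per stage are just the standard ResNet convention):

# precompute `planes` for every block as a flat vector, so `inplanes` for
# block i can be read off from the planes (times expansion) of block i - 1
# instead of being tracked through a mutated local
function resnet_planes(block_repeats::Vector{<:Integer})
    return collect(Iterators.flatten(fill(64 * 2^(stage - 1), repeats)
                                     for (stage, repeats) in enumerate(block_repeats)))
end

resnet_planes([2, 2, 2, 2])
# 8-element Vector{Int64}: 64, 64, 128, 128, 256, 256, 512, 512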

src/convnets/resnets/core.jl (outdated, resolved)
src/convnets/resnets/seresnet.jl (outdated, resolved)
src/layers/conv.jl (outdated, resolved)
src/layers/conv.jl (outdated, resolved)
src/layers/pool.jl (outdated, resolved)
test/convnets.jl (outdated, resolved)
@theabhirath
Member Author

I've incorporated some of the docs changes and left out the others - these will need a thorough once-over anyway, and I want to try and get those in at the same time as the devdocs and the Documenter.jl port.

Co-Authored-By: Kyle Daruwalla <[email protected]>
@theabhirath theabhirath force-pushed the resnet-plus branch 2 times, most recently from d1d193a to 07c5c64 on July 29, 2022 17:53
Also misc. formatting and cleanup
@theabhirath
Member Author

I've also added Wide ResNet now (easy enough). But the CI is weird: I think my filtering should work, but the ResNet testset isn't executing at all.

@theabhirath
Member Author

Bump?

Member

@darsnack darsnack left a comment

Looks done to me. I just caught a couple last doc fixes and tests.

src/convnets/resnets/core.jl (outdated, resolved)
Comment on lines 226 to 227
# inplanes increases by expansion after each block
inplanes = planes * expansion
Member

Maybe I am missing something but I don't see where the output of this calculation goes? It seems unused...unless it is modifying a global which is very bad.

src/convnets/resnets/core.jl (resolved)
src/convnets/resnets/core.jl (resolved)
src/convnets/resnets/core.jl (resolved)
return Chain(stages...)
end

function resnet(img_dims, stem, get_layers, block_repeats::Vector{<:Integer}, connection,
Member

Docstring for each resnet?

Comment on lines +89 to +91
function depthwise_sep_conv_norm(kernel_size, inplanes, outplanes, activation = relu;
norm_layer = BatchNorm, revnorm = false,
use_norm = (true, true), stride = 1, kwargs...)
Member

Think this needs a docstring update

test/convnets.jl (resolved)
.github/workflows/CI.yml (outdated, resolved)
Member

@darsnack darsnack left a comment

Great job @theabhirath! This is a HUGE improvement, so I appreciate all the time you put into it. I'm gonna let tests run to completion.

@theabhirath
Member Author

Yeah, I think this is the longest PR in terms of review comments on this repo, but it thoroughly deserved the discussion 😄 Happy to see this one through.

@theabhirath
Member Author

theabhirath commented Aug 2, 2022

I've also now made PRs to the HuggingFace repositories for the models. Once they're accepted, I'll push the updated pretrained weights links and SHAs as well. It would be good to have all the tests enabled and all the tasks ticked off 😄

@theabhirath
Member Author

I've also now made PRs to the HuggingFace repositories for the models. Once they're accepted, I'll push the updated pretrained weights links and SHAs as well. It would be good to have all the tests enabled and all the tasks ticked off 😄

On second thought, we might not want this to block the PR... I want to try and use the updated torchvision weights with higher accuracies - there have been some API changes, so this may take a little more time.

@darsnack darsnack merged commit 7e4f9db into FluxML:master Aug 2, 2022
@theabhirath theabhirath deleted the resnet-plus branch August 2, 2022 13:53