Ah. I'd look at keras examples - they have alexnet and VGG right now. Although, an exact implementation may be hard to find - the best you can do is an "alexnet~ish" implementation keeping some keras graph limitations on convs.
I haven't run any in a while so I don't have the data myself, unfortunately.
Great work btw.
What was the motivation to build leaf?