Julia Evans

Day 13: BPTT, and debugging why a model isn't training is hard

Julia Evans · 875 words