Newer studies [0] show us that
a) p-hacking is still alive and well in the psychological community.
b) p-curves are not sufficient for detecting this.
That isn't a lack of proper design. It's a case of statistics being abused to show significance when there is none.
[0] https://psycnet.apa.org/doi/10.1027/2151-2604/a000383