Referências
[1] Vinod Nair Alex Krizhevsky e Geoffrey Hinton. THE CIFAR-10 DATABASE. 2012.
[2] Kian Katanforoosh Andrew Ng. Stanford University CS230 Deep Learning.
[3] Jimmy Lei Ba, Jamie Ryan Kiros e Geoffrey E. Hinton. Layer Normalization. 2016. [stat.ML].
[4] Richard Bellman. Dynamic Programming. Dover Publications, 1957. isbn: 9780486428093.
[5] Emma Brunskill. Stanford University CS234 Reinforcement Learning.
[8] Kaiming He et al. Deep Residual Learning for Image Recognition. 2015 [cs.CV]
[9] Yann LeCun. THE MNIST DATABASE of handwritten digits. 1998
[10] Shane Legg e Marcus Hutter. Universal Intelligence: A Definition of Machine Intelligence. 2007.
[11] DeepMind & University College London. Reinforcement learning course 2020.
[12] Tom M. Mitchell. Machine Learning. New York: McGraw-Hill, 1997. isbn: 978-0-07-042807-2.
[13] Andrew Ng. Stanford University CS229 Machine Learning.
[14] Andrew Ng. Stanford University Machine Learning.
[15] Alec Radford et al. “Language Models are Unsupervised Multitask Learners”. Em: (2019).
[16] Sebastian Raschka. Model Evaluation, Model Selection, and Algorithm Selection in Machine Learning. 2020. [cs.LG].
[21] Ashish Vaswani et al. Attention Is All You Need. 2017. [cs.CL].