deep_learning_theory Core supervised_learning logistic_regression cross_entropy_loss gradient_descent adaptive_optimizers Backprop + implementation backpropagation backpropagation_through_time gradient_checking weight_initialization batch_normalization vectorization neural_network_notation Metric learning + style transfer triplet_loss style_cost_function