Články (1311)

# Titulek Zdroj Kategorie Score FB TG Staženo
1094 Competitive self-play OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1095 Nonlinear computation in deep linear networks OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1096 Learning to model other minds OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1097 Learning with opponent-learning awareness OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1098 OpenAI Baselines: ACKTR & A2C OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1099 More on Dota 2 OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1100 Dota 2 OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1101 Gathering human feedback OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1102 Better exploration with parameter noise OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1103 Proximal Policy Optimization OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1104 Robust adversarial inputs OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1105 Hindsight Experience Replay OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1106 Teacher–student curriculum learning OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1107 Faster physics in Python OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1108 Learning from human preferences OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1109 Learning to cooperate, compete, and communicate OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1110 UCB exploration via Q-ensembles OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1111 OpenAI Baselines: DQN OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1112 Robots that learn OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1113 Roboschool OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1114 Equivalence between policy gradients and soft Q-learning OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1115 Stochastic Neural Networks for hierarchical reinforcement learning OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1116 Unsupervised sentiment neuron OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1117 Spam detection in the physical world OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1118 Evolution strategies as a scalable alternative to reinforcement learning OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1119 One-shot imitation learning OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1120 Distill OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1121 Learning to communicate OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1122 Emergence of grounded compositional language in multi-agent populations OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1123 Prediction and control with temporal segment models OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1124 Third-person imitation learning OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1125 Attacking machine learning with adversarial examples OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1126 Adversarial attacks on neural network policies OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1127 Team update OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1128 PixelCNN++: Improving the PixelCNN with discretized logistic mixture likelihood ... OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1129 Faulty reward functions in the wild OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1130 Universe OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1131 OpenAI and Microsoft OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1132 #Exploration: A study of count-based exploration for deep reinforcement learning OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1133 On the quantitative analysis of decoder-based generative models OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1134 A connection between generative adversarial networks, inverse reinforcement lear... OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1135 RL²: Fast reinforcement learning via slow reinforcement learning OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1136 Variational lossy autoencoder OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1137 Extensions and limitations of the neural GPU OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1138 Semi-supervised knowledge transfer for deep learning from private training data OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1139 Report from the self-organizing conference OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1140 Transfer from simulation to real world through learning deep inverse dynamics mo... OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1141 Infrastructure for deep learning OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1142 Machine Learning Unconference OpenAI Blog ai 0.50 Ne 2026-02-26 08:25
1143 Team update OpenAI Blog ai 0.50 Ne 2026-02-26 08:25