Introduction to GANs
Discriminator learns from real dataset and make judgement of input data.
Generator uses noise vestor to generate fake data and feed into discriminator, based on the feedback it can generate better samples.
Planning means identifying appropriate actions and their sequence, it’s the basis of rational behavior.
We need to design intelligent machines because:
So the machine should learn how to work rationally by itself.
Alpha Go: beat the best human Go player
AlphaFold: a solution to the protein folding
Reinforcement Learning is a feedback-based machine learning technique: an agent learns to behave in an environment by performing the actions and getting the feedback.
Sequential decision making + long-term goal(getting the maximum positive rewards)
No pre-program and only learns from its own experience
Perceives its environment through sensors and acts upon the environment through actuators
Rational agent: Acts to achieve the best (expected) outcome
For each possible percept sequence, a rational agent should select an action that is expected to maximize its performance measure, given the evidence provided by the percept sequence and whatever built-in knowledge the agent has.
Regular ML: build a model based on sample data $x$ in order to make prediction
Attack: add invisible noise $\eta$ to clean sample data and fool the model(make the wrong prediction)
Adversarial Machine Learning: attempts to fool models by supplying deceptive input
The malfunction of a ML model may cause damage in many aspects.
Paper: https://arxiv.org/abs/1902.04885
As the technology grows rapidly, more and more attentions are paid to Artificial Intelligence, one of the most famous examples is Alpha Go. It used 30000 games to train the model and finally defeated the top human Go players. The current rapid development of AI largely depends on the availability of Big Data.