Notes

April 24, 2020April 26, 2020 Quang General

On Proximal Policy Optimization

This is the story of a man named Proximal Policy Optimization (You may want to read this along with this article). His friends call him PPO. He's one of reinforcement learning most popular guys. PPO is a simple man, so simple that you can summarize how he acts in any given situation in just under … Continue reading On Proximal Policy Optimization

March 4, 2020 Quang AI

The Summary of Basic Reinforcement Learning Algorithms

Vanilla Policy Gradient (VPG): Make trajectories leading to rewards more likely to be chosen by the policy On policy Discrete and continuous Trust Region Policy Optimization (TRPO): Make trajectories leading to rewards more likely to be chosen by the policy but take the largest step in each update such that the KL-divergence between the updated … Continue reading The Summary of Basic Reinforcement Learning Algorithms

March 4, 2020 Quang AI

A refreshed take on AI

I recently came back to AI research with a new perspective. Previously, I built everything bottom up. Given that I wanted to do research in Computer Vision (CV), I would spend weeks learning all the basics of CV. It's like going into a rabbit hole, there are always so many things to learn that I … Continue reading A refreshed take on AI

February 27, 2020 Quang Weird stuffs

The Deb: acceleration of free falls

The deb not surprisingly have a system of scientific laws, rather very basic ones. For example, they know that objects fall down equally fast regardless of their mass. However, the way that they came up with it is quite interesting. We human discovered that by dropping objects of different masses from the top of the … Continue reading The Deb: acceleration of free falls

February 27, 2020 Quang Weird stuffs

The Deb: the garden corner

In the upper left corner of the garden, we have a wall that's made from lamps. Lots of lamps. Not normal lamps to be exact but fruits of the lightbulb tree. On the bottom of the wall, a tiny forest of mushroom grows. They give off beautiful dim green light at night. On the mushroom … Continue reading The Deb: the garden corner

November 9, 2017 Quang General

Illusion 100

We really want to have illusion 100 in Skyrim. But when we're confronted with the choice to use command cheat or not, we stopped. It's not about being a powerful mage. It's about becoming a powerful mage. If we can choose to suddenly have great ability in artificial intelligence, would we accept? We would but … Continue reading Illusion 100

November 9, 2017November 9, 2017 Quang General

Working in AI

I'm proud to tell everyone that I'm an AI scientist. That I can make machine tells which picture is a hotdog and which is not.

November 6, 2017 Quang General

What internship do you want?

We don’t know. We never tried to find out. We haven’t look deep enough. Internship is for learning specific, practical things, not to make profit for the company or to please people. So what specific, practical things do we want to learn? How do people do science, how do people invent things, how do people … Continue reading What internship do you want?

November 6, 2017November 6, 2017 Quang General

About summer internship

What’s great about it that you want it so much? It’s a good thing to put on resume to make hirers want me. What’s so great about having a job? So that I won’t be like the arrogant jobless people that my family so fond of talking about. So, if you get a job, who … Continue reading About summer internship

November 5, 2017November 5, 2017 Quang General

So what if the worst thing happen?

Suppose after finishing college we can't find a job in America. Is that so bad? We don't know. If we don't know then it must not be bad. We don't even know if we want it or not. What do we want to do after we graduate? No, why wait until graduation. We'll do it … Continue reading So what if the worst thing happen?