At the beginning of this year I started moving from web development into machine learning. Data cleaning was one of the things I found extremely difficult.
Here's how you can get started with data cleaning.
(so that you don't make the mistakes I did)
🧵👇
First of all, what is data cleaning? 🤔
Data cleaning is the process of fixing and properly formatting your data before you feed it to your neural network. This matters because badly prepared data can seriously hurt the accuracy of your neural net.
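To make that concrete, here is a minimal sketch of what "cleaning" can look like with pandas. The table and its column names are made up for illustration, and the choices (dropping duplicates, filling a missing age with the median) are just one reasonable option, not the only right answer.

```python
import pandas as pd

# Hypothetical messy data: a duplicated row, a missing value,
# numbers stored as text, and inconsistent spacing/casing.
raw = pd.DataFrame({
    "name": ["Ann", "Bob", "Bob", "Cara"],
    "age": ["34", "29", "29", None],
    "city": [" london", "Paris ", "Paris ", "BERLIN"],
})

# Remove exact duplicate rows
clean = raw.drop_duplicates().copy()

# Convert text to numbers (None becomes NaN), then fill the gap with the median
clean["age"] = pd.to_numeric(clean["age"])
clean["age"] = clean["age"].fillna(clean["age"].median())

# Normalise whitespace and casing in text columns
clean["city"] = clean["city"].str.strip().str.title()

clean = clean.reset_index(drop=True)
print(clean)
```

Whether you fill, drop, or flag missing values depends on the dataset, so treat this as a starting point.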
In the real world, data will be incredibly messy. It is your job to filter the data and format it the right way. This picture explains data cleaning really well 👇
So how do you get started with data cleaning?
You need to know some slightly advanced concepts first. Check out this thread for more info
👇 https://twitter.com/PrasoonPratham/status/1313745702439153664?s=20
Now let's look at the libraries you must learn 👇
Pandas: Load data from files
NumPy: Modify data loaded with Pandas
Matplotlib + Seaborn: Visualise data
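Here is a rough sketch of how those libraries fit together. The CSV text below is made-up data standing in for a file on disk, and Seaborn is left out only to keep the example short (it builds on Matplotlib, so the plotting step would look similar).

```python
import io

import matplotlib
matplotlib.use("Agg")  # render off-screen so this runs without a display
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

# Stand-in for a CSV file (hypothetical data); cherry has no price
csv_text = "item,price\napple,1.5\nbanana,0.5\ncherry,\nmelon,4.0\n"

# Pandas: load the data
df = pd.read_csv(io.StringIO(csv_text))

# NumPy: modify the loaded values (replace NaN with the mean of the rest)
prices = df["price"].to_numpy()
prices = np.where(np.isnan(prices), np.nanmean(prices), prices)
df["price"] = prices

# Matplotlib: visualise the result
fig, ax = plt.subplots()
ax.bar(df["item"], df["price"])
ax.set_ylabel("price")

# Write the chart into an in-memory buffer; pass a filename to save it to disk
buf = io.BytesIO()
fig.savefig(buf, format="png")
```

With a real file you would pass its path to `pd.read_csv` instead of the `StringIO` object.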
Where to learn them from?
FreeCodeCamp has you covered with this course: https://www.youtube.com/watch?v=r-uOLxNrNk8&t=895s
Practising these skills on Kaggle is the next thing you have to do!
The Titanic dataset is the best place to start. https://www.kaggle.com/c/titanic
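As a first exercise, a cleaning pass on Titanic-style data might look something like the sketch below. The rows are made up and only reuse a few column names from Kaggle's train.csv; with the real file you would load it with `pd.read_csv("train.csv")`. The specific choices (median age, most common port, dropping Cabin) are common beginner defaults, not the one correct approach.

```python
import pandas as pd

# Made-up rows that mimic a few Titanic columns (not real passenger data)
df = pd.DataFrame({
    "Survived": [0, 1, 1, 0],
    "Sex": ["male", "female", "female", "male"],
    "Age": [22.0, None, 26.0, 35.0],
    "Cabin": [None, "C85", None, None],
    "Embarked": ["S", "C", None, "S"],
})

# Step 1: see how many values are missing in each column
print(df.isna().sum())

# Step 2: fill missing ages with the median age
df["Age"] = df["Age"].fillna(df["Age"].median())

# Step 3: fill the missing port with the most common one
df["Embarked"] = df["Embarked"].fillna(df["Embarked"].mode()[0])

# Step 4: drop a column that is mostly empty
df = df.drop(columns=["Cabin"])

# Step 5: turn text categories into numbers a model can use
df["Sex"] = df["Sex"].map({"male": 0, "female": 1})

print(df)
```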