Machine Learning: 2 Books in 1: Machine Learning for Beginners, Machine Learning Mathematics. An Introduction Guide to Understand Data Science Through the Business Application




Things you must know for machine learning
To be successful with machine learning, you need the right tools, just as
building a house requires both the skills and the tools for the job. The
following is a list of what you need to do machine learning.
Data
To start working with your data, you need enough of it to split into two
categories: training data and test data.
Training data is the data you use in the beginning when you are building
your model. When you are first creating your model, you need to give it
some data to learn from. With training data, you already know the
independent variables as well as their respective dependent variables. This
means that for every input, you already know the correct output. From this
data, your model learns to predict the output on its own. Training data
gives the model the parameters it needs to make predictions. This is the
data that our machine learns from.
Test data is the data you give the machine once you are satisfied with the
model, to see how it behaves on inputs it has never seen. The model is given
only the independent variables, not the outputs. You keep the known outputs
to yourself and compare them with the model's predictions, which shows how
well the model predicts an outcome for new data.
Your training data should account for most of your data, approximately
70%, while your test data is the remaining 30%. To avoid bias, split your
data into training data and test data completely at random. Don’t hand-pick
which data goes where, and don’t use the same data for both training and
testing. Start by giving the training data to the machine so it can learn
the relationships between X and Y, then use the test data to see how well
your model did.
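The random 70/30 split described above can be sketched in a few lines of
Python. This is only an illustration: the helper name train_test_split, the
fixed seed, and the toy data are assumptions made for the example, not part
of any particular library.

```python
import random

def train_test_split(rows, train_fraction=0.7, seed=0):
    """Shuffle the rows, then cut them into a training and a test list."""
    shuffled = list(rows)                  # copy, so the original order is kept
    random.Random(seed).shuffle(shuffled)  # random order avoids hand-picking
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]

# Hypothetical data: ten (x, y) pairs where y = 2 * x
data = [(x, 2 * x) for x in range(10)]
train, test = train_test_split(data)
print(len(train), len(test))  # 7 3
```

Because the list is shuffled once and then cut, no row can end up in both
the training data and the test data.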
The most important question to consider during this process is whether your
model will still work when it is presented with new data. You can check this
with cross-validation, which in its simplest form means testing your model
on data it has not seen yet. Set some data aside that you do not use during
training, and use it at the end to see how accurate your model is.
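To make this concrete, here is a minimal sketch of checking a model on data
that was set aside. It assumes a very simple model (a straight line fitted
by least squares) and made-up numbers; fit_line and mean_absolute_error are
hypothetical helper names chosen for the example.

```python
def fit_line(points):
    """Least-squares fit of y = slope * x + intercept to (x, y) pairs."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def mean_absolute_error(model, points):
    """Average distance between the predictions and the known outputs."""
    slope, intercept = model
    return sum(abs(y - (slope * x + intercept)) for x, y in points) / len(points)

# Hypothetical data: the model only ever learns from `train`.
train = [(1, 3.1), (2, 4.9), (3, 7.2), (4, 8.8), (5, 11.1)]
held_out = [(6, 13.0), (7, 15.1)]

model = fit_line(train)
print("error on training data:", mean_absolute_error(model, train))
print("error on held-out data:", mean_absolute_error(model, held_out))
```

If the error on the held-out data is much larger than the error on the
training data, the model has probably memorized its training data rather
than learned a pattern that carries over to new data.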
You can also use K-fold validation to check the accuracy of your model.
This method is pretty easy to use and generally unbiased. It’s a good
technique to use when we don’t have a lot of data to work with for testing.
For K-fold validation, we break our data into k folds, with k usually
between 5 and 10. Train the model on all the folds but one, test it on the
fold that was left out, and repeat until every fold has been used for
testing once. When you are finished, average the results across all the
folds. Usually, the larger k is, the less biased your estimate will be,
although every extra fold means training the model one more time.
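The K-fold procedure can be sketched as follows. To keep the example short,
the "model" here is just the average of the training outputs, which is an
assumption made purely for illustration; k_fold_indices and cross_validate
are hypothetical helper names, and the data is made up. In practice you
would shuffle the rows before splitting them into folds.

```python
def k_fold_indices(n_rows, k):
    """Yield (train_idx, test_idx) pairs; every row is tested exactly once."""
    indices = list(range(n_rows))
    # Spread any remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_rows // k + (1 if i < n_rows % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, test_idx
        start += size

def cross_validate(ys, k):
    """Return the average error over k folds and the per-fold errors."""
    fold_errors = []
    for train_idx, test_idx in k_fold_indices(len(ys), k):
        # Stand-in model: always predict the mean of the training outputs.
        prediction = sum(ys[i] for i in train_idx) / len(train_idx)
        error = sum(abs(ys[i] - prediction) for i in test_idx) / len(test_idx)
        fold_errors.append(error)
    return sum(fold_errors) / k, fold_errors

# Hypothetical outputs for ten rows of data
ys = [4.0, 6.0, 5.0, 7.0, 3.0, 5.0, 6.0, 4.0, 5.0, 5.0]
average_error, fold_errors = cross_validate(ys, k=5)
print(average_error, fold_errors)
```

Each row is used for testing exactly once and for training k - 1 times,
which is why this approach makes good use of a small dataset.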
So far, we have talked about models interpreting data to find meaning and
patterns. But what kind of data are we going to use? Where will we get our
data, and what is it going to look like?
Data is the most critical component of machine learning. After all, your
model learns only from data, so it’s important that your data is relevant
and meaningful. Data can come in many shapes and sizes, and it is structured
differently depending on the kind of data. The more structured the data is,
the easier it is to work with. Some data has very little structure, and this
data is harder to interpret. Data for facial recognition, for example, can
be huge and have very little meaning to the untrained eye.
Structured data is more organized. This is the type of data that you will
likely use when you are first starting out. It will help you get your feet
wet, and you can start understanding the statistics involved in machine
learning. Usually, structured data comes in a familiar form of rows and
columns. This is called a tabular dataset.
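As an illustration of a tabular dataset, the sketch below reads a tiny table
in which each row is one example and each column is one variable. The
column names and values are invented for this example; the last column
plays the role of the dependent variable, and the others are the
independent variables.

```python
import csv
import io

# Hypothetical tabular dataset: one house per row, one variable per column.
raw = """square_meters,bedrooms,age_years,price
70,2,15,210000
95,3,8,305000
120,4,3,410000
"""

rows = list(csv.DictReader(io.StringIO(raw)))

feature_names = ["square_meters", "bedrooms", "age_years"]
X = [[float(row[name]) for name in feature_names] for row in rows]  # inputs
y = [float(row["price"]) for row in rows]                           # outputs

print(X[0], y[0])  # [70.0, 2.0, 15.0] 210000.0
```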
