{"id":48387,"date":"2019-05-14T07:00:48","date_gmt":"2019-05-14T06:00:48","guid":{"rendered":"https:\/\/www.clickworker.com\/?p=48387"},"modified":"2022-07-25T17:36:34","modified_gmt":"2022-07-25T16:36:34","slug":"realistic-training-data-for-machine-learning","status":"publish","type":"post","link":"https:\/\/www.clickworker.com\/customer-blog\/realistic-training-data-for-machine-learning\/","title":{"rendered":"Realistic training data for machine learning"},"content":{"rendered":"

\"training<\/p>\r\n

Data are the foundation for training algorithms. The more realistic the data, the better the results. This is because artificial intelligence is based on precise and reliable information for training its algorithms. This is obvious but it is often overlooked. The training data are realistic when they reflect the data that the AI system gathers in real operation. Unrealistic data sets prevent machine learning and lead to expensive false interpretations. <\/p>\r\n\r\n\r\n\r\n\r\n

Unsuitable training data are expensive<\/h2>\r\n\r\n

Artificial neural networks need to be fed good input to be able to learn – just like the human brain. Ultimately, it is the data that are used to train the systems that will determine what an AI system knows and can accomplish. When using artificially created and open data as training data you run a great risk of obtaining distorted results because the data are often not realistic. Artificial intelligence consists of algorithms that are fed data from which they are meant to learn – so-called machine learning. If data are used that are not realistic with regard to their use in the system, this can lead to insufficient or incorrect results in the system as illustrated in the following example. <\/p>\r\n\r\n

While developing a software for drone cameras the developers make use of photographs found on the Internet. These photos exist in ample supply on Facebook or Instagram. However, these photos have two typical features: <\/p>\r\n\r\n