What is Google’s New Policy to Train AI?

Google updates its Privacy Policy  to use Public Data for AI Training

In its recent policy update, tech giant Google decided to collect data from all available sources on the internet to train its AI models, including Bard.

Under the new policy, Google will be able to collect data from a variety of public sources, including social media posts, government records, and the web. This data will be used to train AI models for various purposes, such as spam filtering, fraud detection, and language translation.

Google maintains that the use of public data is necessary to train AI models that are accurate and effective. The company also says it will take steps to protect user privacy, such as de-identifying data before it is used to train models.

Google’s policy page states: “We may share non-personally identifiable information publicly and with our partners, such as publishers, advertisers, developers or rights holders. For example, we share information publicly to show trends about the general use of our services.”

What is Google’s new policy?

Google’s policy on public data collection is not very transparent, so users should read the policy carefully to understand what information Google collects.

Here’s what the policy update mentions: “Google uses information to improve our services and develop new products, features and technologies that benefit our users and the public.

“For example, we may collect information that is publicly available online or from other public sources to help train Google’s AI models and build products and features such as Google Translate, Bard, and cloud AI capabilities. Or, if your business information appears on a website, we can index it and display it on Google services,” he added.

Previously, the company used this information to update and train its language models, improving its already available products, such as Google Translate. Now, the company has clearly mentioned that all public data will be used to upgrade its AI products.

Jagranjosh

Source: Google

The image above comes from Google policy files in which the green color represents newly added information.

The dangers of data scraping

This new policy update may cause serious cases of data mining and privacy issues. While companies typically keep user data confidential for future use and new product development, Google’s new policy allows the company to use any publicly available information to train its AI models.

This means that Google can access and process any type of data that is available on the Internet, including personal information. The company has mentioned that it anonymizes sources, but that can still be a problem.

First of all, it can violate people’s privacy. When data is extracted without permission, people may not know that their data is being collected or how it is used. This can lead to a number of problems, such as identity theft and financial fraud.

Second, data mining can be used to create biased AI models. If AI models are trained with data pulled from the Internet, the models can reflect biases that are already present in the data. This can lead to AI models that discriminate against certain groups of people.

Finally, data mining can disrupt the Internet. The most recent example of this is the Twitter outage. When data is extracted from websites, it can slow down and make it difficult to use.

Elon Musk showed his concern about data theft and decided to limit the number of tweets people can read per day. He is also continuously working to make the platform more secure by monetizing different services.

In conclusion, the new policy can surely help Google come up with powerful AI, but it will also be a security hazard. The policies could also lead to increased data mining and privacy violations. It is important to carefully monitor how Google implements these policies.

Categories: Optical Illusion
Source: ptivs2.edu.vn

Leave a Comment