Generative AI is built on large training data sets. Data can come from scraping content off of the internet or getting data sets from companies directly. Most people do not think about the amount of data that is being collected about them. Ethical companies will protect your data, but you should consider that they may be in the minority. Issues include:
Example:
You are using an AI tool that your healthcare provider promotes. You input your symptoms and, paired with your personal health data, receive a recommendation for treatment. If that system were hacked, it would expose an enormous amount of information about you. In addition to exposing PII, it could result in targeted healthcare scams, affect employment status, etc.
If you want to use an AI tool that requires very personal information, read the fine print to understand what you are agreeing to and how to protect yourself.
The ethics around Intellectual Property affect both the data going in and the data coming out and the legal issues are not settled. Issues include:
Example:
Your article is going to be published in a journal and you sign over copyright to the publisher. You do this for the benefit of increasing awareness of your work and the affect it may have on your reputation. The publisher later sells their content to a company creating an AI tool that turns it into customizable online tutorials, with no credit or compensation to you or the other authors.
While it may be legal, is it ethical? Can you put yourself in the shoes of the content creators?
Attention must be paid to including the appropriate training data and taking steps to remove bias from the evaluation process. AI is trained to look for patterns as a way to learn skills, so careful data set curation and algorithm transparency are required to ensure fairness and inclusivity. Issue include:
Example:
Amazon created a tool to screen resumes. It was built from the resumes of top-performing employees. The result was that the tool took on the human biases of the manual screening process, which initially resulted in hiring these people. AI recognizes patterns in the training data and, by design, runs with them - good or bad.
Keep this in mind when evaluating content from an AI tool. Question if there may be bias present in the results.
The environmental impact of generative AI primarily stems from the large computational resources required to train and run these models. This is an ethical challenge for AI users. Issues include:
It’s an exciting world of opportunity, but is the environmental impact worth the reward?
ChatGPT was used to generate ideas for this topic, using prompts such as “Briefly describe ethical issues related to generative AI.”