Any specific recs for further reading?
I suspect the quality LLM development teams will pursue the same in-depth data sourcing & cleaning techniques that quality ML researchers are developing today. Or rather, they'll do something similar in principle to mitigate this issue.
I still agree with your conclusions. It will be a bigger consideration and less scrupulous teams will be more effected.
CFinley97
joined 1 year ago
Thank you! Super interesting to know that's the starting point for that term