Training AI models effectively: is it better to have more data or higher-quality data?