What are good problems for data science?