A related concept I've been thinking about a lot recently is "data literacy" - understanding that useful data comes in the form of a collection of similarly shaped observations, for example a clean table of rows and columns. People with strong data literacy are more likely to understand why a PDF file of scanned paper forms split over multiple pages isn't a great way to distribute data!