
Quality Control: Data
The variety of internet-based biological databases makes it difficult to list quality control methods. Some general factors to consider include:
- For a standalone database or repository:
- The type of databased being accessed and whether it is curated. If so, what are the curation criteria?
- What are the quality assurance procedures implemented by database developers?
- If the data are subject to change, how often is the database updated?
- Are metadata available describing experimental details that are not captured in the database?
- Is it possible to evaluate the quality of the data within the database?
- If data are edited in the curation process, are raw data also accessible?
- For a data warehouse or data integration tool, all of the above apply to the collaborating databases. Additional considerations include:
- Frequency of data updates from collaborating databases.
- Quality assurance procedures to ensure that data from collaborating databases are accurately reported by the data warehouse/integration tool.
- If the database is new, it might worth double checking results of simple queries with collaborating databases using the same query.
- Provide user-feedback to database managers helps to maintain data quality.