The Importance of Using “Real” Test Data

During development, data is usually required to effectively test an application. The test data used during development often doesn’t reflect the real data that will be used in production. I recently worked on a large web-based application that was developed using test data written by the developers. This data was quite meaningless (often comical) and there was just enough of it to fill the layout. At the time this seemed fine. After all, it’s only test data, right? So what happened when we eventually migrated the thousands of records of real data? Lo and behold, things no longer fitted on the page and performance became a real issue. Fixing these problems delayed our go-live. Lesson learned:

Use test data that accurately reflects production data in both content and quantity.

OK, so when should real data be introduced into the system? Before writing any code would be ideal! That doesn’t mean getting all the data up front, just enough to demonstrate each feature as it’s developed. In good agile fashion, a test data suite should be developed incrementally along with the application. When anyone (developer, tester, customer, user, etc.) is testing out a new feature, they should be seeing how it functions with production-standard data.

So, where do we obtain real data? From the people who know best – the customers. They may be able to provide you with some data they already have in a database or spreadsheet. If not, then send them a spreadsheet containing the required columns and kindly ask them to populate it with real-world data.

Maintain a test data script

Automate the creation of test data using a database script. Keep this script up to date with schema changes, and wipe and reload the test data often so that it stays current.
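
As a rough illustration, here is a minimal sketch of such a script in Python using the standard sqlite3 and csv modules. The customers table, its name and address columns, and the reset_test_data function are all hypothetical, made up for this example. It assumes the customer has supplied their data as a CSV file with matching column headings.

```python
import csv
import sqlite3

# Hypothetical schema for illustration only.
# Update it here whenever the real schema changes.
SCHEMA = """
DROP TABLE IF EXISTS customers;
CREATE TABLE customers (
    id      INTEGER PRIMARY KEY,
    name    TEXT NOT NULL,
    address TEXT NOT NULL
);
"""


def reset_test_data(conn, csv_file):
    """Wipe the customers table and reload it from customer-supplied CSV rows.

    csv_file is any open text file whose header row contains
    'name' and 'address' columns. Returns the number of rows loaded.
    """
    # Drop and recreate the table so stale rows never linger.
    conn.executescript(SCHEMA)
    reader = csv.DictReader(csv_file)
    rows = [(r["name"], r["address"]) for r in reader]
    conn.executemany(
        "INSERT INTO customers (name, address) VALUES (?, ?)", rows
    )
    conn.commit()
    return len(rows)
```

To use it, open the customer's file with open("customers.csv", newline="") (the file name is again an assumption) and pass it to reset_test_data along with a database connection. Because the script drops and recreates the table each time, it can be run as often as needed.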

Use paging and filtering

When displaying a list of data, it is important to think about how much of it will be displayed at one time. If a large amount of data could be displayed, use paging or filtering so that the page can handle it. If you don’t implement paging, it is safest to cap the number of records that can be displayed, so that the application continues to function as the data grows.
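
One possible way to combine paging, filtering and a hard cap is sketched below in Python with sqlite3. The customers table, the fetch_page function and the MAX_PAGE_SIZE value of 100 are assumptions made for the example, not recommendations for any particular system.

```python
import sqlite3

MAX_PAGE_SIZE = 100  # assumed hard cap on rows returned per request


def fetch_page(conn, page=1, page_size=25, name_filter=""):
    """Return one page of (id, name) rows from a hypothetical customers table.

    page is 1-based. page_size is clamped to the range 1..MAX_PAGE_SIZE,
    so a caller can never pull the whole table in one request.
    name_filter is matched as a substring of the name column.
    """
    page = max(page, 1)
    page_size = max(1, min(page_size, MAX_PAGE_SIZE))
    offset = (page - 1) * page_size
    cur = conn.execute(
        "SELECT id, name FROM customers "
        "WHERE name LIKE ? "
        "ORDER BY id LIMIT ? OFFSET ?",
        # Note: any % or _ inside name_filter acts as a LIKE wildcard.
        (f"%{name_filter}%", page_size, offset),
    )
    return cur.fetchall()
```

Even if a caller asks for a page size of 1000, at most MAX_PAGE_SIZE rows come back. The ORDER BY clause keeps the pages stable from one request to the next.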

Real data improves the understanding of a system

By developing an application using real test data, both customers and developers gain a better understanding of how the system will be used. Customers are able to see how their information is handled and displayed by the system. Developers can see what kind of information the customer will be entering and ensure the design suits the customer’s needs.

Identify problems early

Real test data allows us to identify potential performance issues and display problems early. It can also help to identify and correct serious flaws in functionality early in the development cycle.

