North Carolina State University¹, Lawrence Berkeley National Laboratory²,
Operational profiles are an essential part of software reliability engineering. Typically, they are created from the software requirements and through customer reviews. Creating operational profiles is often laborious and requires human intervention. Our approach builds an operational profile from the actual usage recorded in execution logs. The difficulty in using execution logs is that the amount of data to be analyzed is extremely large (more than a million records per day in many applications). Our solution efficiently identifies all the clustered sequences of events (patterns) that exist in the logs using the suffix array data structure.
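
To make the idea of mining repeated event sequences with a suffix array concrete, the following is a minimal sketch and not the implementation described here. It assumes the log has already been reduced to an ordered list of event names; the function names, the `min_len` and `min_count` parameters, and the sample events are all hypothetical. The suffix array is built by naive sorting, which is adequate only for illustration; logs with millions of records would call for a more efficient construction.

```python
def build_suffix_array(events):
    """Return the start index of every suffix of `events`, in sorted order.

    Naive construction (sorting suffix slices), chosen for clarity only.
    """
    return sorted(range(len(events)), key=lambda i: events[i:])


def repeated_patterns(events, min_len=2, min_count=2):
    """Map each repeated event sequence to its number of occurrences.

    A sequence is reported if it has at least `min_len` events and occurs
    at least `min_count` times (overlapping occurrences are counted).
    """
    n = len(events)
    sa = build_suffix_array(events)

    # lcp[r] = length of the common prefix of the suffixes at ranks r-1 and r.
    lcp = [0] * n
    for r in range(1, n):
        a, b = sa[r - 1], sa[r]
        k = 0
        while a + k < n and b + k < n and events[a + k] == events[b + k]:
            k += 1
        lcp[r] = k

    patterns = {}
    for length in range(min_len, max(lcp, default=0) + 1):
        run_start = 0
        for r in range(1, n + 1):
            # A run of adjacent suffixes sharing `length` events ends here.
            if r == n or lcp[r] < length:
                count = r - run_start
                start = sa[run_start]
                if count >= min_count and start + length <= n:
                    patterns[tuple(events[start:start + length])] = count
                run_start = r
    return patterns


# Hypothetical event stream extracted from an execution log.
sample_log = ["login", "search", "view",
              "login", "search", "buy",
              "login", "search", "view"]
found = repeated_patterns(sample_log)
```

In this sketch, suffixes that share a prefix sit next to each other in the sorted order, so every repeated sequence shows up as a run of adjacent suffixes. The relative frequencies of the reported patterns could then serve as raw input for estimating an operational profile.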