Puneet Varma (Editor)

Semmle

Industry: Software analytics
Founder: Oege de Moor
Founded: December 2006 in Oxford, England
Headquarters: Blue Boar Court, Alfred Street, Oxford, England
Key people: Oege de Moor, Pavel Avgustinov, Julian Tibble
Products: Code analysis software and services

Semmle Limited is a software engineering analytics and code exploration provider with offices in Oxford, New York City, and Copenhagen. Its patented technology, built on compiler and data analysis research by a team from the University of Oxford, creates a knowledge base from all available data about the software development process (source code, issue tickets, development costs, team location, and so on) and lets users query that knowledge base accurately and efficiently. Results are presented in user interfaces designed for the different needs of developers, managers, and executives. The insights Semmle delivers are meant to help teams staff projects more intelligently, retain and develop talent better, control software costs, and deliver new functionality sooner.

The company is headquartered at Blue Boar Court, Alfred Street, in central Oxford, England. Its customers include Citi, Credit Suisse, NASA, and Dell.

Academic

SemmleCode builds on academic research on querying the source of software programs. The first such system was Linton's Omega system, in which queries were phrased in QUEL. Because QUEL did not allow recursion in queries, it was difficult to inspect hierarchical program structures such as the call graph. The next significant development was therefore the use of logic programming, which does allow such recursive queries, in the XL C++ Browser. The disadvantage of a full logic programming language, however, is that acceptable efficiency is very difficult to attain. The CodeQuest system, developed at the University of Oxford, was the first to exploit the observation that Datalog, a very restrictive version of logic programming, sits in the sweet spot between expressive power and efficiency. The QL query language is an object-oriented version of Datalog.
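
The kind of recursive call-graph query that QUEL could not express can be sketched in QL as follows. This is a minimal illustration and is not taken from SemmleCode itself: it assumes the Java library of present-day QL (`import java`, the `Callable` class, and its `calls` predicate), and `reaches` is a hypothetical helper name introduced here.

```ql
import java

/**
 * Holds if `caller` can reach `callee` through one or more calls.
 * `reaches` is a hypothetical helper; its recursive second branch is
 * the part that a non-recursive query language cannot express.
 */
predicate reaches(Callable caller, Callable callee) {
  caller.calls(callee)
  or
  exists(Callable mid | caller.calls(mid) and reaches(mid, callee))
}

// Report every callable that can call itself, directly or indirectly.
from Callable c
where reaches(c, c)
select c
```

Because the predicate is evaluated Datalog-style as a least fixed point, the recursion is well defined even when the call graph contains cycles.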

Industrial

Early research on querying the source of software programs spun off a number of industrial applications. In particular, it became the cornerstone of systems for application intelligence (data mining on the source of software systems) and software renovation. As of 2007, Paris-based CAST was one of the market leaders in that area, and other significant players included BluePhoenix in Herzliya, Israel. SemmleCode differs from these systems in its use of an object-oriented query language, which allows programmers to easily formulate new queries that are particular to their own project.

A full account of the academic and industrial developments leading up to the creation of SemmleCode can be found in a paper by Hajiyev et al.

Sample query in QL

To illustrate the use of QL, consider the well-known rule in object-oriented programming that public fields should be declared final. To find violations of that rule, we should search for fields that are public but not final. In QL, that requirement is expressed as follows:
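
A sketch of such a query, assuming the `Field` class and the `hasModifier`, `getDeclaringType`, and `getPackage` predicates of the QL library for Java (the `import java` line is what present-day QL requires and may not match the SemmleCode release described here):

```ql
import java

// Fields that are public but not final, reported together with the
// package and type that declare them.
from Field f
where
  f.hasModifier("public") and
  not f.hasModifier("final")
select f.getDeclaringType().getPackage(), f.getDeclaringType(), f
```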

Here not only is the offending field f selected, but also the package and type in which its declaration occurs.

Integration with development environments

SemmleCode provides a user interface via the Eclipse IDE for querying Java code (both source code and bytecode) and XML files, and for editing QL queries. This is, however, only one application of the underlying technology: QL can be used to query any other type of complex data.

References

Semmle Wikipedia