Abstract (EN):
The partial derivative automaton (A(pd)) is usually smaller than other non-deterministic finite automata constructed from a regular expression, and it can be seen as a quotient of the Glushkov automaton (A(pos)). By estimating the number of regular expressions that have epsilon as a partial derivative, we compute a lower bound of the average number of mergings of states in A(pos) and describe its asymptotic behaviour. This depends on the alphabet size, k, and its limit, as k goes to infinity, is 1/2. The lower bound corresponds exactly to consider the A(pd) automaton for the marked version of the regular expression, i.e. where all its letters are made different. Experimental results suggest that the average number of states of this automaton, and of the A(pd) automaton for the unmarked regular expression, are very close to each other.
Language:
English
Type (Professor's evaluation):
Scientific
Contact:
sbb@ncc.up.pt; ajmachia@fc.up.pt; nam@ncc.up.pt; rvr@ncc.up.pt
No. of pages:
12