Re: Text properties slow down XEmacs

Tuesday, 8 September 1998

        Hrvoje Niksic <hniksic(a)srce.hr&gt; writes:

...
 I'm more and more convinced that our extent-based text
properties
 interface is what is slowing XEmacs down for most practical purposes. 
Are you using a mule or a nonmule XEmacs? 

...

 Naturally, I count fontification of large buffers and Summary buffer
 creation as "practical purposes."  Profiles of Summary creation have
 been published in the past so there is no need to rehash that.  Let's
 look at a fontification profile instead.

 Where would you expect most time is being spent during fontification?
 My bet would be regexp-matching.  Well, wrong! 
It depends I guess. This is from fontifing events.c

Function Name                        Ticks    %/Total   Call Count
=================================    =====    =======   ==========
re-search-forward                    612      47.332    6960
put-nonduplicable-text-property      186      14.385    12140
map-extents                          94       7.270     7962
font-lock-fontify-keywords-region    87       6.729     2
syntactically-sectionize             74       5.723     2
(in garbage collection)              65       5.027     
looking-at                           52       4.022     8386
font-lock-match-c++-style-declaration-item-and-skip-to-next    28       2.166     4626
store-match-data                     14       1.083     4669
extent-property                      14       1.083     6616
match-data                           13       1.005     4669
byte-code                            12       0.928     2095

on a 21.0 Mule. Here is from a local 70K TeX file on 20.4-latin1

Function Name                              Ticks    %/Total
=======================================    =====    =======
put-nonduplicable-text-property            736      43.914
map-extents                                311      18.556
re-search-forward                          250      14.916
font-lock-fontify-keywords-region          236      14.081
extent-property                            24       1.432 
(in garbage collection)                    22       1.313 
syntactically-sectionize                   20       1.193 

...
 So, by *far* the most time (35%) is getting spent in the text-props
 code, in an XEmacs compiled *without* error checking. 
I was really surprised at the call counts for
put-nonduplicable-text-property. 

...
 I suspect this
 code is much faster in FSF Emacs because they support text properties
 directly through the use of balanced trees, while we emulate them
 using non-overlapping extents, which are stored as a simple ordered
 list in the buffer. 
AFAICS can see put-nonduplicable-text-property seems to hit a worst
case in in map_extents_bytind. It should be really trivial/fast to find the 
extents overlapping a certain region, especially when there only
non-overlapping extents..

It would be interesting to see where the time is spent in
map_extents_bytind. Note this in extents.c

   If the lists were stored as balanced binary trees, then operation (1)
   would take logarithmic time, which is usually quite fast.  However,
   currently they're stored as simple doubly-linked lists, and instead
   we do some caching to try to speed things up.

It seems as though the caching is not working in all cases. Maybe the
linked list should be replaced by a (splay) tree or a skip list.

...
 It's kind of sad that all of our enhancements to font-lock are
getting 
 nullified by these problems.  :-( 
Why is font-lock using text-properties anyway? Isn't what it does much 
more naturally mapped on extents.

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

2001

2000

1999

1998