
RE: [Orekit Developers] Multi-threading problems in OREKIT



Hello,

MAISONOBE Luc <luc.maisonobe@c-s.fr> wrote:

Hello,

Here are some thoughts about the multi-threading issues and the direction we are aiming at. We have tried to combine requests from several different users (some have been expressed publicly on this list or in the bug tracker, some have been sent directly to the Orekit team). It is mainly a trade-off between various requirements and includes contributions and ideas from several people.

I would like to present it here so everyone interested can give their opinion about it.

Some work has been done on this topic and committed to the forge in a dedicated branch. See TimeStampedGenerator and TimeStampedCache. For now, no classes use these features.

We would be happy to get some feedback about this.

best regards,
Luc


The ideas are based on the reasons why we have some singletons in Orekit. Singletons are used for caching, which is a really important feature. Without caching, for example, the performance of the Earth precession-nutation computation would be very poor.

Many server-based applications that use multi-threading no longer handle threads by themselves (as was the case in the Java 1.4 era). They rather use the Java 5 ExecutorService, and in most cases the implementations they use are thread pools provided by the factory method Executors.newFixedThreadPool. Such thread pools do *not* guarantee that the same thread will serve all requests from a specific client. In fact, when a request is submitted to the executor service, a random idle thread is woken up to execute it, and once the request completes the thread is put back into the idle pool where it can be picked up later. Threads are not started or stopped, they are recycled. There is no correlation at all between threads and requests; they are bound together almost at random. This clearly rules out ThreadLocal singletons, as their caching is very likely to be invalidated in this case (see below). In fact, using ThreadLocal puts a very stringent assumption on how the application handles its threads, so it cannot be reliably used in an intermediate-level, general-purpose library like Orekit. It is safe only at the application level, or in a dedicated support library where the global architecture is known in advance.
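As a small self-contained illustration (plain JDK code, not Orekit code; the class name is invented for the example), the sketch below submits several requests coming from the same logical client to a fixed thread pool. Each request may be picked up by a different pool thread, so each one may see a different, often still empty, ThreadLocal cache:

import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class ThreadLocalCacheIllustration {

    // per-thread "cache", similar to what a ThreadLocal singleton provides
    private static final ThreadLocal<String> CACHE = new ThreadLocal<String>() {
        @Override
        protected String initialValue() {
            return "empty";
        }
    };

    public static void main(String[] args) {
        ExecutorService pool = Executors.newFixedThreadPool(4);

        // several requests from the *same* logical client: each one may be
        // served by a different pool thread, hence see a different
        // (often still empty) per-thread cache
        for (int i = 0; i < 8; ++i) {
            final int request = i;
            pool.submit(new Runnable() {
                public void run() {
                    System.out.println("request " + request
                                       + " served by " + Thread.currentThread().getName()
                                       + ", cache was: " + CACHE.get());
                    CACHE.set("filled by request " + request);
                }
            });
        }

        pool.shutdown();
    }

}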

Some people have also proposed using memcached to handle such data. Memcached is certainly a complete solution, but it seems rather big and would not scale down. It would also be a huge dependency for Orekit, so this solution was discarded as well.

So we have some shared data cache we can't remove, and we must assume threads and requests are not bound together, and we don't want our cache to be invalidated at each request.

However, there is still hope. We can assume that a server would serve only a small number of remote clients (say a dozen or so) and that each client will issue requests that have some temporal locality, i.e. they all correspond to some reduced time range. One very important property shared by our singletons is that they are all date based. This is the major trick we used in the following design.

As an example, consider a typical operational ground system where several engineers work in parallel. At some time, we may have application A performing some computation for orbit determination on data centered around last week, application B computing next cycle maneuvers for the next 6 months, application C doing some history browsing on data covering last year, and application D doing real time monitoring on the current date. In this case, we have four applications, each using a different time range. If these applications use a shared central server to perform conversions between Earth and inertial frames, for example, the server will get random requests in these four time ranges only, and will need to cache very large Earth Orientation Parameter data in each case.

The current version of Orekit fails miserably in this case, as precession/nutation computations are cached in a singleton and the cache covers only a few days. So when a conversion request from one application is served just after a request from another application, the new request appears to be out of the current cache range, so the cache is invalidated and must be completely recomputed (which is computationally intensive). Then the request is answered, and the cache may be invalidated again just after that. We get cache misses all the time, and the cache ends up severely hindering performance instead of improving it.

So what do we propose to solve this problem?

We propose to set up a cache dedicated to sequences of TimeStamped objects that would handle a small number of disjoint ranges at the same time (in our use case, four ranges would be needed). This cache must be thread safe and it must store large data sets. The cache would be created empty and would be able to add new entries. The various ranges would be allocated as needed, up to a user-configured maximum number. Ranges could be configured with a maximum number of points or a maximum duration span to avoid huge memory consumption. Typically, a real-time monitoring application could set up ranges limited to a few hours, used as a rolling range as time flows, whereas station-keeping simulators would ask for almost month-long ranges as they go back and forth in the station-keeping cycle trying to optimize maneuvers. Entries within a single range would be invalidated in rolling cycles depending on request dates (inserting points at one side would involve dropping points at the other side, taking care that time locality is reversed for forward versus backward propagation). When a new range is needed, it would be allocated up to the maximum number without invalidating other ranges; but if the maximum number of ranges is exceeded (which would be a user configuration mistake), then another range would be dropped using some standard cache policy, typically LRU (Least Recently Used).

We propose to use a single class for all these caches:

 public class TimeStampedCache<T extends TimeStamped> {
 }

The T parameter would be the type of data cached (UTCTAIOffset for the UTC scale, or similar classes for EOP or planetary ephemerides). The cache would only provide thread safety, range handling, and entry handling within the ranges. It would not provide any computation on the entries.

This class would be used by higher level objects which would not store the complete data, but only a few references to the elements they currently need. For example, an object that needs simple or no interpolation, like UTC-TAI, would store only two references, to the previous and next points, whereas an object that needs higher degree polynomial interpolation, like precession/nutation, would store a sub-array of a dozen references. These objects being small, they could be reallocated, copied, or made immutable as needed. They would rely on the TimeStampedCache methods to retrieve the references. Three methods are foreseen for this in the TimeStampedCache class:

 T[] getNeighbors(final AbsoluteDate central, final int n)
    throws OrekitException;

 T getBefore(final AbsoluteDate date)
    throws OrekitException;

 T getAfter(final AbsoluteDate date)
    throws OrekitException;

This is a follow-on to the idea of keeping small local data in small objects that can exist in many instances, while keeping the big cache in a singleton. The small objects could avoid handling multi-threading at all if they are built to be immutable, for example, or they could handle simple synchronization or locking if they are mutable, while relying on an already thread-safe class to retrieve their data set. A sketch of this pattern is shown below.
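As a hedged sketch of that pattern (the LocalNeighborhood class and its constructor are invented for this example; only getNeighbors comes from the proposed TimeStampedCache API above), a small object could fetch the few references it needs once, at construction time, and keep only those:

import org.orekit.errors.OrekitException;
import org.orekit.time.AbsoluteDate;
import org.orekit.time.TimeStamped;

/** Hypothetical small object keeping only the few cached entries it needs. */
public class LocalNeighborhood<T extends TimeStamped> {

    /** References to the few cached entries surrounding the date of interest. */
    private final T[] neighbors;

    /** Build the local data set around a central date.
     * @param cache shared thread-safe cache holding the complete data
     * @param central date of interest
     * @param n number of neighbors to retrieve (2 for linear interpolation,
     * about a dozen for high degree polynomial interpolation)
     * @exception OrekitException if the cache cannot provide the entries
     */
    public LocalNeighborhood(final TimeStampedCache<T> cache,
                             final AbsoluteDate central, final int n)
        throws OrekitException {
        // the big shared data stays in the cache; we keep only n references
        this.neighbors = cache.getNeighbors(central, n);
    }

    /** Get the retrieved neighbors. */
    public T[] getNeighbors() {
        return neighbors.clone();
    }

}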

In order for the TimeStampedCache class to populate its ranges as requests arrive, it needs some way to compute the entries. This is done by providing the class with a generator at build time. A generator is simply an implementation of the following interface:

public interface TimeStampedGenerator<T extends TimeStamped> {

    /** Generate an entry to be cached.
     * @param date date at which the entry should be generated
     * @return generated entry (may be null if no entry can be generated)
     * @exception OrekitException if entry generation is attempted but fails
     */
    T generate(AbsoluteDate date) throws OrekitException;

}

Note that some dedicated logic is already foreseen to cope with time ranges that can never be extended (like leap seconds, which did not exist prior to 1972) but should not trigger errors if a request is made far in the past, and also to cope with time ranges that should theoretically be extensible but, due to missing data, cannot provide points at some requested date and should trigger an error. This is an implementation detail we will not describe in this message.

With this setting, a cache for UTC-TAI would provide a generator that is based on reading the UTC-TAI history file, a cache for planetary ephemeris would provide a generator that is based on reading JPL or IMCCE files, a cache for precession/nutation would provide a generator that is a self-contained computation using IERS conventions.
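Just to illustrate the shape of the interface (the DummyGenerator and Entry classes below are invented for the example; they do not correspond to any of the real generators mentioned above), a minimal generator could look like this:

import org.orekit.errors.OrekitException;
import org.orekit.time.AbsoluteDate;
import org.orekit.time.TimeStamped;

/** Toy generator producing one dummy entry exactly at the requested date.
 *  Real generators would instead read a data file (UTC-TAI history, JPL or
 *  IMCCE ephemerides) or run an IERS precession/nutation model. */
public class DummyGenerator implements TimeStampedGenerator<DummyGenerator.Entry> {

    /** Simple time-stamped entry holding a precomputed value. */
    public static class Entry implements TimeStamped {
        private final AbsoluteDate date;
        private final double value;
        public Entry(final AbsoluteDate date, final double value) {
            this.date  = date;
            this.value = value;
        }
        public AbsoluteDate getDate() {
            return date;
        }
        public double getValue() {
            return value;
        }
    }

    /** {@inheritDoc} */
    public Entry generate(final AbsoluteDate date) throws OrekitException {
        // a real implementation would compute or read the model data here;
        // returning null would mean no entry can be generated at this date
        return new Entry(date, 0.0);
    }

}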

Another point worth mentioning is that TimeStampedCache could handle multi-threading using the standard ReentrantReadWriteLock from the concurrency package instead of synchronized blocks. Such locks allow multiple reads at the same time, which is fine and scales well. They guard against multiple writes (i.e. cache extension or invalidation), which should never happen simultaneously.
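The usual pattern with such a lock is sketched below (plain java.util.concurrent code, not the actual TimeStampedCache implementation; the class and field names are invented): reads take the shared read lock, while writes, i.e. cache extension or invalidation, take the exclusive write lock:

import java.util.concurrent.locks.ReadWriteLock;
import java.util.concurrent.locks.ReentrantReadWriteLock;

public class ReadWriteLockedStore {

    private final ReadWriteLock lock = new ReentrantReadWriteLock();
    private double[] data = new double[0];

    /** Read access: many threads may hold the read lock simultaneously. */
    public double get(final int index) {
        lock.readLock().lock();
        try {
            return data[index];
        } finally {
            lock.readLock().unlock();
        }
    }

    /** Write access (cache extension or invalidation): exclusive. */
    public void replace(final double[] newData) {
        lock.writeLock().lock();
        try {
            data = newData.clone();
        } finally {
            lock.writeLock().unlock();
        }
    }

}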

So here are our current thoughts about this. We would like to know what other people think.

Best regards,
Luc

----------------------------------------------------------------
This message was sent using IMP, the Internet Messaging Program.