[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

AW: [Caml-list] Master-slave architecture behind an ocsigen server.

To: Anil Madhavapeddy <anil@xxxxxxxxxx>
From: Gerd Stolpmann <info@xxxxxxxxxxxxxxxxx>
Date: Thu, 28 Mar 2013 13:18:59 +0100
Cc: Martin Jambon <martin.jambon@xxxxxxxxxxxx>, caml users <caml-list@xxxxxxxx>, Philippe Veber <philippe.veber@xxxxxxxxx>, "cl-mirage@xxxxxxxxxxxxxxx List" <cl-mirage@xxxxxxxxxxxxxxx>, Alain Frisch <alain@xxxxxxxxx>
List-id: MirageOS development <cl-mirage.lists.cam.ac.uk>

Am 28.03.2013 12:02:46 schrieb(en) Anil Madhavapeddy:

On 28 Mar 2013, at 08:47, Alain Frisch <alain@xxxxxxxxx> wrote:

> On 03/28/2013 08:37 AM, Philippe Veber wrote:
>> Hi Martin,
>> nproc meets exactly my needs: a simple lwt-friendly interface to
>> dispatch function calls on a pool of processes that run on the same
>> machine. I have only one concern, that should probably bediscussed on>> the ocsigen list, that is I wonder if it is okay to fork theprocess>> running the ocsigen server. I think I remember warnings on havingparent>> and children processes sharing connections/channels but it'sreally not
>> clear to me.
>
> FWIW, LexiFi uses an architecture quite close to this for ourapplication. The main process manages the GUI and dispatchescomputations tasks to external processes. Some points to be noted:
>
> - Since this is a Windows application, we cannot rely on fork.Instead, we restart the application (Sys.argv.(0)), with specificcommand-line flag, captured by the library in charge of managingcomputations. This is done by calling a special function in thislibrary; the function does nothing in the main process and in thesub-processes, it starts the special mode and never returns. Thisgives a chance to the main application to do some globalinitialization common to the main and sub processes (for instance, wedynlink external plugins in this initialization phase).
>
> - Computation functions are registered as global values.Registration returns an opaque handle which can be used to call sucha function. We don't rely on marshaling closures.
>
> - The GUI process actually spawns a single sub-process (theScheduler), which itself manages more worker sub-sub-processes (witha maximal number of workers). Currently, we don't do very cleverscheduling based on task priorities, but this could easily be added.
>
> - An external computation can spawn sub-computations (by applying aparallel "map" to a list) either synchronously (direct style) orasynchronously (by providing a continuation function, which will beapplied to the list of results, maybe in a different process). Inboth cases, this is done by sending those tasks to the Scheduler.The Scheduler dispatches computation tasks to available workers. Inthe synchronous parallel map, the caller runs an inner event loop tocommunicate with the Scheduler (and it only accepts sub-tasks createdby itself or one of its descendants).
>
> - Top-level external computations can be stopped by the mainprocess (e.g. on user request). Concretely, this kills all workerscurrently working on that task or one of its sub-tasks.
>
> - In addition to sending back the final results, computations canreport progress to their caller and more intermediate results. Thisis useful to show a progress bar/status and partial results in theGUI before the end of the entire computation.
>
> - Communication between processes is done by exchanging marshaled"variants" (a tagged representation of OCaml values, generatedautomatically using our runtime types). Since we can attach specialvariantizers/devariantizers to specific types, this gives a chance tocustomize how some values have to be exchanged between processes(e.g. values relying on internal hash-consing are treated speciallyto recreate the maximal sharing in the sub-process).
>
> - Concretely, the communication between processes is done throughqueues of messages implemented with shared memory. (This componentwas developed by Fabrice Le Fessant and OCamlPro.) Largecomputation arguments or results (above a certain size) are stored onthe file system, to avoid having to keep them in RAM for too long (ifall workers are busy, the computation might wait for some time beingstarted).
Are all of the messages through these queues persistent, or just thelarger ones that are too big to fit in the shared memory segment, andare they always point-to-point streams?
We've got a similar need in Xen/Mirage for shared memorycommunication and queues, and have been breaking them out intostandalone libs such as:
https://github.com/djs55/shared-memory-ring
...which is ABI-compatible with the existing Xen shared memoryinterfaces, and also an OCaml version of the transport-agnostic APIsketched out in:
http://anil.recoil.org/papers/2012-resolve-fable.pdf

Interesting that there are now other shared memory implementations forOCaml. Note that there are a number of them in Ocamlnet, with somespecialities not yet mentioned. There is the Netcamlbox libraryproviding message boxes of limited size for exchanging OCaml valuesdirectly. That means the value is copied to the shared memory block bythe sender, and the receiver can pick it up there without copying itagain. Sender and receiver can map the memory at different addresses(the copy procedure invoked by the sender takes care of possibleoffsets, so that that Netcamlbox also allows the communication betweenprocesses that don't have a fork relation). There is no need formarshalling the value.


http://projects.camlcity.org/projects/dl/ocamlnet-3.6.3/doc/html-main/Netcamlbox.html

Going even beyond that, Netmulticore implements an "ancient" heap inshared memory (like Richard's Ancient lib, but with more options). Thisheap is organized like OCaml's major heap, and there is even a GCimplementation for it. There are a number of data structures (arrays,hash tables, queues, buffers) which are aware of residing in sharedmemory. For synchronization there are mutexes, semaphores and conditionvariables. So far the values to manipulate are already in sharedmemory, programming with Netmulticore feels a lot like programming withmulti-threading. In practice, however, you need to frequently copyvalues in and out, so it is not exactly as convenient. ForNetmulticore, all processes must map the shared memory to the sameaddress (easy with "fork").


http://projects.camlcity.org/projects/dl/ocamlnet-3.6.3/doc/html-main/Intro.html#netmulticore
http://projects.camlcity.org/projects/dl/ocamlnet-3.6.3/doc/html-main/Netmcore_tut.html

The missing link currently is the persistent queuing service, butwe're investigating the options here (ocamlmq looks rather nice).


There is also Netamqp, which can be used together with RabbitMQ.

http://projects.camlcity.org/projects/netamqp.html

Gerd

-anil


--
Caml-list mailing list.  Subscription management and archives:
https://sympa.inria.fr/sympa/arc/caml-list
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
Bug reports: http://caml.inria.fr/bin/caml-bugs




--
------------------------------------------------------------
Gerd Stolpmann, Darmstadt, Germany    gerd@xxxxxxxxxxxxxxxxx
Creator of GODI and camlcity.org.
Contact details:        http://www.camlcity.org/contact.html
Company homepage:       http://www.gerd-stolpmann.de
------------------------------------------------------------

References:
- Re: [Caml-list] Master-slave architecture behind an ocsigen server.
  - From: Anil Madhavapeddy

Prev by Date: Re: mirage + froc = self-scaling?
Next by Date: Re: mirage + froc = self-scaling?
Previous by thread: Re: [Caml-list] Master-slave architecture behind an ocsigen server.
Next by thread: mirage + froc = self-scaling?
Index(es):
- Date
- Thread

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.