[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH v3 5/5] tools/xenstore: add migration stream extensions for new features

To: Juergen Gross <jgross@xxxxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxxx
From: Julien Grall <julien@xxxxxxx>
Date: Mon, 8 Aug 2022 12:00:18 +0100
Cc: Andrew Cooper <andrew.cooper3@xxxxxxxxxx>, George Dunlap <george.dunlap@xxxxxxxxxx>, Jan Beulich <jbeulich@xxxxxxxx>, Stefano Stabellini <sstabellini@xxxxxxxxxx>, Wei Liu <wl@xxxxxxx>
Delivery-date: Mon, 08 Aug 2022 11:00:42 +0000
List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

Hi Juergen,

On 08/08/2022 07:33, Juergen Gross wrote:

On 04.08.22 21:28, Julien Grall wrote:

On 03/08/2022 12:59, Juergen Gross wrote:

Extend the definition of the Xenstore migration stream to cover new
features:

- per domain features
- extended watches (watch depth)
- per domain quota

Signed-off-by: Juergen Gross <jgross@xxxxxxxx>
---
V3:
- new patch
---
  docs/designs/xenstore-migration.md | 85 ++++++++++++++++++++++++++++--
  1 file changed, 82 insertions(+), 3 deletions(-)

diff --git a/docs/designs/xenstore-migration.mdb/docs/designs/xenstore-migration.md

index efa526f420..b2b1d3d5c7 100644
--- a/docs/designs/xenstore-migration.md
+++ b/docs/designs/xenstore-migration.md
@@ -43,7 +43,13 @@ the setting of the endianness bit.
  |-----------|---------------------------------------------------|
  | `ident`   | 0x78656e73746f7265 ('xenstore' in ASCII)          |
  |           |                                                   |
-| `version` | 0x00000001 (the version of the specification)     |
+| `version` | The version of the specification, defined values: |
+|           | 0x00000001: all fields without any explicitly     |
+|           |             mentioned version dependency are      |
+|           |             valid.                                |
+|           | 0x00000002: all fields valid for version 1 plus   |
+|           |             fields explicitly stated to be        |
+|           |             supported in version 2 are valid.     |

I am a bit concerned with the bump of the versions. It means, it willbe necessary for Xenstored to know whether the new Xenstored speaks v1or v2. This is less an issue when Live-Migration (although there is afleet management problem) but it will be one for Live-Update if weneed to rollback.

So I am wondering if we can avoid to bump the version and use othermeans to detect the difference.


In the end this is exactly what the version was meant to be used for.

I think it would make much more sense to think about the way to handle a
bump of the version in a compatible way.

My idea was to add a xenstored command line parameter for limiting the
migration stream version to be used to a specified version, causing new
features probably to not be available, though.

I think this is fine. Someone that cares about rollback will also likelycare about fleet diversity. So they will want to avoid enabling afeature until they know it can work everywhere.


I don't see how e.g. a rollback would be doable in case a domain already
started to use a new feature like the third parameter when setting a watch.
Even if we'd drop the depth information when rolling back a watch set
afterwards with an additional depth added would be rejected by the older
xenstored, which would be unexpected failure for the guest.


See above.


It might make sense to try to use the V1 stream when doing a live update,
e.g. covering the case when the SET_FEATURE command was used for each
active guest to limit the features to V1 compatible ones. A force parameter
might be used to use the V1 stream even if guests are using V2 features,
risking breakage of those guests.

I don't have a strong opinion on this yet. I might have some when seenthe code :).


[...]

This would even be possible using the global record of V1, as
the length information of the record allows to add new fields without
having to bump the version.

I was actually thinking about this when writing the e-mail last week.There are no dynamic length array in the global records so far, so usingthe length information would be ok. I am more concerned about the othersbecause we are mixing fixed and dynamic length.


This means it is more difficult to read the code and the layout.

+| `n-glob-quota` | Number of quota values which apply globally  |
+|                | only. Valid only for version 2 and later.    |
+|                |                                              |
+| `quota-val`    | Quota values, first the ones applying per    |
+|                | domain, then the ones applying globally. A   |
+|                | value of 0 has the semantics of "unlimited". |
+|                | Valid only for version 2 and later.          |
+|                |                                              |
+| `quota-names`  | 0 delimited strings of the quota names in    |
+| | the same sequence as the `quota-val` values. | >+| | Valid only for version 2 and later. |
From my understanding, both version of Xenstored needs to agree onthe quota names. So it means the name have to be defined as part ofthe spec. At which point, I think it would be better to use ID.


I don't think so. For one the minimal set of quota names has been defined
already in patch 3.

Someone reading the migration stream will not necessarily read theXenstore protocol. So I think we should either make them explicit in thedocumentation or have a link to the other document.

And even with using an ID you'd have the same problem
again, but without having the possibility to add variant specific quota


Fair enough.

(remember that there already has been a statement that doing a live update
from C to OCAML or vice versa would probably break users due to some
deviations in behavior)

I can't find such statement in public documentation. Do you have a link?

That said, a guest doesn't have a (easy?) way to know how Xenstored isimplemented. So it is quite concerning to hear some of them may rely onbehaviors. How did that happen?

Also, can you clarify what would happen if the stream contains a quotanot supported by the new Xenstored?


Yes, I'll add a sentence that those should be ignored.

xenstored will resume in the original process context. Hence`rw-socket-fd` simply specifies the file descriptor of the socket. Sockets are notalways

@@ -145,7 +177,7 @@ the domain being migrated.
  ```
      0       1       2       3       4       5       6       7    octet
  +-------+-------+-------+-------+-------+-------+-------+-------+
-| conn-id                       | conn-type     |               |
+| conn-id                       | conn-type     | n-quota       |
  +-------------------------------+---------------+---------------+
  | conn-spec
  ...
@@ -154,6 +186,17 @@ the domain being migrated.
  +---------------+---------------+-------------------------------+
  | data
  ...
++-------------------------------+
+| features                      |
++-------------------------------+
+| quota-val 1                   |
++-------------------------------+
+...
++-------------------------------+
+| quota-val N                   |
++-------------------------------+
+| quota-names
+...
  ```
@@ -167,6 +210,10 @@ the domain being migrated.
  |                | 0x0001: socket                               |
  |                | 0x0002 - 0xFFFF: reserved for future use     |
  |                |                                              |
+| `n-quota`      | Number of quota values.                      |
+|                | Only for `conn-type` 0 (shared ring).        |
+|                | Only valid for version 2 and later.          |
+|                |                                              |
  | `conn-spec`    | See below                                    |
  |                |                                              |
  | `in-data-len`  | The length (in octets) of any data read      |
@@ -182,6 +229,22 @@ the domain being migrated.
  | `data`         | Pending data: first in-data-len octets of    |
  |                | read data, then out-data-len octets of       |
  |                | written data (any of both may be empty)      |
+|                |                                              |
+| `features`     | Value of the feature field visible by the    |
+|                | guest at offset 2064 of the ring page.       |
+|                | Aligned to the next 4 octet boundary.        |
+|                | Only for `conn-type` 0 (shared ring).        |

For the purpose of the stream, I would consider to make it availablefor the socket connection. This could potentially be used in thefuture to allow each application to have a different behavior whensocket is used.


This would break the use of xenstore-stubdom for such a setup.

I am not sure why it would break the use of xenstore-stubdom. Anapplication will already need to cope with the case Xenstored doesn'tsupport a feature.

At which point, it would be easy to say "I don't want this feature" whenusing a socket.

I can't make my mind yet if we can avoid bumping the version for thisfield. What would happen if we need to rollback?
I think an active usage of the new features and a rollback are mutually
exclusive. See above.
+|                |                                              |
+| `quota-names`  | 0 delimited strings of the quota names in    |
+|                | the same sequence as the `quota-val` values. |
+|                | Only for `conn-type` 0 (shared ring).        |
+|                | Only valid for version 2 and later.          |
As for the "global" quotas, I would move the quotas in a separaterecord. In this case, this would also be useful to avoid having maydynamic length field within the same record.
I like having the data together more.

Which is fine so long the code doesn't become too horrible toread/maintain. I think having dynamic length array in the middle of therecord makes it trickier.

This will only become worse as we introduce new fields in newerrevision. So at which point would you say the record has grown too much?

To me, this is already the point and we have plenty of record ID tohandle that.

In case of live update the connection record for the connection viawhich the live update command was issued will contain the response forthe live@@ -247,7 +310,7 @@ by a connection for which there is`CONNECTION_DATA` record previously present.

  ```
      0       1       2       3    octet
-+-------+-------+-------+-------+
++---------------+---------------+
  | conn-id                       |
  +---------------+---------------+
  | wpath-len     | token-len     |

@@ -256,6 +319,9 @@ by a connection for which there is`CONNECTION_DATA` record previously present.

  ...
  | token
  ...
++---------------+---------------+
+| depth         |               |
++---------------+---------------+
  ```

@@ -275,6 +341,13 @@ by a connection for which there is`CONNECTION_DATA` record previously present.

  |             |                                                 |
  | `token`     | The watch identifier token, as specified in the |
  |             | `WATCH` operation                               |
+|             |                                                 |
+| `depth`     | The number of directory levels below the        |
+|             | watched path to consider for a match. This      |
+|             | field is aligned to the next 4 octet boundary.  |
+|             | A value of 0xffff is used for unlimited depth.  |
+|             | This field is valid only for version 2 and      |
+|             | higher.                                         |

If we are going to bump the stream version, then I think we shouldmove the field before token/path.

I thought about that, but liked it better to be able to keep a commonstruct

layout for the record with the V2 fields being at the end.

Main reason is the ability to avoid duplication of code for being able to
handle both versions.

The cons is you can't easily describe the record in "struct ...". As Iwrote above, I think have dynamic length array in the middle of a recordis wrong.

I have looked at the code, I don't think there will be enough codeduplication to warrant adding fixed field at the end of the record.


Cheers,

--
Julien Grall

Follow-Ups:
- Re: [PATCH v3 5/5] tools/xenstore: add migration stream extensions for new features
  - From: Juergen Gross

References:
- [PATCH v3 0/5] tools/xenstore: add some new features to the documentation
  - From: Juergen Gross
- [PATCH v3 5/5] tools/xenstore: add migration stream extensions for new features
  - From: Juergen Gross
- Re: [PATCH v3 5/5] tools/xenstore: add migration stream extensions for new features
  - From: Julien Grall
- Re: [PATCH v3 5/5] tools/xenstore: add migration stream extensions for new features
  - From: Juergen Gross

Prev by Date: Re: [PATCH] xen/arm: regs: Fix MISRA C 2012 Rule 20.7 violation
Next by Date: [XEN PATCH 0/2] libxl: replace deprecated -sdl and -soundhw qemu options
Previous by thread: Re: [PATCH v3 5/5] tools/xenstore: add migration stream extensions for new features
Next by thread: Re: [PATCH v3 5/5] tools/xenstore: add migration stream extensions for new features
Index(es):
- Date
- Thread

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.