[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH] console: generalize the ability for domU access

To: Jan Beulich <jbeulich@xxxxxxxx>
From: "Daniel P. Smith" <dpsmith@xxxxxxxxxxxxxxxxxxxx>
Date: Thu, 3 Aug 2023 08:56:49 -0400
Arc-authentication-results: i=1; mx.zohomail.com; dkim=pass header.i=apertussolutions.com; spf=pass smtp.mailfrom=dpsmith@xxxxxxxxxxxxxxxxxxxx; dmarc=pass header.from=<dpsmith@xxxxxxxxxxxxxxxxxxxx>
Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=zohomail.com; s=zohoarc; t=1691067414; h=Content-Type:Content-Transfer-Encoding:Cc:Date:From:In-Reply-To:MIME-Version:Message-ID:References:Subject:To; bh=Hk5uuj6GbfvXUbIcemYhrHAe2Y8Su9mvkhSFvHr3MmI=; b=UOPd/sya61Xz9MfC4eciz+QRKnn3vmcnknBMd0C6MiZXe8XBaV+tkP9WsJgeMzgahQHhLD4Fuhr+QTZIRisLETBhEEYXydaBIIpTueBe/G6c8bqz5ZVrPlQQ9WdtivD8CaiXl++jMfPWL6bS8i4bGuvUoOznX/k6dLDztSjNliY=
Arc-seal: i=1; a=rsa-sha256; t=1691067414; cv=none; d=zohomail.com; s=zohoarc; b=KdFwQfA9sqJfTbFOL7ydd5VRfe6vl9ajKNp8mMSJ1PsJw60e6AhGkJdlewcBPkFBTBGHMskmkMScGGlhq8MOZT3u7iBO9PnCoZqpbVDQsF0E/u5LIG+sBv9EZ6QC/bbpG7H7pFH2bGTdqgWf0iLE9a1EWQ1xvWS1qOMirOfSTdg=
Cc: Christopher Clark <christopher.w.clark@xxxxxxxxx>, Luca Fancellu <luca.fancellu@xxxxxxx>, Henry Wang <Henry.Wang@xxxxxxx>, Bertrand Marquis <bertrand.marquis@xxxxxxx>, Volodymyr Babchuk <Volodymyr_Babchuk@xxxxxxxx>, Stefano Stabellini <stefano.stabellini@xxxxxxx>, Andrew Cooper <andrew.cooper3@xxxxxxxxxx>, Roger Pau Monné <roger.pau@xxxxxxxxxx>, George Dunlap <george.dunlap@xxxxxxxxxx>, Julien Grall <julien@xxxxxxx>, Stefano Stabellini <sstabellini@xxxxxxxxxx>, Wei Liu <wl@xxxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxxx
Delivery-date: Thu, 03 Aug 2023 12:57:06 +0000
List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On 8/2/23 07:01, Jan Beulich wrote:

On 01.08.2023 18:06, Daniel P. Smith wrote:

This patch reworks the console rotation logic to provide a general mechanism to
incorporate domU in to the rotation. It does so by walking the domain list
using the XSM console privlege check to determine if the domain is given access.

In reworking the rotation logic, the assumption that the hardware domain is the
first domain created is removed and is changed to explicitly locate the
hardware domain at boot.


I guess I'm unable to identify any "at boot only" code. I'm also
puzzled by this indicating there is a need to do so, when the global
variable "hardware_domain" is available, and you actually use it.

As to the "at boot only", there is a switch, which upon closer look Ihave broken and will now have to fix, that if the 2nd char of the'conswitch' boot param is 'x', then the console is not supposed to autoswitch to "dom0".

I was trying to keep the console_rx tracking logic the same, but I havealready diverted far from what was there, I might as well rewrite thatas well. Doing so would allow the value 0 can actually reflect domain id0 and not have to continue the off-by-one game played mapping domidfrom/to console_rx.

--- a/xen/drivers/char/console.c
+++ b/xen/drivers/char/console.c
@@ -473,45 +473,102 @@ static void cf_check dump_console_ring_key(unsigned char 
key)
   */
  static unsigned int __read_mostly console_rx = 0;

-#define max_console_rx (max_init_domid + 1)

+#define CON_RX_DOMID (console_rx - 1)

/* Make sure to rcu_unlock_domain after use */

  struct domain *console_input_domain(void)
  {
      if ( console_rx == 0 )
              return NULL;
-    return rcu_lock_domain_by_id(console_rx - 1);
+    return rcu_lock_domain_by_id(CON_RX_DOMID);
  }

static void switch_serial_input(void)

  {
-    unsigned int next_rx = console_rx;
+    struct domain *next, *d = console_input_domain();


Looks like "next" cannot be pointer-to-const just because the XSM hooks
still aren't properly const-ified. Oh well.


Something I can look at another point as XSM maintainer.

-    /*
-     * Rotate among Xen, dom0 and boot-time created domUs while skipping
-     * switching serial input to non existing domains.
-     */


While it would need adjustment, I don't think the comment wants deleting
altogether.


Ack.

-    for ( ; ; )
+    if ( d == NULL )


This covers both Xen receiving input and the domain receiving input having
gone away. Originally in the latter case the next sequential (in domid
numbering) domain would be switched to. In the new logic you start over
from the beginning of the domain list. Such a change in behavior (if
deemed acceptable at all, which I'm not convinced of) needs calling out in
the description.

No, the intent is to keep the existing behavior. Again, this is afallout from trying to track by domid with the assumptions that 1. onlyboot time constructed domains would be given access, 2. those domainswhere created with sequential domids and 3.) those domains never wentaway (or if they did, it would result in a dead spot in the rotation).As stated above, just dropping the pretense of tracking by domid willprovide a simpler solution and maintain the existing behavior, minus thepossibility of having dead spots in the rotation.

      {
-        struct domain *d;
+        if ( hardware_domain )
+        {
+            console_rx = hardware_domain->domain_id + 1;
+            printk("*** Serial input to DOM%d", CON_RX_DOMID);


Here and elsewhere - why %d when original code properly used %u? I also
think there are now quite a few too many of these all identical
printk()s.

Good question, I did not write the line, it was copy/paste fromelsewhere in the file and then continued to replicate from there.

+            goto out; //print switch_code statement & newline


Leftover development comment? (There's at least one more.)

Yes and no, the comment came from elsewhere in the file and early indevelopment it I place it here to later decide if it should stay (andget converted into a compliant comment). I will drop it in the nextiteration.

+        }
+        else


Please avoid "else" after an if() that ends in "return", "goto", or
alike.


Really? How would you propose handling common finalization when

completion happens during the execution of two branches of the logicalpurpose of the function? Do you want to see two separate if statementsof `if ( condition A )` and `if ( ! condition A )`?

+        {
+            for_each_domain(next)


What guarantees that the list won't change behind your back? You don't
hold domlist_read_lock here afaict. It might be that you're safe because
that lock is an RCU one and this function is only invoked at init time
or from some form of interrupt handler. But that's far from obvious and
will hence need both properly confirming and stating in a comment. (It
is actually this concern, iirc, which so far had us avoid iterating the
domain list here.)

It is better to error on the side of caution instead of assuming thiswill always be invoked in a safe manner. I will add a read lock for thedomain list.

+            {
+                if ( xsm_console_io(XSM_OTHER, next, CONSOLEIO_read) == 0 )
+                {
+                    console_rx = next->domain_id + 1;
+                    printk("*** Serial input to DOM%d", CON_RX_DOMID);
+                    goto out; //print switch_code statement & newline
+                }
+            }

- if ( next_rx++ >= max_console_rx )

+            console_rx = 0;
+            printk("*** Serial input to Xen");
+            goto out;
+        }
+    }
+
+    for ( next = rcu_dereference(d->next_in_list); next != NULL;
+          next = rcu_dereference(next->next_in_list) )


This looks like an open-coded continuation of for_each_domain() - I'm
afraid I'm wary of introducing anything like this.

Not exactly, for_each_domain() always starts with beginning of thedomain list and walks from that point forward. This open coded versionstats at domain d and walks from there to the end of the list. Which iswhy there is logic below, which uses for_each_domain(), to walk from thebeginning of the list until the next domain with console_io or thehardware domain, whichever occurs first.

What I did not want to do is potentially waste a lot of cycles doingfor_each_domain() with a continue until it reached the current domainand then start checking for the privilege.

I could take this and introduce a new macro, for_each_domain_from (or abetter name if there are suggestions) and use it here.

+    {
+        if ( hardware_domain && next == hardware_domain )
          {
              console_rx = 0;
              printk("*** Serial input to Xen");
-            break;
+            goto out;


Since you use "goto" anyway, this wants introducing a 2nd label (maybe
"xen"?) ahead of the identical code you add further down (immediately
ahead of the "out" label), to avoid code duplication.


Ack.

- d = rcu_lock_domain_by_id(next_rx - 1);

-        if ( d )
+        if ( xsm_console_io(XSM_OTHER, next, CONSOLEIO_read) == 0 )
          {
-            rcu_unlock_domain(d);
-            console_rx = next_rx;
-            printk("*** Serial input to DOM%u", next_rx - 1);
-            break;
+            console_rx = next->domain_id + 1;
+            printk("*** Serial input to DOM%d", CON_RX_DOMID);
+            goto out;
+        }
+    }
+
+    /*
+     * Hit the end of the domain list and instead of assuming that the
+     * hardware domain is the first in the list, get the first domain
+     * in the domain list and then if it is not the hardware domain or
+     * does not have console privilege, iterate the list until we find
+     * the hardware domain or a domain with console privilege.
+     */
+    if ( next == NULL )
+    {
+        for_each_domain(next)
+        {
+            if ( hardware_domain && next == hardware_domain )
+            {
+                console_rx = 0;
+                printk("*** Serial input to Xen");
+                goto out;
+            }
+
+            if ( xsm_console_io(XSM_OTHER, next, CONSOLEIO_read) == 0 )
+            {
+                console_rx = next->domain_id + 1;
+                printk("*** Serial input to DOM%d", CON_RX_DOMID);
+                goto out;
+            }
          }
      }

+ /*

+     * If we got here, could not find a domain with console io privilege.
+     * Default to Xen.
+     */


"Default to" is a little odd when there are no other options.


Fallback to?

+    console_rx = 0;
+    printk("*** Serial input to Xen");
+
+out:


Labels indented by at least one blank please.


Ack.

@@ -520,12 +577,11 @@ static void switch_serial_input(void)

static void __serial_rx(char c, struct cpu_user_regs *regs)

  {
-    switch ( console_rx )
-    {
-    case 0:
+    if ( console_rx == 0 )


By using CON_RX_DOMID everywhere else you try to carefully avoid and
open-coded assumptions on the precise biasing used there. With this
it would seem to me that here "CON_RX_DOMID > DOMID_MASK" would be
more in line with that model then.

Yep and it results in not correctly implementing the commandlineparameter behavior. Since I am already doing a domain list walk, insteadof having to convert from domids, this really should just be trackedwith a domain ref and use the rcu lock to catch if the domain drops atany point.

          return handle_keypress(c, regs);

- case 1:

+    if ( hardware_domain->domain_id == CON_RX_DOMID )


No check of hardware_domain against NULL?


Missed this one, will fix.

+    {
          /*
           * Deliver input to the hardware domain buffer, unless it is
           * already full.
@@ -538,31 +594,37 @@ static void __serial_rx(char c, struct cpu_user_regs 
*regs)
           * getting stuck.
           */
          send_global_virq(VIRQ_CONSOLE);
-        break;
-
-#ifdef CONFIG_SBSA_VUART_CONSOLE
-    default:
+    }
+    else
      {
-        struct domain *d = rcu_lock_domain_by_id(console_rx - 1);
+        struct domain *d = rcu_lock_domain_by_any_id(CON_RX_DOMID);

+ if ( d == NULL )

+            goto unlock_out;
+
+#ifdef CONFIG_SBSA_VUART_CONSOLE
          /*
           * If we have a properly initialized vpl011 console for the
           * domain, without a full PV ring to Dom0 (in that case input
           * comes from the PV ring), then send the character to it.
           */
-        if ( d != NULL &&
-             !d->arch.vpl011.backend_in_domain &&
+        if ( !d->arch.vpl011.backend_in_domain &&
               d->arch.vpl011.backend.xen != NULL )
+        {
              vpl011_rx_char_xen(d, c);
-        else
-            printk("Cannot send chars to Dom%d: no UART available\n",
-                   console_rx - 1);
+            goto unlock_out;
+        }
+#endif
+
+        if ( (serial_rx_prod - serial_rx_cons) != SERIAL_RX_SIZE )
+            serial_rx_ring[SERIAL_RX_MASK(serial_rx_prod++)] = c;


This is Dom0's buffer; I don't think sharing with DomU-s is correct.

I would disagree, it is the hypervisor's buffer that it decides to sharewith domains it trust. It just so happens that it always trusts thehardware domain. This is why I explicitly changed this to the XSM call,to express that when the system manager, by enabling this privilege onthe domain, has decided to trust these domains to have access to thehypervisor's buffer.

You also cannot ...

@@ -717,6 +779,8 @@ long do_console_io(
          rc = -E2BIG;
          if ( count > INT_MAX )
              break;
+        if ( CON_RX_DOMID != current->domain->domain_id )
+            return 0;

rc = 0;

          while ( (serial_rx_cons != serial_rx_prod) && (rc < count) )


... assume that by the time this hypercall is invoked input focus
hasn't switched. I think there's no way around a per-domain input
buffer, which of course would need setting up only for console-io-
capable domains.

Let's explore the exact concern here, the scenarios as far as I can seeit is as follows.

A person at the serial/console types keys for the current console domain(domA), then enters the console switch sequence, switching to anotherdomain (domB). DomA's CONSOLEIO_read hypercall arrives after the switchand thus is not sent the rx buffer contents. Then domB's CONSOLEIO_readarrives and then because `serial_rx_cons` and `serial_rx_prod` are notthe same, domB is sent the bytes that were intended for domA.

While a per domain console_io buffer would address this issue, I believethere is a simpler solution that can be extended depending on whether itis acceptable for the undelivered bytes to be dropped or not.

Simply upon switching, if serial_rx_cons and serial_rx_prod are set thesame, then no bytes will be leaked to domB from domA above. An extraprecaution could be taken to zero the serial_rx buffer. If guaranteeddelivery is desired, a list of buffer remnants could be drained onhypercall and console switching.

IMHO I believe the reality is while there is potential that the scenariocould happen, the probability is low. Doing a per domain buffer willalways incur the resource overhead even if the event never happens,while the above approach would only incur the resource overhead when thesituation occurs.

@@ -1107,7 +1171,7 @@ void __init console_endboot(void)
       * a useful 'how to switch' message.
       */
      if ( opt_conswitch[1] == 'x' )
-        console_rx = max_console_rx;
+        console_rx = 0;


I can't bring this change in line with the comment ahead of the if():
Won't this result in switch_serial_input() switching to Dom0?

Correct, if there is a Dom0 and it is the hardware domain. As noted, Imissed that I broke the behavior this was intended to create.


v/r.
dps

Follow-Ups:
- Re: [PATCH] console: generalize the ability for domU access
  - From: Jan Beulich

References:
- [PATCH] console: generalize the ability for domU access
  - From: Daniel P. Smith
- Re: [PATCH] console: generalize the ability for domU access
  - From: Jan Beulich

Prev by Date: Re: [PATCH] xenctrl_stubs.c: fix NULL dereference
Next by Date: [xen-4.16-testing test] 182107: tolerable FAIL - PUSHED
Previous by thread: Re: [PATCH] console: generalize the ability for domU access
Next by thread: Re: [PATCH] console: generalize the ability for domU access
Index(es):
- Date
- Thread

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.