[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] RE: [Xen-ia64-devel] Console problem on domU on tip?
I guest maybe the firmware on your machine doesn't implement this pal call due to there is no split I/D cache at that time, so when you call this pal call, it will return PAL_STATUS_UNIMPLEMENTED, Could you please turn on CONFIG_IA64_SPLIT_CACHE and try this new patch to see whether your machine can boot domain0? If this patch works, could you please remove all CONFIG_IA64_SPLIT_CACHE macro? Thanks -Anthony >-----Original Message----- >From: Magenheimer, Dan (HP Labs Fort Collins) [mailto:dan.magenheimer@xxxxxx] >Sent: 2005年12月19日 23:48 >To: Xu, Anthony; Tian, Kevin; xen-ia64-devel@xxxxxxxxxxxxxxxxxxx >Subject: RE: [Xen-ia64-devel] Console problem on domU on tip? > >I have been distracted tracking another bug... > >Here's where I got: > >The machine is a new (April 2005) HP rx2620 so it is >not old firmware. I can't reproduce it on a machine >with an ITP (which does have older firmware). > >This PAL call is never used in Linux, though there is a >routine coded for it. It is the only >PAL call coded in Linux that occurs with psr.ic off. > >The crash I am seeing occurs either during the PAL call or >immediately upon return. > >Is it OK to > > >> -----Original Message----- >> From: Xu, Anthony [mailto:anthony.xu@xxxxxxxxx] >> Sent: Monday, December 19, 2005 2:02 AM >> To: Tian, Kevin; Magenheimer, Dan (HP Labs Fort Collins); >> xen-ia64-devel@xxxxxxxxxxxxxxxxxxx >> Subject: RE: [Xen-ia64-devel] Console problem on domU on tip? >> >> Dan, >> Have you got time to verify below discussion? >> >> Thanks >> -Anthony >> >> >-----Original Message----- >> >From: Tian, Kevin >> >Sent: 2005年12月16日 10:16 >> >To: Xu, Anthony; 'Magenheimer, Dan (HP Labs Fort Collins)'; >> >'xen-ia64-devel@xxxxxxxxxxxxxxxxxxx' >> >Subject: RE: [Xen-ia64-devel] Console problem on domU on tip? >> > >> >>From: Xu, Anthony >> >>Sent: 2005年12月16日 9:54 >> >> >> >>>Also, why panic if it fails? >> >>> >> > >> >Panic is not required here, and we could just print out a >> warning message. >> >Previously panic is kept there to help our debug in early stage. >> > >> >> >> >> >> >>>Does the problem happen only on VTI? Or both VTI and non-VTI on >> >>>split-cache machines? >> >> >> >>Sometimes, it makes domain0 crash at the very beginning of >> the domain0 boot >> >>process, especially on MP machine. >> >> >> >> >> >>Thanks >> >>-Anthony >> > >> >One complement is, that problem definitely exists on new >> split-cache processors, >> >for dom0/domU. For VTI domain, we have logic within device >> model to ensure >> >consistence. >> > >> >Thanks, >> >Kevin >> >> >> >> >> >>>-----Original Message----- >> >>>From: Magenheimer, Dan (HP Labs Fort Collins) >> >>[mailto:dan.magenheimer@xxxxxx] >> >>>Sent: 2005年12月16日 1:39 >> >>>To: Tian, Kevin; xen-ia64-devel@xxxxxxxxxxxxxxxxxxx >> >>>Cc: Xu, Anthony >> >>>Subject: RE: [Xen-ia64-devel] Console problem on domU on tip? >> >>> >> >>>> >Is this code fragment necessary for VTI to boot domU >> >>>> >or is it OK to remove? >> >>>> >> >>>> The comment is inaccurate and it should be domU. That I/D cache >> >>>> sync step is mandatory to boot domU on new IA64 >> processor which has >> >>>> split L2 I/D cache. If without such I/D cache sync, control >> >>>> panel loads >> >>>> domU's kernel image which only affects D side cache. If >> there're some >> >>>> stale entry on I-side cache within same range of dom0 image, >> >>>> people will >> >>>> see machine going weird. >> >>> >> >>>I don't understand... how can there be stale entries in >> the I-cache? >> >>>The instructions have just been written to memory (through D-cache) >> >>>and no instructions in this domain have yet been executed. >> >>>I do see that the D-cache needs to be flushed so that memory is >> >>>coherent but are there better ways to do that without a pal call? >> >>> >> >>>> Normally I/D cache sync shouldn't force any problem. Possibly >> >>>> there's some problem with the pal calling code, like >> incorrect ITLB >> >>>> mapping for pal or similar issue... >> >>> >> >>>Although the ia64_pal_cache_flush routine is defined in >> linux's pal.h, >> >>>it doesn't appear to be used anywhere in Linux so there is no use >> >>>model to copy. I suspect there is some use model for the call that >> >>>we don't understand, for example maybe it should only be >> called with >> >>>physical &progress? It definitely fails every time on one of >> >>>my (newer) machines and disabling the pal call makes the problem >> >>>go away. >> >>> >> >>>> Though it's intermittent, please >> >>>> keep this code >> >>>> there for correctness. >> >>> >> >>>Since the call is definitely failing under some circumstances >> >>>that we don't understand, I'm inclined to at least put the code >> >>>in an #ifdef CONFIG_SPLIT_CACHE >> >>> >> >>>Does the problem happen only on VTI? Or both VTI and non-VTI on >> >>>split-cache machines? >> >>> >> >>>Thanks, >> >>>Dan >> >>> >> >>>P.S. I tried Anthony's patch (which moves the PAL call after >> >>>new_thread()) but it still crashes. >> Attachment:
idcache.patch _______________________________________________ Xen-ia64-devel mailing list Xen-ia64-devel@xxxxxxxxxxxxxxxxxxx http://lists.xensource.com/xen-ia64-devel
|
Lists.xenproject.org is hosted with RackSpace, monitoring our |