[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Xen-devel] [v8][PATCH 07/16] hvmloader/e820: construct guest e820 table



On 07/16/2015 04:29 PM, Chen, Tiejun wrote:
> 
> 
> On 2015/7/16 23:16, George Dunlap wrote:
>> On 07/16/2015 04:04 PM, Chen, Tiejun wrote:
>>>> Yes, sorry, add_high_mem will be the size of memory *relocated*, not
>>>> the actual end of it (unless, as you say, the original highmem region
>>>> didn't exist).
>>>>
>>>> What I really meant was that either way, after adjusting the highmem
>>>> region in the e820, the end of that region should correspond to
>>>> hvm_info->high_mem_pgend.
>>>>
>>>> What about something like this?
>>>> ---
>>>>       /*
>>>>        * And then we also need to adjust highmem.
>>>>        */
>>>>       if ( add_high_mem )
>>>>       {
>>>>           /*
>>>>            * Modify the existing highmem region if it exists
>>>>            */
>>>>           for ( i = 0; i < nr; i++ )
>>>>           {
>>>>               if ( e820[i].type == E820_RAM &&
>>>>                    e820[i].addr == (1ull << 32))
>>>>               {
>>>>                   e820[i].size += add_high_mem;
>>>>                   break;
>>>>               }
>>>>           }
>>>>
>>>>           /*
>>>>            * If we didn't find a highmem region, make one
>>>>            */
>>>>           if ( i == nr )
>>>>           {
>>>>               e820[nr].addr = ((uint64_t)1 << 32);
>>>>               e820[nr].size = e820[nr].addr + add_high_mem;
>>>>               e820[nr].type = E820_RAM;
>>>>               nr++;
>>>>           }
>>>>
>>>>           /*
>>>>            * Either way, at this point i points to the entry containing
>>>>            * highmem.  Compare it to what's in hvm_info as a sanity
>>>>            * check.
>>>>            */
>>>>           BUG_ON(e820[i].addr+e820[i].size !=
>>>>                  ((uint64_t)hvm_info->high_mem_pgend << PAGE_SHIFT));
>>>>       }
>>>>
>>>
>>> Looks really better.
>>>
>>> I just introduce a little change based on yours, and I post this as a
>>> whole,
>>>
>>> diff --git a/tools/firmware/hvmloader/e820.c
>>> b/tools/firmware/hvmloader/e820.c
>>> index 7a414ab..8c9b01f 100644
>>> --- a/tools/firmware/hvmloader/e820.c
>>> +++ b/tools/firmware/hvmloader/e820.c
>>> @@ -105,7 +105,10 @@ int build_e820_table(struct e820entry *e820,
>>>                        unsigned int lowmem_reserved_base,
>>>                        unsigned int bios_image_base)
>>>   {
>>> -    unsigned int nr = 0;
>>> +    unsigned int nr = 0, i, j;
>>> +    uint32_t low_mem_end = hvm_info->low_mem_pgend << PAGE_SHIFT;
>>> +    uint64_t high_mem_end = (uint64_t)hvm_info->high_mem_pgend <<
>>> PAGE_SHIFT;
>>> +    uint64_t add_high_mem = 0;
>>>
>>>       if ( !lowmem_reserved_base )
>>>               lowmem_reserved_base = 0xA0000;
>>> @@ -149,13 +152,6 @@ int build_e820_table(struct e820entry *e820,
>>>       e820[nr].type = E820_RESERVED;
>>>       nr++;
>>>
>>> -    /* Low RAM goes here. Reserve space for special pages. */
>>> -    BUG_ON((hvm_info->low_mem_pgend << PAGE_SHIFT) < (2u << 20));
>>> -    e820[nr].addr = 0x100000;
>>> -    e820[nr].size = (hvm_info->low_mem_pgend << PAGE_SHIFT) -
>>> e820[nr].addr;
>>> -    e820[nr].type = E820_RAM;
>>> -    nr++;
>>> -
>>>       /*
>>>        * Explicitly reserve space for special pages.
>>>        * This space starts at RESERVED_MEMBASE an extends to cover
>>> various
>>> @@ -191,16 +187,91 @@ int build_e820_table(struct e820entry *e820,
>>>           nr++;
>>>       }
>>>
>>> -
>>> -    if ( hvm_info->high_mem_pgend )
>>> +    /*
>>> +     * Construct E820 table according to recorded memory map.
>>> +     *
>>> +     * The memory map created by toolstack may include,
>>> +     *
>>> +     * #1. Low memory region
>>> +     *
>>> +     * Low RAM starts at least from 1M to make sure all standard
>>> regions
>>> +     * of the PC memory map, like BIOS, VGA memory-mapped I/O and
>>> vgabios,
>>> +     * have enough space.
>>> +     *
>>> +     * #2. Reserved regions if they exist
>>> +     *
>>> +     * #3. High memory region if it exists
>>> +     */
>>> +    for ( i = 0; i < memory_map.nr_map; i++ )
>>>       {
>>> -        e820[nr].addr = ((uint64_t)1 << 32);
>>> -        e820[nr].size =
>>> -            ((uint64_t)hvm_info->high_mem_pgend << PAGE_SHIFT) -
>>> e820[nr].addr;
>>> -        e820[nr].type = E820_RAM;
>>> +        e820[nr] = memory_map.map[i];
>>>           nr++;
>>>       }
>>>
>>> +    /* Low RAM goes here. Reserve space for special pages. */
>>> +    BUG_ON(low_mem_end < (2u << 20));
>>> +
>>> +    /*
>>> +     * Its possible to relocate RAM to allocate sufficient MMIO
>>> previously
>>> +     * so low_mem_pgend would be changed over there. And here
>>> memory_map[]
>>> +     * records the original low/high memory, so if low_mem_end is less
>>> than
>>> +     * the original we need to revise low/high memory range in e820.
>>> +     */
>>> +    for ( i = 0; i < nr; i++ )
>>> +    {
>>> +        uint64_t end = e820[i].addr + e820[i].size;
>>> +        if ( e820[i].type == E820_RAM &&
>>> +             low_mem_end > e820[i].addr && low_mem_end < end )
>>> +        {
>>> +            add_high_mem = end - low_mem_end;
>>> +            e820[i].size = low_mem_end - e820[i].addr;
>>> +        }
>>> +    }
>>> +
>>> +    /*
>>> +     * And then we also need to adjust highmem.
>>> +     */
>>> +    if ( add_high_mem )
>>> +    {
>>> +        /* Modify the existing highmem region if it exists. */
>>> +        for ( i = 0; i < nr; i++ )
>>> +        {
>>> +            if ( e820[i].type == E820_RAM &&
>>> +                 e820[i].addr == ((uint64_t)1 << 32))
>>> +            {
>>> +                e820[i].size += add_high_mem;
>>> +                break;
>>> +            }
>>> +        }
>>> +
>>> +        /* If there was no highmem region, just create one. */
>>> +        if ( i == nr )
>>> +        {
>>> +            e820[nr].addr = ((uint64_t)1 << 32);
>>> +            e820[nr].size = high_mem_end  - e820[nr].addr;
>>> +            e820[nr].type = E820_RAM;
>>> +            nr++;
>>> +        }
>>> +
>>> +        /* A sanity check if high memory is broken. */
>>> +        BUG_ON( high_mem_end != e820[i].addr + e820[i].size);
>>
>> The reason I wrote it the way I did was so that we would cross-check our
>> lowmem adjustments (via add_high_mem) with the value in hvm_info in
>> *both cases*.
>>
>> In the code above, you'll get the sanity check if we modify an existing
>> e820 entry; but if we create a new entry, then we don't check to make
>> sure that the amount we removed from the lowmem entry equals the amount
>> we added to the highmem entry.
> 
> Are you saying the following two cases are not same?
> 
> uint64_t high_mem_end = (uint64_t)hvm_info->high_mem_pgend << PAGE_SHIFT;
> BUG_ON( high_mem_end != e820[i].addr + e820[i].size);
> vs.
> BUG_ON(e820[i].addr+e820[i].size != ((uint64_t)hvm_info->high_mem_pgend
> << PAGE_SHIFT));
> 
> Why? Note hvm_info->high_mem_pgend don't change while build e820 table.
> 
> Honestly I didn't try to change that point but maybe I'm missing something?

Yes, you are missing something. :-)  I told you exactly what I wanted
changed and what I said could remain the same:

>> By all means, calculate high_mem_end so it's easier to read.  But then,
>> when creating a new region, set e820[nr].size = add_high_mem, so that
>> the BUG_ON() that follows actually checks something useful.

Just to be clear, I want the second if() statement to look like this:

>>> +        if ( i == nr )
>>> +        {
>>> +            e820[nr].addr = ((uint64_t)1 << 32);
>>> +            e820[nr].size = add_high_mem;
>>> +            e820[nr].type = E820_RAM;
>>> +            nr++;
>>> +        }

Think about why and maybe that will help you understand what I'm talking
about.

 -George

_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxx
http://lists.xen.org/xen-devel


 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.