-
Notifications
You must be signed in to change notification settings - Fork 163
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PE: Support multiple debug directories and VCFeature, Repro, ExDllCharacteristics, POGO parsers #403
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this seems fine to me, other than the fixes w.r.t. &Vec<T>
-> &[T]
@kkent030315 thanks for this PR, I think if we fix the basic nits this is ready to go, thank you! |
I'd also like to see this one go in for sure :) |
@m4b Thank you for the review! I'm going to mess with this PR very soon. |
- Added decent docs for `pe::debug` structs and constants - Added integration tests for `pe::debug::DebugData`
@m4b Hi, this PR has been refactored, please take a look #403 (comment) for up-to-date changelists. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
initial review, did not dive in deeply; main concern was initially with the allocations; I can't recall offhand, but there is probably some trait you can implement to allow find
to work with your iterator, to prevent the allocation; you'll likely need clone/copy on the iterator since it'll likely consume it, but that should be fine.
also would like to understand why the breaking change is necessary, can you give some motivation for that?
lastly, thank you so much (as usual!) for your incredible documentation on the code you commit, truly outstanding stuff!
src/pe/debug.rs
Outdated
Self::parse_with_opts(bytes, idd, &options::ParseOptions::default()) | ||
} | ||
|
||
pub fn parse_with_opts( | ||
bytes: &'a [u8], | ||
idd: &ImageDebugDirectory, | ||
idd: ImageDebugDirectoryIterator<'_>, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
breaking change; is this avoidable?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, it is avoidable. This PR was initially designed and assumed subject to breaking change, while I'm pretty sure we could do that w/o actually breaking it. I give it a shot.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@m4b Alright, everything looks improved and there are no explicit alloc's left there. On the other hand, a breaking pub
change in debug::DebugData::parse_with_opts_and_fixup
for TE seems inevitable at this time. We actually can let them alone, but that's out of our original purpose to support multiple debug data directories.
Perhaps, there's a two ideas:
- Let the TE breaking changes alone and release the rest as minor changes,
Then, do the TE breaking changes separately in a 0.10 rollup. - Or, rollup everything, including the breaking TE change, in this PR as-is.
Which one do you prefer?
I think this is ready to go once the above suggestions are applied. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ARGH i forgot to submit these comments (a long time ago :( )
@m4b Thank you for the review. Everything addressed. 👍 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
great stuff as usual, thank you! (and thank you for your patience!)
offset, dd.size, bytes.len() | ||
))); | ||
} | ||
let data = &bytes[offset..offset + dd.size as usize]; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
probably in the future we should port this kind of code to something like:
let Some (data) = bytes.get(offset..offset + dd.size as usize) else {
return Err(error::Error::Malformed(format!(
"ImageDebugDirectory offset {:#x} and size {:#x} exceeds the bounds of the bytes size {:#x}",
offset, dd.size, bytes.len()
)));
};
this is clearer, and doesn't repeat the bounds, but we its minor nit. I should have remembered this earlier, as we've done these in several other places, but the else
guard syntax only merged fairly recently (maybe a year or so ago iirc, but maybe even longer 😅 )
if idd.data_type != IMAGE_DEBUG_TYPE_CODEVIEW { | ||
// not a codeview debug directory | ||
// that's not an error, but it's not a CodeviewPDB70DebugInfo either | ||
return Ok(None); | ||
} | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i'm just curious why did this get removed? Is it because we now parse CODEVIEW?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The check is moved to the caller:
if let Some(idd) = &it.find_type(IMAGE_DEBUG_TYPE_CODEVIEW) {
codeview_pdb70_debug_info = CodeviewPDB70DebugInfo::parse_with_opts(bytes, idd, opts)?;
codeview_pdb20_debug_info = CodeviewPDB20DebugInfo::parse_with_opts(bytes, idd, opts)?;
}
Consumers (callers) should be aware of debug types whenever they are calling any of IMAGE_DEBUG_TYPE_*
parsers.
NB: breaking change |
This PR addresses the issue #314.
- The change is straightforward. It makes theDebugData::image_debug_directory: ImageDebugDirectory
intoVec<ImageDebugDirectory>
as someone pointed out in the #314 (comment). So this is to be an breaking change.- There is another addition of VC feature metadataIMAGE_DEBUG_TYPE_VC_FEATURE
(IMAGE_DEBUG_VC_FEATURE_ENTRY
) in the debug directory.If anyone have suggestion of making this semantics without breaking the backward compatibility I am open to discuss.
update:
DebugData
and its parsers for comfortable code that makes multiple debug directories in mind.TE::fixup_debug_data
has been merged toDebugData::parse_*
: AsDebugData
is now more compatible with TE (Terse Executable) by addingDebugData::parse_with_opts_and_fixup
and fixup RVA field inDebugData
andImageDebugDirectoryIterator
in order if any fixup is required forImageDebugDirectory::address_of_raw_data
andImageDebugDirectory::pointer_to_raw_data
.pe::debug
.IMAGE_DEBUG_TYPE_VC_FEATURE
.IMAGE_DEBUG_TYPE_REPRO
.IMAGE_DEBUG_TYPE_EX_DLLCHARACTERISTICS
.IMAGE_DEBUG_TYPE_POGO
.DebugData
parsers.impl ImageDebugDirectory
and it has been merged toDebugData::parse_*
.