Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add option to control trailing zero in floating-point literals #5834

Open
wants to merge 5 commits into
base: master
Choose a base branch
from

Conversation

amatveiakin
Copy link

This is my first contribution to rustfmt. Feel free to nitpick :)

Also please tell me if I did the tests right. Three separate test files seem like a lot for such a small feature, but I don't see how else I could check all configurations.

Closes #3187

@ytmimi
Copy link
Contributor

ytmimi commented Jan 25, 2024

Hey @amatveiakin I know this hasn't gotten much attention from the team, but I wanted to check back in and see if you're still interested in working on this.

@amatveiakin
Copy link
Author

Hey @ytmimi, yes, I'm still interested. Also wrote up a short proposal in #3187, so that we can discuss high-level details there.

Copy link
Contributor

@ytmimi ytmimi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just wrapped up my initial review on this. Overall I think this is a good start, but I'd like us to expand on the test cases.

Configurations.md Show resolved Hide resolved
Configurations.md Outdated Show resolved Hide resolved
src/expr.rs Outdated
Comment on lines 1337 to 1341
return wrap_str(
context.snippet(span).to_owned(),
context.config.max_width(),
shape,
);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For Version::Two we don't check the line length of string literals because there's no meaningful way to break them by default. Similarity, I don't think there would be a meaningful way to break a float literal over multiple lines so let's avoid using wrap_str. Same goes for the wrap_str call below.

For reference, here's the code in rewrite_string_lit that I'm referring to:

rustfmt/src/expr.rs

Lines 1261 to 1263 in cedb7b5

&& context.config.version() == Version::Two
{
return Some(string_lit.to_owned());

Copy link
Author

@amatveiakin amatveiakin Feb 27, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could you please elaborate? I guess the statement “there is no meaningful way to break a float literal over multiple lines” is always true, so why would this depend on config version?

BTW, would you say that rewrite_int_lit should be updated as well? It also calls wrap_str unconditionally:

rustfmt/src/expr.rs

Lines 1306 to 1310 in cedb7b5

wrap_str(
context.snippet(span).to_owned(),
context.config.max_width(),
shape,
)

src/expr.rs Outdated Show resolved Hide resolved
Comment on lines 3 to 17
fn float_literals() {
let a = 0.;
let b = 0.0;
let c = 100.;
let d = 100.0;
let e = 5e3;
let f = 5.0e3;
let g = 7f32;
let h = 7.0f32;
let i = 9e3f32;
let j = 9.0e3f32;
let k = 1000.00;
let l = 1_000_.;
let m = 1_000_.000_000;
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a good start, but I'd really like to see us expand on these test cases.

  1. Can we move all float_literal_trailing_zero tests to a tests/{source|target}/config/float_literal_trailing_zero directory. Then you can name each test file as the name of the enum variant.
  2. To cover all our bases let's add a test case for Preserve. We should only need a target test file for the preserve case since we don't expect formatting to change.
  3. Let's try to be as thorough as we can when coming up with test cases. You've already highlighted let bindings, but there are plenty of other places where float literals can be written. A few that come to mind are conditional expressions, struct literals, match patterns, function arguments, array literals, macro calls, etc.

One thing that I'm particularly interested in seeing is what happens when we're at or near the max_width boundary. Does adding the extra 0 correctly wrap a function call or array literal when adding the extra 0 pushes us over the max_width, and do we correctly format on a single line when adding 0 pushes us to exactly the max_width.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also please tell me if I did the tests right. Three separate test files seem like a lot for such a small feature, but I don't see how else I could check all configurations.

Oh and to answer your question, this is correct. You'll need to create individual test files for each case!

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. Done.
  2. Done.
  3. I've added one macro and one non-macro example that causes a line wrapping change. Do you think we really need to cover all these cases (conditional expressions, struct literals, match patterns, function arguments, etc.)? To me it feel a bit excessive. And it seems different from the current testing strategy (take hex_literal_case tests as a close example).


- **Default value**: `Preserve`
- **Possible values**: `Preserve`, `Always`, `IfNoPostfix`, `Never`
- **Stable**: No (tracking issue: [#3187](/~https://github.com/rust-lang/rustfmt/issues/3187))
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also, we'll need to add a new tracking issue for this one. #3187 isn't a tracking issue. We can certainly add the tracking issue after this PR is merged.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why don't we create an issue now, so that we don't have to go back and modify the link later?
Is there a document that describes how a tracking issue should look like?

src/expr.rs Outdated Show resolved Hide resolved
@amatveiakin
Copy link
Author

Apologies for the slow reply. Please take another look.

@ytmimi
Copy link
Contributor

ytmimi commented Mar 5, 2024

@amatveiakin No worries. I'll likely have some time to get back to this later this week. For your awareness I want to let you know that I just merged #6085, and I'd really like to see tests with range expressions like the ones in the PR. I want to make sure that the new float_literal_trailing_zero option that you're working on meshes well with the code that handles spaces before range expressions.

@Keavon
Copy link

Keavon commented Jan 1, 2025

Any updates on this effort?

@amatveiakin
Copy link
Author

@Keavon sorry, this completely slipped my mind. I'm going to take a look this weekend.

@amatveiakin amatveiakin reopened this Jan 5, 2025
@amatveiakin
Copy link
Author

@ytmimi you were absolutely right: my original implementation incorrectly reformatted 1.0..2.0 as 1...2.. I fixed this by adjusting lit_ends_in_dot to answer whether the literal ends with a dot after rewriting.
Please take another look.

@ytmimi
Copy link
Contributor

ytmimi commented Jan 6, 2025

I should have some time to revisit this later in the week. Thanks for following up on the implementation!

Copy link
Contributor

@ytmimi ytmimi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks again for revisiting this. I've left some additional feedback on the PR

src/expr.rs Outdated Show resolved Hide resolved
Comment on lines -1331 to +1340
format!(
"0x{}{}",
hex_lit,
token_lit.suffix.as_ref().map_or("", |s| s.as_str())
),
format!("0x{}{}", hex_lit, suffix.unwrap_or("")),
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did this need to change?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No. I thought it would be nice to keep rewrite_int_lit and rewrite_float_lit_inner as close as possible, but I could revert it if you'd like.

src/expr.rs Outdated Show resolved Hide resolved
@@ -2273,9 +2342,34 @@ pub(crate) fn is_method_call(expr: &ast::Expr) -> bool {
}
}

struct FloatSymbolParts<'a> {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please add a doc comment for this struct, and it's fields.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done. BTW, is the policy on doc comments described somewhere? I thought only public entities should be doc-commented. I see now that it's not the case, but I'm not sure which comments should be doc vs non-doc for private items.

src/expr.rs Outdated Show resolved Hide resolved
src/expr.rs Outdated

// Parses a float literal. The `symbol` must be a valid floating point literal without a type
// suffix. Otherwise the function may panic or return wrong result.
fn parse_float_symbol(symbol: &str) -> FloatSymbolParts<'_> {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Given the ask to have this return Result, this could be implemented via FromStr, right?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not sure. Could you help me fix the lifetimes please?

When I tried

impl<'a> FromStr for FloatSymbolParts<'a> {
    fn from_str(s: &'a str) -> Result<Self, Self::Err> {  // <<< error here
        ...
    }
}

I got

method not compatible with trait
expected signature `fn(&_) -> std::result::Result<_, _>`
   found signature `fn(&'a _) -> std::result::Result<_, _>`

And when I tried

impl FromStr for FloatSymbolParts<'_> {
    fn from_str(s: &str) -> Result<Self, Self::Err> {
        ...
        Ok(FloatSymbolParts { ... })  // <<< error here
    }
}

I got

lifetime may not live long enough
associated function was supposed to return data with lifetime `'2` but it is returning data with lifetime `'1`

src/expr.rs Outdated
matches!(lit, Lit { kind: LitKind::Float, suffix: None, symbol } if symbol.as_str().ends_with('.'))
pub(crate) fn lit_ends_in_dot(lit: &Lit, context: &RewriteContext<'_>) -> bool {
match lit.kind {
LitKind::Float => do_rewrite_float_lit(context, *lit).ends_with('.'),
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we need to do the whole rewrite in order to know if the literal is going to ends in a zero? For example, in the Preserve case the original logic would hold.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was torn on this myself. In fact, I implemented the non-rewriting version first. Here is how it looks like:

pub(crate) fn float_lit_ends_in_dot(lit: &Lit, context: &RewriteContext<'_>) -> bool {
    let symbol = lit.symbol.as_str();
    let suffix = lit.suffix.as_ref().map(|s| s.as_str());
    match context.config.float_literal_trailing_zero() {
        FloatLiteralTrailingZero::Preserve => symbol.ends_with('.'),
        FloatLiteralTrailingZero::IfNoPostfix | FloatLiteralTrailingZero::Always => false,
        FloatLiteralTrailingZero::Never => {
            let float_parts = parse_float_symbol(symbol).unwrap();
            let has_postfix = float_parts.exponent.is_some() || suffix.is_some();
            let fractional_part_nonzero = float_parts.is_fractional_part_zero();
            !has_postfix && !fractional_part_nonzero
        }
    }
}

I felt that it was not DRY enough and thus error-prone. I became even more convinced that having this parallel logic was error-prone when I realized that my first implementation contained an error :)
On the other hand, it feels to me that performance overhead from one additional format! (which is basically all what rewrite_float_lit_inner does that the function above does not) should be rather small. And this multiplied by the fact that floating-point ranges are rather rare, at least in my experience.
So in the end I felt that it was premature optimization and did the straightforward check.
WDYT?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add leading/trailing zeros to floats
3 participants