Open scottmcm opened 5 years ago
(Note it can be nice, however, to be able to define types locally within functions and then attach impls to those types...)
If we decide to spend time investigating this, it'd be great to also consider IDE support case.
IIRC (and somewhat oversimplifying) we can introduce impl blocks (and trait impls) in nested scoped that affect the outer scope, which means that the local information is not enough to reason about associated consts/methods of a given type (to simplify this somehow) and we potentially need to scan the entire crate to do that.
@matklad I think this is related to "quick parse" that we talked about which could also skip parsing the entire function bodies, right?
This approach, if feasible at all and if it doesn't reasonably reduce expressiveness, could potentially speed up some name/type resolution.
Though at the same time, it's nice to allow all sorts of things for how ?eval bots work on Discord and IRC
It’s also nice in cases like evcxr, which provides a repl and jupyter notebook support for rust. plotters uses evcxr in some of its examples https://plotters-rs.github.io/plotters-doc-data/evcxr-jupyter-integration.html
unfortunate for incremental
It’s bad for laziness, not for incremental.
The property we want here for IDE is:
Function body can be type-checked without looking at other functions’ bodies
That is, it’s not the nesting itself that creates troubles, but the fact that nested items are sometimes observable from the outside. Here’s the list of things that cause trouble that I know of:
The firsts two can be fixed by treating item bodies as downstream crates from coherence pov.
Additionally, another trouble some feature for ide is
IDE gets requests like “what’s the definition of thing at offset 92 in file foo.rs”. The first thing IDE needs to do is to figure out, from the set of known crate roots, how foo.rs fits into the module system. At the moment, to do this precisely one has to look into the body of every function, precisely because a mod declaration might be hidden there. Note that we currently require an explicit path attribute for local module, but that doesn’t really help, as you need still need to parse everything to find out there isn’t a module with an attr.
Note also that we would still need to parse some function bodies due to autotrait leakage via impl trait, but only if you directly use that function. This is different from having to parse all function bodies because somewhere and odd impl might be hidden, which affects type inference.
Beyond function bodies, it's currently permitted inside const bodies, including const generics. For a hopefully unrealistic example:
#![feature(const_generics)]
struct Foo<const T: usize>;
struct Bar;
const C: usize = {
impl Bar { const fn a() -> usize { 42 } };
Bar::a() + Bar::b()
};
impl Foo<{
mod m { impl super::Bar { pub const fn b() -> usize { 42 } } }
C
}> {}
imo being able to impl a trait on a struct defined in the same scope is a think i would want to keep. my main example for it would be a serde use case
#[derive(Serialize, Debug, Eq, PartialEq, Clone)]
#[serde(untagged)]
pub enum StringOrNumber {
String(String),
Number(u64),
}
impl<'de> de::Deserialize<'de> for StringOrNumber {
fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where
D: de::Deserializer<'de>,
{
struct Visitor;
impl<'de> de::Visitor<'de> for Visitor {
type Value = StringOrNumber;
/** .... further Visitor impl here **/
}
deserializer.deserialize_any(Visitor)
}
}
it allows one to implement the visitor for a struct, without exposing it to the scope around it.
I think we should keep the ability to do this.
Given that there are already four comments saying "we should allow impls for types that are themselves nested", I feel it would be useful to repeat:
EDIT: I think I first heard the coherence formulation from @vlad20012.
To give an example on the case of trait impls:
// very useful, not problematic for lazy compilation. keep this
pub fn f() {
struct S;
impl Trait for S {}
}
// bad, also not usually useful. do not keep
pub struct S;
pub fn f() {
impl Trait for S {}
}
Another slightly problematic thing (again by vlad20012 ) are inner function attributes:
fn foo() {
#![cfg(unix)]
}
fn main() {
foo()
}
Strictly speaking, we can live with that, as it only requires bounded parsing of the prefix of the function. But not having that seems better.
@dtolnay
// bad, also not usually useful. do not keep
pub struct S;
pub fn f() {
impl Trait for S {}
}
There may be an exception when the trait is defined in f
? This feels related to coherence to me.
e.g.
pub struct S;
pub fn f() {
trait Trait {}
impl Trait for S {}
}
Nominating for the @rust-lang/lang meeting -- if we want these to happen, we're going to have to pay attention.
It's probably also worth pointing out that impl Trait
creates a minor hole in the "type-check function body without looking at impls of other functions" approach due to handling of auto traits. This probably isn't hugely impactful from a practical perspective, but it's more than slightly annoying that you'd have to parse the bodies of -> impl Trait
functions in order to correctly check Send
/Sync
/Unpin
bounds.
FWIW, there are also people (myself included), that try to reduce the number of "meaningful top-level items" or to have "deeply tied code sit next to each other".
An example of the former would be:
impl DerefMut for SomeType {
fn deref_mut … -> &mut SomePointee {
impl Deref for SomeType { … }
…
}
}
impl Ord for SomeType {
fn cmp … {
impl PartialOrd for SomeType {
… Some(self.cmp(other)) …
}
…
}
}
// And on nightly
impl Fn<(…,)> for SomeType {
fn call (&self, …) -> Ret {
impl FnMut…
impl FnOnce…
…
}
}
An example of the latter would be:
struct WrapsAPtr(*const Pointee);
impl WrapsAPtr {
pub fn new(…) -> Self {
let pointee = …;
let ptr = Box::into_raw(Box::new(pointee));
impl Drop for WrapsAPtr { fn drop(&mut self) { unsafe {
let Self(ptr) = *self;
drop::<Box<Pointee>>(Box::from_raw(ptr as _));
}}}
WrapsAPtr(ptr)
}
}
This way witching from Box
to Arc
is locally contained within new()
.
pub fn new(…) -> Self {
let pointee = …;
- let ptr = Box::into_raw(Box::new(pointee));
+ let ptr = Arc::into_raw(Arc::new(pointee));
impl Drop for WrapsAPtr { fn drop(&mut self) { unsafe {
let Self(ptr) = *self;
- drop::<Box<Pointee>>(Box::from_raw(ptr as _));
+ drop::<Arc<Pointee>>(Arc::from_raw(ptr as _));
}}}
WrapsAPtr(ptr)
}
In that regarding, losing the ability to do either of these things would be a big deal breaker w.r.t. upgrading the edition I use, and I'm afraid coherence would prevent both of these.
Another nice thing to do is to avoid polluting the top-level namespace when dealing with some impls, such as:
const _: () = {
use ::core::fmt::{self, Debug, Display};
impl Display for MyType { … }
};
Finally, regarding inline modules inside an fn
body, that's also a desireable feature, not only for eval
queries with the Playground, but also for macros that may mock types using modules: Example with a macro implementing type-level enum
s.
On the other hand, I have never encountered the need for a non-inline module defined within a function; although one can always emulate that with an inline module by using include!
, I guess.
We discussed this issue at a recent lang design team meeting (agenda item).
Here is my attempt to summarize the discussion as posted on this issue, as well as convey concerns raised during the lang team meeting itself.
rustdoc
, where you need in some cases to have item declarations sitting alongside Rust expressions, and that is something that you can only do within an fn body
.The basic consensus of the lang design team was that we are not prepared to address this for the 2021 edition.
We might be willing to consider a soft-deprecation of the feature in the future, that would not need to be tied to an edition, but that can be a separate proposal (that would presumably be written in a manner that takes the dialogue here into account).
(We didn't actually directly address the more narrow proposal put forth by matklad in the lang team meeting. But I think the same principles hold there: We are not prepared to address that for the 2021 edition. We would be willing to consider strategies for revising/restricting the semantics that are not tied to an edition.)
So, the lang design team recommends that this issue be closed.
@rfcbot fcp close
Team member @pnkfelix has proposed to close this. The next step is review by the rest of the tagged team members:
No concerns currently listed.
Once a majority of reviewers approve (and at most 2 approvals are outstanding), this will enter its final comment period. If you spot a major issue that hasn't been raised at any point in this process, please speak up!
See this document for info about what commands tagged team members can give me.
To clarify: my position is that this is a change we should be considering, but not for the 2021 edition. There is too much remaining design and implementation work for this to ship in that constrained of a time period.
:bell: This is now entering its final comment period, as per the review above. :bell:
Totally agreed for 2021. What's the right way to track it for 2024? The tag here is "202x"; should we have a "edition-the-next-one" tag? (Do we need an FCP to move out of an edition? Could we just say "nope" and change the label?)
It would be generally useful to have an A-edition-wishlist
label, to make it easier to gather up ideas for edition-related changes at the beginning of each edition cycle. Agreed that this issue ought to be postponed, rather than closed.
The final comment period, with a disposition to close, as per the review above, is now complete.
As the automated representative of the governance process, I would like to thank the author for their work and everyone else who contributed.
I believe this should be tagged disposition-postpone, not disposition-close, as the comments above indicate that this is wanted, just not now.
sure. adjusted.
Following up on this: if we want to do this in the 2024 edition, someone would need to be enthusiastic about starting that work within the next year. Does someone want to champion this?
@joshtriplett I will do it
There is an RFC now https://github.com/rust-lang/rfcs/pull/3373
For example, do we really want
mod
s inside random function? Similarly, IIRC https://youtu.be/LIYkT3p5gTs mentioned that it's unfortunate for incremental that we allowimpl
s of random things inside functions.Though at the same time, it's nice to allow all sorts of things for how
?eval
bots work on Discord and IRC -- just put all the code inside a block that's passed toprintln!
and it just works.