//! # Virtual File System
//!
//! VFS records all file changes pushed to it via [`set_file_contents`].
//! As such it only ever stores changes, not the actual content of a file at any given moment.
//! All file changes are logged, and can be retrieved via the
//! [`take_changes`] method. The pack of changes is then pushed to `salsa` and
//! triggers incremental recomputation.
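//!
//! As a rough illustration of this record-then-drain flow, here is a minimal,
//! standalone sketch. It is not the real implementation: `ChangeLog`, `Id` and
//! `LoggedChange` are hypothetical stand-ins, and the content-hash check that
//! skips no-op writes is left out.
//!
//! ```rust
//! use std::collections::HashMap;
//! use std::mem;
//!
//! /// Hypothetical stand-in for a file id.
//! type Id = u32;
//!
//! /// Simplified change record (assumption: no content hashes).
//! #[derive(Debug, PartialEq)]
//! enum LoggedChange {
//!     Set(Vec<u8>),
//!     Delete,
//! }
//!
//! /// Hypothetical log that keeps only the latest pending change per file.
//! #[derive(Default)]
//! struct ChangeLog {
//!     pending: HashMap<Id, LoggedChange>,
//! }
//!
//! impl ChangeLog {
//!     /// Records a change; `None` means the file was deleted.
//!     /// A newer change to the same file replaces the older one.
//!     fn push(&mut self, id: Id, contents: Option<Vec<u8>>) {
//!         let change = match contents {
//!             Some(bytes) => LoggedChange::Set(bytes),
//!             None => LoggedChange::Delete,
//!         };
//!         self.pending.insert(id, change);
//!     }
//!
//!     /// Hands out all pending changes and leaves the log empty.
//!     fn take_changes(&mut self) -> HashMap<Id, LoggedChange> {
//!         mem::take(&mut self.pending)
//!     }
//! }
//! ```
//!
//! In this sketch, a consumer would call `take_changes` once per update cycle
//! and forward the result; a second call right after returns an empty map.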
//!
//! Files in VFS are identified with [`FileId`]s -- interned paths. The notion of
//! the path, [`VfsPath`], is somewhat abstract: at the moment, it is represented
//! as a [`std::path::PathBuf`] internally, but this is an implementation detail.
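//!
//! To make "interned paths" concrete, the following standalone sketch shows
//! one simple way to intern paths into small integer ids. `Interner` is a
//! hypothetical type using plain strings; the crate's own interner may be
//! built differently.
//!
//! ```rust
//! use std::collections::HashMap;
//!
//! /// Hypothetical interner: each distinct path gets one stable `u32` id.
//! #[derive(Default)]
//! struct Interner {
//!     ids: HashMap<String, u32>,
//!     paths: Vec<String>,
//! }
//!
//! impl Interner {
//!     /// Returns the id for `path`, allocating a fresh one on first sight.
//!     fn intern(&mut self, path: &str) -> u32 {
//!         if let Some(&id) = self.ids.get(path) {
//!             return id;
//!         }
//!         let id = self.paths.len() as u32;
//!         self.paths.push(path.to_owned());
//!         self.ids.insert(path.to_owned(), id);
//!         id
//!     }
//!
//!     /// Maps an id back to its path; panics on an id that was never handed out.
//!     fn lookup(&self, id: u32) -> &str {
//!         &self.paths[id as usize]
//!     }
//! }
//! ```
//!
//! Interning the same path twice yields the same id, so ids can be compared
//! and hashed cheaply instead of whole paths.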
//!
//! VFS doesn't do IO or file watching itself. For that, see the [`loader`]
//! module. [`loader::Handle`] is an object-safe trait which abstracts both file
//! loading and file watching. [`Handle`] is dynamically configured with a set of
//! directory entries which should be scanned and watched. [`Handle`] then
//! asynchronously pushes file changes. Directory entries are configured in
//! free form via a list of globs; it's up to the [`Handle`] to interpret the globs
//! in any specific way.
//!
//! VFS stores a flat list of files. [`file_set::FileSet`] can partition this list
//! of files into disjoint sets of files. Traversal-like operations (including
//! getting the neighbor file by the relative path) are handled by the [`FileSet`].
//! [`FileSet`]s are also pushed to salsa and cause it to re-check `mod foo;`
//! declarations when files are created or deleted.
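//!
//! One plausible way to picture the partitioning is sketched below. This is a
//! standalone toy, not the crate's algorithm: `partition` is a hypothetical
//! function, it works on plain strings, and it assumes that each file belongs
//! to the root with the longest matching prefix, with a final bucket for files
//! under no root.
//!
//! ```rust
//! /// Assigns every path to the root with the longest matching string prefix.
//! /// Returns one bucket per root, plus a last bucket for unmatched paths.
//! /// Note: this is a plain string prefix test, not path-component aware.
//! fn partition<'a>(roots: &[&str], paths: &[&'a str]) -> Vec<Vec<&'a str>> {
//!     let mut sets = vec![Vec::new(); roots.len() + 1];
//!     for &path in paths {
//!         let best = roots
//!             .iter()
//!             .enumerate()
//!             .filter(|(_, root)| path.starts_with(**root))
//!             .max_by_key(|(_, root)| root.len())
//!             .map(|(idx, _)| idx)
//!             .unwrap_or(roots.len());
//!         sets[best].push(path);
//!     }
//!     sets
//! }
//! ```
//!
//! Because every path lands in exactly one bucket, the resulting sets are
//! disjoint by construction.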
//!
//! [`FileSet`] and [`loader::Entry`] play similar, but different roles.
//! Both specify a "set of paths/files": one is geared towards file watching,
//! the other towards salsa changes. In particular, a single [`FileSet`]
//! may correspond to several [`loader::Entry`]s. For example, a crate from
//! crates.io which uses code generation would have two [`Entries`] -- for sources
//! in `~/.cargo`, and for generated code in `./target/debug/build`. It will
//! have a single [`FileSet`] which unions the two sources.
//!
//! [`set_file_contents`]: Vfs::set_file_contents
//! [`take_changes`]: Vfs::take_changes
//! [`FileSet`]: file_set::FileSet
//! [`Handle`]: loader::Handle
//! [`Entries`]: loader::Entry

mod anchored_path;
pub mod file_set;
pub mod loader;
mod path_interner;
mod vfs_path;

use std::{fmt, hash::BuildHasherDefault, mem};

use crate::path_interner::PathInterner;

pub use crate::{
    anchored_path::{AnchoredPath, AnchoredPathBuf},
    vfs_path::VfsPath,
};
use indexmap::{map::Entry, IndexMap};
pub use paths::{AbsPath, AbsPathBuf};

use rustc_hash::FxHasher;
use stdx::hash_once;
use tracing::{span, Level};

/// Handle to a file in [`Vfs`]
///
/// Most functions in rust-analyzer use this when they need to refer to a file.
#[derive(Copy, Clone, Debug, Ord, PartialOrd, Eq, PartialEq, Hash)]
pub struct FileId(u32);
// pub struct FileId(NonMaxU32);

impl FileId {
    pub const MAX: u32 = 0x7fff_ffff;

    #[inline]
    pub const fn from_raw(raw: u32) -> FileId {
        assert!(raw <= Self::MAX);
        FileId(raw)
    }

    #[inline]
    pub const fn index(self) -> u32 {
        self.0
    }
}

/// safe because `FileId` is a newtype of `u32`
impl nohash_hasher::IsEnabled for FileId {}

/// Storage for all file changes and the file id to path mapping.
///
/// For more information see the [crate-level](crate) documentation.
#[derive(Default)]
pub struct Vfs {
    interner: PathInterner,
    data: Vec<FileState>,
    changes: IndexMap<FileId, ChangedFile, BuildHasherDefault<FxHasher>>,
}

#[derive(Copy, Clone, Debug, PartialEq, PartialOrd)]
pub enum FileState {
    /// The file exists with the given content hash.
    Exists(u64),
    /// The file is deleted.
    Deleted,
}

/// Changed file in the [`Vfs`].
#[derive(Debug)]
pub struct ChangedFile {
    /// Id of the changed file
    pub file_id: FileId,
    /// Kind of change
    pub change: Change,
}

impl ChangedFile {
    /// Returns `true` if the change is not [`Delete`](Change::Delete).
    pub fn exists(&self) -> bool {
        !matches!(self.change, Change::Delete)
    }

    /// Returns `true` if the change is [`Create`](Change::Create) or
    /// [`Delete`](Change::Delete).
    pub fn is_created_or_deleted(&self) -> bool {
        matches!(self.change, Change::Create(_, _) | Change::Delete)
    }

    /// Returns `true` if the change is [`Create`](Change::Create).
    pub fn is_created(&self) -> bool {
        matches!(self.change, Change::Create(_, _))
    }

    /// Returns `true` if the change is [`Modify`](Change::Modify).
    pub fn is_modified(&self) -> bool {
        matches!(self.change, Change::Modify(_, _))
    }

    /// Returns the [`ChangeKind`] of this change, without its contents and hash.
    pub fn kind(&self) -> ChangeKind {
        match self.change {
            Change::Create(_, _) => ChangeKind::Create,
            Change::Modify(_, _) => ChangeKind::Modify,
            Change::Delete => ChangeKind::Delete,
        }
    }
}

/// Kind of [file change](ChangedFile), together with the new contents and their hash.
#[derive(Eq, PartialEq, Debug)]
pub enum Change {
    /// The file was (re-)created
    Create(Vec<u8>, u64),
    /// The file was modified
    Modify(Vec<u8>, u64),
    /// The file was deleted
    Delete,
}

/// Kind of [file change](ChangedFile).
#[derive(Eq, PartialEq, Debug)]
pub enum ChangeKind {
    /// The file was (re-)created
    Create,
    /// The file was modified
    Modify,
    /// The file was deleted
    Delete,
}

impl Vfs {
    /// Id of the given path if it exists in the `Vfs` and is not deleted.
    pub fn file_id(&self, path: &VfsPath) -> Option<FileId> {
        self.interner.get(path).filter(|&it| matches!(self.get(it), FileState::Exists(_)))
    }

    /// File path corresponding to the given `file_id`.
    ///
    /// # Panics
    ///
    /// Panics if the id is not present in the `Vfs`.
    pub fn file_path(&self, file_id: FileId) -> &VfsPath {
        self.interner.lookup(file_id)
    }

    /// Returns an iterator over the stored ids and their corresponding paths.
    ///
    /// This will skip deleted files.
    pub fn iter(&self) -> impl Iterator<Item = (FileId, &VfsPath)> + '_ {
        (0..self.data.len())
            .map(|it| FileId(it as u32))
            .filter(move |&file_id| matches!(self.get(file_id), FileState::Exists(_)))
            .map(move |file_id| {
                let path = self.interner.lookup(file_id);
                (file_id, path)
            })
    }

    /// Update the `path` with the given `contents`. `None` means the file was deleted.
    ///
    /// Returns `true` if the file was modified, and saves the [change](ChangedFile).
    ///
    /// If the path does not currently exist in the `Vfs`, allocates a new
    /// [`FileId`] for it.
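    ///
    /// When the same file changes more than once before the changes are taken,
    /// the pending change and the newer one are merged. The standalone sketch
    /// below mirrors those merge rules on a simplified, hypothetical
    /// `MiniChange` enum (assumption: content hashes are dropped for brevity).
    ///
    /// ```rust
    /// /// Simplified stand-in for a change; hashes are omitted (assumption).
    /// #[derive(Debug, PartialEq)]
    /// enum MiniChange {
    ///     Create(Vec<u8>),
    ///     Modify(Vec<u8>),
    ///     Delete,
    /// }
    ///
    /// /// Folds a `newer` change into the `pending` one for the same file.
    /// fn merge(pending: MiniChange, newer: MiniChange) -> MiniChange {
    ///     use MiniChange::*;
    ///     match (pending, newer) {
    ///         // a newer `Delete` always wins
    ///         (_, Delete) => Delete,
    ///         // the file is still new to consumers, keep `Create` with the latest bytes
    ///         (Create(_), Create(new) | Modify(new)) => Create(new),
    ///         // consecutive modifications collapse into the latest one
    ///         (Modify(_), Modify(new)) => Modify(new),
    ///         // deleting and re-creating within one cycle looks like a modification
    ///         (Delete, Create(new)) => Modify(new),
    ///         // the last two combinations are not expected to occur
    ///         (Delete, Modify(new)) => Create(new),
    ///         (Modify(_), Create(new)) => Create(new),
    ///     }
    /// }
    /// ```
    ///
    /// For instance, a pending `Create` followed by a `Modify` stays a `Create`
    /// carrying the newest contents.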
    pub fn set_file_contents(&mut self, path: VfsPath, contents: Option<Vec<u8>>) -> bool {
        let _p = span!(Level::INFO, "Vfs::set_file_contents").entered();
        let file_id = self.alloc_file_id(path);
        let state = self.get(file_id);
        let change_kind = match (state, contents) {
            (FileState::Deleted, None) => return false,
            (FileState::Deleted, Some(v)) => {
                let hash = hash_once::<FxHasher>(&*v);
                Change::Create(v, hash)
            }
            (FileState::Exists(_), None) => Change::Delete,
            (FileState::Exists(hash), Some(v)) => {
                let new_hash = hash_once::<FxHasher>(&*v);
                // same content hash as before, nothing to record
                if new_hash == hash {
                    return false;
                }
                Change::Modify(v, new_hash)
            }
        };

        let mut set_data = |change_kind| {
            self.data[file_id.0 as usize] = match change_kind {
                &Change::Create(_, hash) | &Change::Modify(_, hash) => FileState::Exists(hash),
                Change::Delete => FileState::Deleted,
            };
        };

        let changed_file = ChangedFile { file_id, change: change_kind };
        match self.changes.entry(file_id) {
            // two changes to the same file in one cycle, merge them appropriately
            Entry::Occupied(mut o) => {
                use Change::*;

                match (&mut o.get_mut().change, changed_file.change) {
                    // newer `Delete` wins
                    (change, Delete) => *change = Delete,
                    // merge `Create` with `Create` or `Modify`, keeping the newest contents
                    (Create(prev, old_hash), Create(new, new_hash) | Modify(new, new_hash)) => {
                        *prev = new;
                        *old_hash = new_hash;
                    }
                    // collapse consecutive `Modify`s, keeping the newest contents
                    (Modify(prev, old_hash), Modify(new, new_hash)) => {
                        *prev = new;
                        *old_hash = new_hash;
                    }
                    // `Delete` followed by `Create` is equivalent to `Modify`
                    (change @ Delete, Create(new, new_hash)) => {
                        *change = Modify(new, new_hash);
                    }
                    // shouldn't occur, but collapse into `Create`
                    (change @ Delete, Modify(new, new_hash)) => {
                        stdx::never!();
                        *change = Create(new, new_hash);
                    }
                    // shouldn't occur, but keep the `Create`
                    (prev @ Modify(_, _), new @ Create(_, _)) => *prev = new,
                }
                set_data(&o.get().change);
            }
            Entry::Vacant(v) => set_data(&v.insert(changed_file).change),
        };

        true
    }

    /// Drains and returns all the changes in the `Vfs`.
    pub fn take_changes(&mut self) -> IndexMap<FileId, ChangedFile, BuildHasherDefault<FxHasher>> {
        mem::take(&mut self.changes)
    }

    /// Provides a panic-less way to verify `file_id` validity.
    pub fn exists(&self, file_id: FileId) -> bool {
        matches!(self.get(file_id), FileState::Exists(_))
    }

    /// Returns the id associated with `path`
    ///
    /// - If `path` does not exist in the `Vfs`, allocates a new id for it, associated with a
    ///   deleted file;
    /// - Else, returns `path`'s id.
    ///
    /// Does not record a change.
    fn alloc_file_id(&mut self, path: VfsPath) -> FileId {
        let file_id = self.interner.intern(path);
        let idx = file_id.0 as usize;
        let len = self.data.len().max(idx + 1);
        self.data.resize(len, FileState::Deleted);
        file_id
    }

    /// Returns the status of the file associated with the given `file_id`.
    ///
    /// # Panics
    ///
    /// Panics if no file is associated with that id.
    fn get(&self, file_id: FileId) -> FileState {
        self.data[file_id.0 as usize]
    }
}

impl fmt::Debug for Vfs {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        f.debug_struct("Vfs").field("n_files", &self.data.len()).finish()
    }
}