mirror of
https://github.com/nushell/nushell
synced 2025-01-15 14:44:14 +00:00
36c1073441
# Description Closes #12535 Implements sort-by functionality of #8322 Fixes sort-by part of #8667 This PR does two main things: add a new cell path and closure parameter to `sort-by`, and attempt to make Nushell's sorting behavior well-defined. ## `sort-by` features The `columns` parameter is replaced with a `comparator` parameter, which can be a cell path or a closure. Examples are from docs PR. 1. Cell paths The basic interactive usage of `sort-by` is the same. For example, `ls | sort-by modified` still works the same as before. It is not quite a drop-in replacement, see [behavior changes](#behavior-changes). Here's an example of how the cell path comparator might be useful: ```nu > let cities = [ {name: 'New York', info: { established: 1624, population: 18_819_000 } } {name: 'Kyoto', info: { established: 794, population: 37_468_000 } } {name: 'São Paulo', info: { established: 1554, population: 21_650_000 } } ] > $cities | sort-by info.established ╭───┬───────────┬────────────────────────────╮ │ # │ name │ info │ ├───┼───────────┼────────────────────────────┤ │ 0 │ Kyoto │ ╭─────────────┬──────────╮ │ │ │ │ │ established │ 794 │ │ │ │ │ │ population │ 37468000 │ │ │ │ │ ╰─────────────┴──────────╯ │ │ 1 │ São Paulo │ ╭─────────────┬──────────╮ │ │ │ │ │ established │ 1554 │ │ │ │ │ │ population │ 21650000 │ │ │ │ │ ╰─────────────┴──────────╯ │ │ 2 │ New York │ ╭─────────────┬──────────╮ │ │ │ │ │ established │ 1624 │ │ │ │ │ │ population │ 18819000 │ │ │ │ │ ╰─────────────┴──────────╯ │ ╰───┴───────────┴────────────────────────────╯ ``` 2. Key closures You can supply a closure which will transform each value into a sorting key (without changing the underlying data). Here's an example of a key closure, where we want to sort a list of assignments by their average grade: ```nu > let assignments = [ {name: 'Homework 1', grades: [97 89 86 92 89] } {name: 'Homework 2', grades: [91 100 60 82 91] } {name: 'Exam 1', grades: [78 88 78 53 90] } {name: 'Project', grades: [92 81 82 84 83] } ] > $assignments | sort-by { get grades | math avg } ╭───┬────────────┬───────────────────────╮ │ # │ name │ grades │ ├───┼────────────┼───────────────────────┤ │ 0 │ Exam 1 │ [78, 88, 78, 53, 90] │ │ 1 │ Project │ [92, 81, 82, 84, 83] │ │ 2 │ Homework 2 │ [91, 100, 60, 82, 91] │ │ 3 │ Homework 1 │ [97, 89, 86, 92, 89] │ ╰───┴────────────┴───────────────────────╯ ``` 3. Custom sort closure The `--custom`, or `-c`, flag will tell `sort-by` to interpret closures as custom sort closures. A custom sort closure has two parameters, and returns a boolean. The closure should return `true` if the first parameter comes _before_ the second parameter in the sort order. For a simple example, we could rewrite a cell path sort as a custom sort (see [here](https://github.com/nushell/nushell.github.io/pull/1568/files#diff-a7a233e66a361d8665caf3887eb71d4288000001f401670c72b95cc23a948e86R231) for a more complex example): ```nu > ls | sort-by -c {|a, b| $a.size < $b.size } ╭───┬─────────────────────┬──────┬──────────┬────────────────╮ │ # │ name │ type │ size │ modified │ ├───┼─────────────────────┼──────┼──────────┼────────────────┤ │ 0 │ my-secret-plans.txt │ file │ 100 B │ 10 minutes ago │ │ 1 │ shopping_list.txt │ file │ 100 B │ 2 months ago │ │ 2 │ myscript.nu │ file │ 1.1 KiB │ 2 weeks ago │ │ 3 │ bigfile.img │ file │ 10.0 MiB │ 3 weeks ago │ ╰───┴─────────────────────┴──────┴──────────┴────────────────╯ ``` ## Making sort more consistent I think it's important for something as essential as `sort` to have well-defined semantics. This PR contains some changes to try to make the behavior of `sort` and `sort-by` consistent. In addition, after working with the internals of sorting code, I have a much deeper understanding of all of the edge cases. Here is my attempt to try to better define some of the semantics of sorting (if you are just interested in changes, skip to "User-Facing changes") - `sort`, `sort -v`, and `sort-by` now all work the same. Each individual sort implementation has been refactored into two functions in `sort_utils.rs`: `sort`, and `sort_by`. These can also be used in other parts of Nushell where values need to be sorted. - `sort` and `sort-by` used to handle `-i` and `-n` differently. - `sort -n` would consider all values which can't be coerced into a string to be equal - `sort-by -i` and `sort-by -n` would only work if all values were strings - In this PR, insensitive sort only affects comparison between strings, and natural sort only applies to numbers and strings (see below). - (not a change) Before and after this PR, `sort` and `sort-by` support sorting mixed types. There was a lot of discussion about potentially making `sort` and `sort-by` only work on lists of homogeneous types, but the general consensus was that `sort` should not error just because its input contains incompatible types. - In order to try to make working with data containing `null` values easier, I changed the PartialOrd order to sort `Nothing` values to the end of a list, regardless of what other types the list contains. Before, `null` would be sorted before `Binary`, `CellPath`, and `Custom` values. - (not a change) When sorted, lists of mixed types will contain sorted values of each type in order, for the most part - (not a change) For example, `[0x[1] (date now) "a" ("yesterday" | into datetime) "b" 0x[0]]` will be sorted as `["a", "b", a day ago, now, [0], [1]]`, where sorted strings appear first, then sorted datetimes, etc. - (not a change) The exception to this is `Int`s and `Float`s, which will intermix, `Strings` and `Glob`s, which will intermix, and `None` as described above. Additionally, natural sort will intermix strings with ints and floats (see below). - Natural sort no longer coerce all inputs to strings. - I did originally make natural only apply to strings, but @fdncred pointed out that the previous behavior also allowed you to sort numeric strings with numbers. This seems like a useful feature if we are trying to support sorting with mixed types, so I settled on coercing only numbers (int, float). This can be reverted if people don't like it. - Here is an example of this behavior in action, which is the same before and after this PR: ```nushell $ [1 "4" 3 "2"] | sort --natural ╭───┬───╮ │ 0 │ 1 │ │ 1 │ 2 │ │ 2 │ 3 │ │ 3 │ 4 │ ╰───┴───╯ ``` # User-Facing Changes ## New features - Replaces the `columns` string parameter of `sort-by` with a cell path or a closure. - The cell path parameter works exactly as you would expect - By default, the `closure` parameter acts as a "key sort"; that is, each element is transformed by the closure into a sorting key - With the `--custom` (`-c`) parameter, you can define a comparison function for completely custom sorting order. ## Behavior changes <details> <summary><code>sort -v</code> does not coerce record values to strings</summary> This was a bit of a surprising behavior, and is now unified with the behavior of `sort` and `sort-by`. Here's an example where you can observe the values being implicitly coerced into strings for sorting, as they are sorted like strings rather than numbers: Old behavior: ```nushell $ {foo: 9 bar: 10} | sort -v ╭─────┬────╮ │ bar │ 10 │ │ foo │ 9 │ ╰─────┴────╯ ``` New behavior: ```nushell $ {foo: 9 bar: 10} | sort -v ╭─────┬────╮ │ foo │ 9 │ │ bar │ 10 │ ╰─────┴────╯ ``` </details> <details> <summary>Changed <code>sort-by</code> parameters from <code>string</code> to <code>cell-path</code> or <code>closure</code>. Typical interactive usage is the same as before, but if passing a variable to <code>sort-by</code> it must be a cell path (or closure), not a string</summary> Old behavior: ```nushell $ let sort = "modified" $ ls | sort-by $sort ╭───┬──────┬──────┬──────┬────────────────╮ │ # │ name │ type │ size │ modified │ ├───┼──────┼──────┼──────┼────────────────┤ │ 0 │ foo │ file │ 0 B │ 10 hours ago │ │ 1 │ bar │ file │ 0 B │ 35 seconds ago │ ╰───┴──────┴──────┴──────┴────────────────╯ ``` New behavior: ```nushell $ let sort = "modified" $ ls | sort-by $sort Error: nu:🐚:type_mismatch × Type mismatch. ╭─[entry #10:1:14] 1 │ ls | sort-by $sort · ──┬── · ╰── Cannot sort using a value which is not a cell path or closure ╰──── $ let sort = $."modified" $ ls | sort-by $sort ╭───┬──────┬──────┬──────┬───────────────╮ │ # │ name │ type │ size │ modified │ ├───┼──────┼──────┼──────┼───────────────┤ │ 0 │ foo │ file │ 0 B │ 10 hours ago │ │ 1 │ bar │ file │ 0 B │ 2 minutes ago │ ╰───┴──────┴──────┴──────┴───────────────╯ ``` </details> <details> <summary>Insensitve and natural sorting behavior reworked</summary> Previously, the `-i` and `-n` worked differently for `sort` and `sort-by` (see "Making sort more consistent"). Here are examples of how these options result in different sorts now: 1. `sort -n` - Old behavior (types other than numbers, strings, dates, and binary sorted incorrectly) ```nushell $ [2sec 1sec] | sort -n ╭───┬──────╮ │ 0 │ 2sec │ │ 1 │ 1sec │ ╰───┴──────╯ ``` - New behavior ```nushell $ [2sec 1sec] | sort -n ╭───┬──────╮ │ 0 │ 1sec │ │ 1 │ 2sec │ ╰───┴──────╯ ``` 2. `sort-by -i` - Old behavior (uppercase words appear before lowercase words as they would in a typical sort, indicating this is not actually an insensitive sort) ```nushell $ ["BAR" "bar" "foo" 2 "FOO" 1] | wrap a | sort-by -i a ╭───┬─────╮ │ # │ a │ ├───┼─────┤ │ 0 │ 1 │ │ 1 │ 2 │ │ 2 │ BAR │ │ 3 │ FOO │ │ 4 │ bar │ │ 5 │ foo │ ╰───┴─────╯ ``` - New behavior (strings are sorted stably, indicating this is an insensitive sort) ```nushell $ ["BAR" "bar" "foo" 2 "FOO" 1] | wrap a | sort-by -i a ╭───┬─────╮ │ # │ a │ ├───┼─────┤ │ 0 │ 1 │ │ 1 │ 2 │ │ 2 │ BAR │ │ 3 │ bar │ │ 4 │ foo │ │ 5 │ FOO │ ╰───┴─────╯ ``` 3. `sort-by -n` - Old behavior (natural sort does not work when data contains non-string values) ```nushell $ ["10" 8 "9"] | wrap a | sort-by -n a ╭───┬────╮ │ # │ a │ ├───┼────┤ │ 0 │ 8 │ │ 1 │ 10 │ │ 2 │ 9 │ ╰───┴────╯ ``` - New behavior ```nushell $ ["10" 8 "9"] | wrap a | sort-by -n a ╭───┬────╮ │ # │ a │ ├───┼────┤ │ 0 │ 8 │ │ 1 │ 9 │ │ 2 │ 10 │ ╰───┴────╯ ``` </details> <details> <summary> Sorting a list of non-record values with a non-existent column/path now errors instead of sorting the values directly (<code>sort</code> should be used for this, not <code>sort-by</code>) </summary> Old behavior: ```nushell $ [2 1] | sort-by foo ╭───┬───╮ │ 0 │ 1 │ │ 1 │ 2 │ ╰───┴───╯ ``` New behavior: ```nushell $ [2 1] | sort-by foo Error: nu:🐚:incompatible_path_access × Data cannot be accessed with a cell path ╭─[entry #29:1:17] 1 │ [2 1] | sort-by foo · ─┬─ · ╰── int doesn't support cell paths ╰──── ``` </details> <details> <summary><code>sort</code> and <code>sort-by</code> output <code>List</code> instead of <code>ListStream</code> </summary> This isn't a meaningful change (unless I misunderstand the purpose of ListStream), since `sort` and `sort-by` both need to collect in order to do the sorting anyway, but is user observable. Old behavior: ```nushell $ ls | sort | describe -d ╭──────────┬───────────────────╮ │ type │ stream │ │ origin │ nushell │ │ subtype │ {record 3 fields} │ │ metadata │ {record 1 field} │ ╰──────────┴───────────────────╯ ``` ```nushell $ ls | sort-by name | describe -d ╭──────────┬───────────────────╮ │ type │ stream │ │ origin │ nushell │ │ subtype │ {record 3 fields} │ │ metadata │ {record 1 field} │ ╰──────────┴───────────────────╯ ``` New behavior: ```nushell ls | sort | describe -d ╭────────┬─────────────────╮ │ type │ list │ │ length │ 22 │ │ values │ [table 22 rows] │ ╰────────┴─────────────────╯ ``` ```nushell $ ls | sort-by name | describe -d ╭────────┬─────────────────╮ │ type │ list │ │ length │ 22 │ │ values │ [table 22 rows] │ ╰────────┴─────────────────╯ ``` </details> - `sort` now errors when nothing is piped in (`sort-by` already did this) # Tests + Formatting I added lots of unit tests on the new sort implementation to enforce new sort behaviors and prevent regressions. # After Submitting See [docs PR](https://github.com/nushell/nushell.github.io/pull/1568), which is ~2/3 finished. --------- Co-authored-by: NotTheDr01ds <32344964+NotTheDr01ds@users.noreply.github.com> Co-authored-by: Ian Manske <ian.manske@pm.me>
554 lines
17 KiB
Rust
554 lines
17 KiB
Rust
use nu_command::{sort, sort_by, sort_record, Comparator};
|
|
use nu_protocol::{
|
|
ast::{CellPath, PathMember},
|
|
record, Record, Span, Value,
|
|
};
|
|
|
|
#[test]
|
|
fn test_sort_basic() {
|
|
let mut list = vec![
|
|
Value::test_string("foo"),
|
|
Value::test_int(2),
|
|
Value::test_int(3),
|
|
Value::test_string("bar"),
|
|
Value::test_int(1),
|
|
Value::test_string("baz"),
|
|
];
|
|
|
|
assert!(sort(&mut list, false, false).is_ok());
|
|
assert_eq!(
|
|
list,
|
|
vec![
|
|
Value::test_int(1),
|
|
Value::test_int(2),
|
|
Value::test_int(3),
|
|
Value::test_string("bar"),
|
|
Value::test_string("baz"),
|
|
Value::test_string("foo")
|
|
]
|
|
);
|
|
}
|
|
|
|
#[test]
|
|
fn test_sort_nothing() {
|
|
// Nothing values should always be sorted to the end of any list
|
|
let mut list = vec![
|
|
Value::test_int(1),
|
|
Value::test_nothing(),
|
|
Value::test_int(2),
|
|
Value::test_string("foo"),
|
|
Value::test_nothing(),
|
|
Value::test_string("bar"),
|
|
];
|
|
|
|
assert!(sort(&mut list, false, false).is_ok());
|
|
assert_eq!(
|
|
list,
|
|
vec![
|
|
Value::test_int(1),
|
|
Value::test_int(2),
|
|
Value::test_string("bar"),
|
|
Value::test_string("foo"),
|
|
Value::test_nothing(),
|
|
Value::test_nothing()
|
|
]
|
|
);
|
|
|
|
// Ensure that nothing values are sorted after *all* types,
|
|
// even types which may follow `Nothing` in the PartialOrd order
|
|
|
|
// unstable_name_collision
|
|
// can be switched to std intersperse when stabilized
|
|
let mut values: Vec<Value> =
|
|
itertools::intersperse(Value::test_values(), Value::test_nothing()).collect();
|
|
|
|
let nulls = values
|
|
.iter()
|
|
.filter(|item| item == &&Value::test_nothing())
|
|
.count();
|
|
|
|
assert!(sort(&mut values, false, false).is_ok());
|
|
|
|
// check if the last `nulls` values of the sorted list are indeed null
|
|
assert_eq!(&values[(nulls - 1)..], vec![Value::test_nothing(); nulls])
|
|
}
|
|
|
|
#[test]
|
|
fn test_sort_natural_basic() {
|
|
let mut list = vec![
|
|
Value::test_string("foo99"),
|
|
Value::test_string("foo9"),
|
|
Value::test_string("foo1"),
|
|
Value::test_string("foo100"),
|
|
Value::test_string("foo10"),
|
|
Value::test_string("1"),
|
|
Value::test_string("10"),
|
|
Value::test_string("100"),
|
|
Value::test_string("9"),
|
|
Value::test_string("99"),
|
|
];
|
|
|
|
assert!(sort(&mut list, false, false).is_ok());
|
|
assert_eq!(
|
|
list,
|
|
vec![
|
|
Value::test_string("1"),
|
|
Value::test_string("10"),
|
|
Value::test_string("100"),
|
|
Value::test_string("9"),
|
|
Value::test_string("99"),
|
|
Value::test_string("foo1"),
|
|
Value::test_string("foo10"),
|
|
Value::test_string("foo100"),
|
|
Value::test_string("foo9"),
|
|
Value::test_string("foo99"),
|
|
]
|
|
);
|
|
|
|
assert!(sort(&mut list, false, true).is_ok());
|
|
assert_eq!(
|
|
list,
|
|
vec![
|
|
Value::test_string("1"),
|
|
Value::test_string("9"),
|
|
Value::test_string("10"),
|
|
Value::test_string("99"),
|
|
Value::test_string("100"),
|
|
Value::test_string("foo1"),
|
|
Value::test_string("foo9"),
|
|
Value::test_string("foo10"),
|
|
Value::test_string("foo99"),
|
|
Value::test_string("foo100"),
|
|
]
|
|
);
|
|
}
|
|
|
|
#[test]
|
|
fn test_sort_natural_mixed_types() {
|
|
let mut list = vec![
|
|
Value::test_string("1"),
|
|
Value::test_int(99),
|
|
Value::test_int(1),
|
|
Value::test_float(1000.0),
|
|
Value::test_int(9),
|
|
Value::test_string("9"),
|
|
Value::test_int(100),
|
|
Value::test_string("99"),
|
|
Value::test_float(2.0),
|
|
Value::test_string("100"),
|
|
Value::test_int(10),
|
|
Value::test_string("10"),
|
|
];
|
|
|
|
assert!(sort(&mut list, false, false).is_ok());
|
|
assert_eq!(
|
|
list,
|
|
vec![
|
|
Value::test_int(1),
|
|
Value::test_float(2.0),
|
|
Value::test_int(9),
|
|
Value::test_int(10),
|
|
Value::test_int(99),
|
|
Value::test_int(100),
|
|
Value::test_float(1000.0),
|
|
Value::test_string("1"),
|
|
Value::test_string("10"),
|
|
Value::test_string("100"),
|
|
Value::test_string("9"),
|
|
Value::test_string("99")
|
|
]
|
|
);
|
|
|
|
assert!(sort(&mut list, false, true).is_ok());
|
|
assert_eq!(
|
|
list,
|
|
vec![
|
|
Value::test_int(1),
|
|
Value::test_string("1"),
|
|
Value::test_float(2.0),
|
|
Value::test_int(9),
|
|
Value::test_string("9"),
|
|
Value::test_int(10),
|
|
Value::test_string("10"),
|
|
Value::test_int(99),
|
|
Value::test_string("99"),
|
|
Value::test_int(100),
|
|
Value::test_string("100"),
|
|
Value::test_float(1000.0),
|
|
]
|
|
);
|
|
}
|
|
|
|
#[test]
|
|
fn test_sort_natural_no_numeric_values() {
|
|
// If list contains no numeric strings, it should be sorted the
|
|
// same with or without natural sorting
|
|
let mut normal = vec![
|
|
Value::test_string("golf"),
|
|
Value::test_bool(false),
|
|
Value::test_string("alfa"),
|
|
Value::test_string("echo"),
|
|
Value::test_int(7),
|
|
Value::test_int(10),
|
|
Value::test_bool(true),
|
|
Value::test_string("uniform"),
|
|
Value::test_int(3),
|
|
Value::test_string("tango"),
|
|
];
|
|
let mut natural = normal.clone();
|
|
|
|
assert!(sort(&mut normal, false, false).is_ok());
|
|
assert!(sort(&mut natural, false, true).is_ok());
|
|
assert_eq!(normal, natural);
|
|
}
|
|
|
|
#[test]
|
|
fn test_sort_natural_type_order() {
|
|
// This test is to prevent regression to a previous natural sort behavior
|
|
// where values of different types would be intermixed.
|
|
// Only numeric values (ints, floats, and numeric strings) should be intermixed
|
|
//
|
|
// This list would previously be incorrectly sorted like this:
|
|
// ╭────┬─────────╮
|
|
// │ 0 │ 1 │
|
|
// │ 1 │ golf │
|
|
// │ 2 │ false │
|
|
// │ 3 │ 7 │
|
|
// │ 4 │ 10 │
|
|
// │ 5 │ alfa │
|
|
// │ 6 │ true │
|
|
// │ 7 │ uniform │
|
|
// │ 8 │ true │
|
|
// │ 9 │ 3 │
|
|
// │ 10 │ false │
|
|
// │ 11 │ tango │
|
|
// ╰────┴─────────╯
|
|
|
|
let mut list = vec![
|
|
Value::test_string("golf"),
|
|
Value::test_int(1),
|
|
Value::test_bool(false),
|
|
Value::test_string("alfa"),
|
|
Value::test_int(7),
|
|
Value::test_int(10),
|
|
Value::test_bool(true),
|
|
Value::test_string("uniform"),
|
|
Value::test_bool(true),
|
|
Value::test_int(3),
|
|
Value::test_bool(false),
|
|
Value::test_string("tango"),
|
|
];
|
|
|
|
assert!(sort(&mut list, false, true).is_ok());
|
|
assert_eq!(
|
|
list,
|
|
vec![
|
|
Value::test_bool(false),
|
|
Value::test_bool(false),
|
|
Value::test_bool(true),
|
|
Value::test_bool(true),
|
|
Value::test_int(1),
|
|
Value::test_int(3),
|
|
Value::test_int(7),
|
|
Value::test_int(10),
|
|
Value::test_string("alfa"),
|
|
Value::test_string("golf"),
|
|
Value::test_string("tango"),
|
|
Value::test_string("uniform")
|
|
]
|
|
);
|
|
|
|
// Only ints, floats, and numeric strings should be intermixed
|
|
// While binary primitives and datetimes can be coerced into strings, it doesn't make sense to sort them with numbers
|
|
// Binary primitives can hold multiple values, not just one, so shouldn't be compared to single values
|
|
// Datetimes don't have a single obvious numeric representation, and if we chose one it would be ambiguous to the user
|
|
|
|
let year_three = chrono::NaiveDate::from_ymd_opt(3, 1, 1)
|
|
.unwrap()
|
|
.and_hms_opt(0, 0, 0)
|
|
.unwrap()
|
|
.and_utc();
|
|
|
|
let mut list = vec![
|
|
Value::test_int(10),
|
|
Value::test_float(6.0),
|
|
Value::test_int(1),
|
|
Value::test_binary([3]),
|
|
Value::test_string("2"),
|
|
Value::test_date(year_three.into()),
|
|
Value::test_int(4),
|
|
Value::test_binary([52]),
|
|
Value::test_float(9.0),
|
|
Value::test_string("5"),
|
|
Value::test_date(chrono::DateTime::UNIX_EPOCH.into()),
|
|
Value::test_int(7),
|
|
Value::test_string("8"),
|
|
Value::test_float(3.0),
|
|
Value::test_string("foobar"),
|
|
];
|
|
assert!(sort(&mut list, false, true).is_ok());
|
|
assert_eq!(
|
|
list,
|
|
vec![
|
|
Value::test_int(1),
|
|
Value::test_string("2"),
|
|
Value::test_float(3.0),
|
|
Value::test_int(4),
|
|
Value::test_string("5"),
|
|
Value::test_float(6.0),
|
|
Value::test_int(7),
|
|
Value::test_string("8"),
|
|
Value::test_float(9.0),
|
|
Value::test_int(10),
|
|
Value::test_string("foobar"),
|
|
// the ordering of date and binary here may change if the PartialOrd order is changed,
|
|
// but they should not be intermixed with the above
|
|
Value::test_date(year_three.into()),
|
|
Value::test_date(chrono::DateTime::UNIX_EPOCH.into()),
|
|
Value::test_binary([3]),
|
|
Value::test_binary([52]),
|
|
]
|
|
);
|
|
}
|
|
|
|
#[test]
|
|
fn test_sort_insensitive() {
|
|
// Test permutations between insensitive and natural
|
|
// Ensure that strings with equal insensitive orderings
|
|
// are sorted stably. (FOO then foo, bar then BAR)
|
|
let source = vec![
|
|
Value::test_string("FOO"),
|
|
Value::test_string("foo"),
|
|
Value::test_int(100),
|
|
Value::test_string("9"),
|
|
Value::test_string("bar"),
|
|
Value::test_int(10),
|
|
Value::test_string("baz"),
|
|
Value::test_string("BAR"),
|
|
];
|
|
let mut list;
|
|
|
|
// sensitive + non-natural
|
|
list = source.clone();
|
|
assert!(sort(&mut list, false, false).is_ok());
|
|
assert_eq!(
|
|
list,
|
|
vec![
|
|
Value::test_int(10),
|
|
Value::test_int(100),
|
|
Value::test_string("9"),
|
|
Value::test_string("BAR"),
|
|
Value::test_string("FOO"),
|
|
Value::test_string("bar"),
|
|
Value::test_string("baz"),
|
|
Value::test_string("foo"),
|
|
]
|
|
);
|
|
|
|
// sensitive + natural
|
|
list = source.clone();
|
|
assert!(sort(&mut list, false, true).is_ok());
|
|
assert_eq!(
|
|
list,
|
|
vec![
|
|
Value::test_string("9"),
|
|
Value::test_int(10),
|
|
Value::test_int(100),
|
|
Value::test_string("BAR"),
|
|
Value::test_string("FOO"),
|
|
Value::test_string("bar"),
|
|
Value::test_string("baz"),
|
|
Value::test_string("foo"),
|
|
]
|
|
);
|
|
|
|
// insensitive + non-natural
|
|
list = source.clone();
|
|
assert!(sort(&mut list, true, false).is_ok());
|
|
assert_eq!(
|
|
list,
|
|
vec![
|
|
Value::test_int(10),
|
|
Value::test_int(100),
|
|
Value::test_string("9"),
|
|
Value::test_string("bar"),
|
|
Value::test_string("BAR"),
|
|
Value::test_string("baz"),
|
|
Value::test_string("FOO"),
|
|
Value::test_string("foo"),
|
|
]
|
|
);
|
|
|
|
// insensitive + natural
|
|
list = source.clone();
|
|
assert!(sort(&mut list, true, true).is_ok());
|
|
assert_eq!(
|
|
list,
|
|
vec![
|
|
Value::test_string("9"),
|
|
Value::test_int(10),
|
|
Value::test_int(100),
|
|
Value::test_string("bar"),
|
|
Value::test_string("BAR"),
|
|
Value::test_string("baz"),
|
|
Value::test_string("FOO"),
|
|
Value::test_string("foo"),
|
|
]
|
|
);
|
|
}
|
|
|
|
// Helper function to assert that two records are equal
|
|
// with their key-value pairs in the same order
|
|
fn assert_record_eq(a: Record, b: Record) {
|
|
assert_eq!(
|
|
a.into_iter().collect::<Vec<_>>(),
|
|
b.into_iter().collect::<Vec<_>>(),
|
|
)
|
|
}
|
|
|
|
#[test]
|
|
fn test_sort_record_keys() {
|
|
// Basic record sort test
|
|
let record = record! {
|
|
"golf" => Value::test_string("bar"),
|
|
"alfa" => Value::test_string("foo"),
|
|
"echo" => Value::test_int(123),
|
|
};
|
|
|
|
let sorted = sort_record(record, false, false, false, false).unwrap();
|
|
assert_record_eq(
|
|
sorted,
|
|
record! {
|
|
"alfa" => Value::test_string("foo"),
|
|
"echo" => Value::test_int(123),
|
|
"golf" => Value::test_string("bar"),
|
|
},
|
|
);
|
|
}
|
|
|
|
#[test]
|
|
fn test_sort_record_values() {
|
|
// This test is to prevent a regression where integers and strings would be
|
|
// intermixed non-naturally when sorting a record by value without the natural flag:
|
|
//
|
|
// This record would previously be incorrectly sorted like this:
|
|
// ╭─────────┬─────╮
|
|
// │ alfa │ 1 │
|
|
// │ charlie │ 1 │
|
|
// │ india │ 10 │
|
|
// │ juliett │ 10 │
|
|
// │ foxtrot │ 100 │
|
|
// │ hotel │ 100 │
|
|
// │ delta │ 9 │
|
|
// │ echo │ 9 │
|
|
// │ bravo │ 99 │
|
|
// │ golf │ 99 │
|
|
// ╰─────────┴─────╯
|
|
|
|
let record = record! {
|
|
"alfa" => Value::test_string("1"),
|
|
"bravo" => Value::test_int(99),
|
|
"charlie" => Value::test_int(1),
|
|
"delta" => Value::test_int(9),
|
|
"echo" => Value::test_string("9"),
|
|
"foxtrot" => Value::test_int(100),
|
|
"golf" => Value::test_string("99"),
|
|
"hotel" => Value::test_string("100"),
|
|
"india" => Value::test_int(10),
|
|
"juliett" => Value::test_string("10"),
|
|
};
|
|
|
|
// non-natural sort
|
|
let sorted = sort_record(record.clone(), true, false, false, false).unwrap();
|
|
assert_record_eq(
|
|
sorted,
|
|
record! {
|
|
"charlie" => Value::test_int(1),
|
|
"delta" => Value::test_int(9),
|
|
"india" => Value::test_int(10),
|
|
"bravo" => Value::test_int(99),
|
|
"foxtrot" => Value::test_int(100),
|
|
"alfa" => Value::test_string("1"),
|
|
"juliett" => Value::test_string("10"),
|
|
"hotel" => Value::test_string("100"),
|
|
"echo" => Value::test_string("9"),
|
|
"golf" => Value::test_string("99"),
|
|
},
|
|
);
|
|
|
|
// natural sort
|
|
let sorted = sort_record(record.clone(), true, false, false, true).unwrap();
|
|
assert_record_eq(
|
|
sorted,
|
|
record! {
|
|
"alfa" => Value::test_string("1"),
|
|
"charlie" => Value::test_int(1),
|
|
"delta" => Value::test_int(9),
|
|
"echo" => Value::test_string("9"),
|
|
"india" => Value::test_int(10),
|
|
"juliett" => Value::test_string("10"),
|
|
"bravo" => Value::test_int(99),
|
|
"golf" => Value::test_string("99"),
|
|
"foxtrot" => Value::test_int(100),
|
|
"hotel" => Value::test_string("100"),
|
|
},
|
|
);
|
|
}
|
|
|
|
#[test]
|
|
fn test_sort_equivalent() {
|
|
// Ensure that sort, sort_by, and record sort have equivalent sorting logic
|
|
let phonetic = vec![
|
|
"alfa", "bravo", "charlie", "delta", "echo", "foxtrot", "golf", "hotel", "india",
|
|
"juliett", "kilo", "lima", "mike", "november", "oscar", "papa", "quebec", "romeo",
|
|
"sierra", "tango", "uniform", "victor", "whiskey", "xray", "yankee", "zulu",
|
|
];
|
|
|
|
// filter out errors, since we can't sort_by on those
|
|
let mut values: Vec<Value> = Value::test_values()
|
|
.into_iter()
|
|
.filter(|val| !matches!(val, Value::Error { .. }))
|
|
.collect();
|
|
|
|
// reverse sort test values
|
|
values.sort_by(|a, b| b.partial_cmp(a).unwrap());
|
|
|
|
let mut list = values.clone();
|
|
let mut table: Vec<Value> = values
|
|
.clone()
|
|
.into_iter()
|
|
.map(|val| Value::test_record(record! { "value" => val }))
|
|
.collect();
|
|
let record = Record::from_iter(phonetic.into_iter().map(str::to_string).zip(values));
|
|
|
|
let comparator = Comparator::CellPath(CellPath {
|
|
members: vec![PathMember::String {
|
|
val: "value".to_string(),
|
|
span: Span::test_data(),
|
|
optional: false,
|
|
}],
|
|
});
|
|
|
|
assert!(sort(&mut list, false, false).is_ok());
|
|
assert!(sort_by(
|
|
&mut table,
|
|
vec![comparator],
|
|
Span::test_data(),
|
|
false,
|
|
false
|
|
)
|
|
.is_ok());
|
|
|
|
let record_sorted = sort_record(record.clone(), true, false, false, false).unwrap();
|
|
let record_vals: Vec<Value> = record_sorted.into_iter().map(|pair| pair.1).collect();
|
|
|
|
let table_vals: Vec<Value> = table
|
|
.clone()
|
|
.into_iter()
|
|
.map(|record| record.into_record().unwrap().remove("value").unwrap())
|
|
.collect();
|
|
|
|
assert_eq!(list, record_vals);
|
|
assert_eq!(record_vals, table_vals);
|
|
// list == table_vals by transitive property
|
|
}
|