nushell/crates
Jack Wright f879c00f9d
The ability to specify a schema when using dfr open and dfr into-df (#11634)
# Description

There are times where explicitly specifying a schema for a dataframe is
needed such as:
- Opening CSV and JSON lines files and needing provide more information
to polars to keep it from failing or in a desire to override default
type conversion
- When converting a nushell value to a dataframe and wanting to override
the default conversion behaviors.

This pull requests provides:
- A flag to allow specifying a schema when using dfr into-df
- A flag to allow specifying a schema when using dfr open that works for
CSV and JSON types
- A new command `dfr schema` which displays schema information and will
allow display support schema dtypes

Schema is specified creating a record that has the key value and the
dtype. Examples usages:

```
{a:1, b:{a:2}} | dfr into-df -s {a: u8, b: {a: i32}} | dfr schema
{a: 1, b: {a: [1 2 3]}, c: [a b c]} | dfr into-df -s {a: u8, b: {a: list<u64>}, c: list<str>} | dfr schema
 dfr open -s {pid: i32, ppid: i32, name: str, status: str, cpu: f64, mem: i64, virtual: i64} /tmp/ps.jsonl  | dfr schema
```

Supported dtypes:
null                                                   
bool                                                   
u8                                                     
u16                                                    
u32                                                    
u64                                                    
i8                                                     
i16                                                    
i32                                                    
i64                                                    
f32                                                    
f64                                                    
str                                                    
binary                                                 
date                                                   
datetime[time_unit: (ms, us, ns) timezone (optional)]  
duration[time_unit: (ms, us, ns)]                      
time                                                   
object                                                 
unknown                                                
list[dtype]


structs are also supported but are specified via another record:
{a: u8, b: {d: str}}

Another feature with the dfr schema command is that it returns the data
back in a format that can be passed to provide a valid schema that can
be passed in as schema argument:

<img width="638" alt="Screenshot 2024-01-29 at 10 23 58"
src="https://github.com/nushell/nushell/assets/56345/b49c3bff-5cda-4c86-975a-dfd91d991373">

---------

Co-authored-by: Jack Wright <jack.wright@disqo.com>
2024-01-29 13:26:04 -06:00
..
nu-cli add match-text style + config setting for ide menu (#11670) 2024-01-29 09:59:01 -06:00
nu-cmd-base Unify glob behavior on open, rm, cp-old, mv, umv, cp and du commands (#11621) 2024-01-26 21:57:35 +08:00
nu-cmd-dataframe The ability to specify a schema when using dfr open and dfr into-df (#11634) 2024-01-29 13:26:04 -06:00
nu-cmd-extra update nu-ansi-term to 0.50, lscolors to 0.17, and add the Style attribute to Suggestion (#11635) 2024-01-24 20:57:15 -08:00
nu-cmd-lang update nu-ansi-term to 0.50, lscolors to 0.17, and add the Style attribute to Suggestion (#11635) 2024-01-24 20:57:15 -08:00
nu-color-config update nu-ansi-term to 0.50, lscolors to 0.17, and add the Style attribute to Suggestion (#11635) 2024-01-24 20:57:15 -08:00
nu-command use constant instead of <0 for ls fix (#11642) 2024-01-29 13:17:04 -06:00
nu-engine add $.extra_usage to modules (#11649) 2024-01-27 17:49:21 +02:00
nu-explore update nu-ansi-term to 0.50, lscolors to 0.17, and add the Style attribute to Suggestion (#11635) 2024-01-24 20:57:15 -08:00
nu-glob Bump to dev version 0.89.1 (#11513) 2024-01-11 00:19:21 +13:00
nu-json Bump to dev version 0.89.1 (#11513) 2024-01-11 00:19:21 +13:00
nu-lsp Fix "Char index out of bounds" Error (#11526) 2024-01-11 15:24:49 -06:00
nu-parser Unify glob behavior on open, rm, cp-old, mv, umv, cp and du commands (#11621) 2024-01-26 21:57:35 +08:00
nu-path Bump to dev version 0.89.1 (#11513) 2024-01-11 00:19:21 +13:00
nu-plugin Plugin explicit flags (#11581) 2024-01-22 15:00:43 -06:00
nu-pretty-hex update nu-ansi-term to 0.50, lscolors to 0.17, and add the Style attribute to Suggestion (#11635) 2024-01-24 20:57:15 -08:00
nu-protocol Unify glob behavior on open, rm, cp-old, mv, umv, cp and du commands (#11621) 2024-01-26 21:57:35 +08:00
nu-std cleanup hide testing PR (#11638) 2024-01-25 06:49:04 -06:00
nu-system Fix tarpaulin skip attribute to latest (#11552) 2024-01-17 07:26:34 -06:00
nu-table update nu-ansi-term to 0.50, lscolors to 0.17, and add the Style attribute to Suggestion (#11635) 2024-01-24 20:57:15 -08:00
nu-term-grid Bump to dev version 0.89.1 (#11513) 2024-01-11 00:19:21 +13:00
nu-test-support Remove duplicate which 4.4.2 (#11613) 2024-01-22 09:28:47 -06:00
nu-utils add match-text style + config setting for ide menu (#11670) 2024-01-29 09:59:01 -06:00
nu_plugin_custom_values Allow plugins to receive configuration from the nushell configuration (#10955) 2024-01-15 16:59:47 +08:00
nu_plugin_example Plugin explicit flags (#11581) 2024-01-22 15:00:43 -06:00
nu_plugin_formats Bump ical from 0.9.0 to 0.10.0 (#11661) 2024-01-29 09:34:06 +08:00
nu_plugin_gstat Allow plugins to receive configuration from the nushell configuration (#10955) 2024-01-15 16:59:47 +08:00
nu_plugin_inc Plugin explicit flags (#11581) 2024-01-22 15:00:43 -06:00
nu_plugin_python remove vectorize_over_list from python plugin (#9905) 2023-08-03 16:46:48 +02:00
nu_plugin_query Plugin explicit flags (#11581) 2024-01-22 15:00:43 -06:00
README.md Remove old nushell/merge engine-q 2022-02-07 14:54:06 -05:00

Nushell core libraries and plugins

These sub-crates form both the foundation for Nu and a set of plugins which extend Nu with additional functionality.

Foundational libraries are split into two kinds of crates:

  • Core crates - those crates that work together to build the Nushell language engine
  • Support crates - a set of crates that support the engine with additional features like JSON support, ANSI support, and more.

Plugins are likewise also split into two types:

  • Core plugins - plugins that provide part of the default experience of Nu, including access to the system properties, processes, and web-connectivity features.
  • Extra plugins - these plugins run a wide range of different capabilities like working with different file types, charting, viewing binary data, and more.