Dipper
, (*1)
Dipper is fast YAML parser that parses for the more commonly-used subset of YAMLâs v1.0 and v1.2 official specifications. It is being made freely available for use in personal or commercial projects under the BSD 3-clause license., (*2)
Philosophy
One has to question themselves when they sit down to create a YAML parser considering there are already a number of nice solutions out there. Thereâs SPYC, which does a wonderful job parsing for YAML 1.0, and Symfony also has a YAML parser that parses to YAML 1.2 specifications. Neither project claims to completely support all aspects of the specs that they repsectively parse for (the specs are quite large and in-depth), but they both support a large subset of the defined features., (*3)
But it occurred to us that we rarely use most of those features. Perhaps weâre not YAML power-users, but the subset we find ourselves using mostly is the same YAML that those libraries use when they convert straight PHP back to YAML. Mostly: normal key/value pairs (strings, scalars, numbers & booleans), lists, and maps., (*4)
Part of YAMLâs design was defining a somewhat complex-to-parse syntax in exchange for simple (yet powerful) formatting for human readability. We found that a lot of the parsing complexity came in supporting features above and beyond the simple subset of YAML that we actually use., (*5)
And thus, Dipper was born. Itâs built for speed, micro-optimized to parse the parts of YAML that we actually use, and nothing more., (*6)
Results
Weâve run a couple of benchmarks to make sure that Dipper is quick and thus, is worth releasing, and here is what we found. Keep in mind that weâre not scientists, but are here as an example of comparison., (*7)
Parser | YAML->PHP | PHP->YAML
------------+-------------+-------------
SPYC | ~22ms | ~17ms
Symfony | ~24ms | ~12ms
Dipper | ~10ms | ~ 3ms
We ran the same 500-line YAML file through each of the parsers described in this document 250 times. We then took the average time for each parser to parse the document to get the YAML->PHP
time. Next, we parsed the 500-line YAML file into PHP and then converted that back into YAML 250 times per parser. The average time to convert it back are the PHP->YAML
times., (*8)
As you can see by these times, Dipper comes out ahead in both parsing YAML and building it back up., (*9)
Usage
Dipper performs two tasks: it converts well-formed YAML into PHP, and it converts PHP into well-formed YAML., (*10)
// include Dipper and ready it for use
require('Dipper.php');
use secondparty\Dipper\Dipper as Dipper;
// now you can convert YAML into PHP
$php = Dipper::parse($yaml);
// or you can convert PHP into YAML
$yaml = Dipper::make($php);
Thatâs all there is to it., (*11)
What It Will Parse
Below is a complete list of the subset of YAML that Dipper will parse., (*12)
Strings
string: this is a string
single_quoted_string: 'this is a single-quoted string'
double_quoted_string: "this is a double-quoted string"
string_with_single_quote: this's a string with a single quote
string_with_colon: 'this is a string: containing a colon'
string_with_comma: F j, Y
quoted_number: "120"
quoted_true: "true"
quoted_false: "false"
url: http://something.com
Scalars
literal_scalar: |
This is a scalar
that will preserve
its line breaks.
folding_scalar: >
This is a scalar
that will fold up
line breaks.
Numbers
integer: 42 # becomes an integer
float: -12.12 # becomes a float
octal: 0755 # YAML 1.0-style, becomes an integer, converted from octal
also_octal: 0o755 # YAML 1.2-style, becomes an integer, converted from octal
hex: 0xff # becomes an integer, converted from hexadecimal
infinite: (inf) # YAML 1.0-style, becomes INF, PHP constant for infinity
minus_inf: (-inf) # YAML 1.0-style, becomes -INF, PHP constant for negative infinity
not_a_number: (NaN) # YAML 1.0-style, becomes NAN, PHP constant for not-a-number
also_infinity: .inf # YAML 1.2-style, becomes INF, PHP constant for infinity
also_minus_inf: -.inf # YAML 1.2-style, becomes -INF, PHP constant for negative infinity
also_nan: .NaN # YAML 1.2-style, becomes NAN, PHP constant for not-a-number
Booleans & Null Values
bool_true: true # becomes true (as a boolean)
bool_false: false # becomes false (as a boolean)
null_value: null # becomes a PHP null value
shorthand_null: ~ # becomes a PHP null value
empty_value: # becomes a PHP null value
Lists
regular_list:
- first item
- second item
- third item
shorthand_list: [ first item, second item, third item ]
Maps
regular_map:
one: first
two: second
shorthand_map: { one: first, two: second }
Combinations of These
In addition to each of these elements individually, you can also combine and nest them as youâd expect to create more complex structures.
Shorthand versions of lists and maps shouldnât nest other lists or maps., (*13)
What it Makes
Below is a complete list of the PHP that Dipper will build from the YAML passed to it., (*14)
- strings
- integers
- floats (including any float constants)
- booleans
- null values
- empty strings
- sequential arrays (into lists)
- associative arrays (into maps)
- objects (if theyâve implemented
__toString
)
Notes
- Like SPYC and Symfonyâs code, Dipper also supports the
syck
YAML parsing extension for PHP if itâs installed and enabled on your server. This moves YAML parsing down to the system level, resulting in parsing that is much, much faster than what straight PHP code itself can deliver.
- In addition to YAML, we also really like Markdown. To better support Markdown, literal scalars will not right-trim each line for extra whitespace, allowing you to define Markdown-style new lines by ending a line with two spaces.
Thanks
A special thank you to Thomas Weinert for doing the leg work of getting dipper into the composer, travis, and phpunit arenas., (*15)