Introduction to Pkg Update
Welcome to Pkg Update. This new blog focuses on the Julia package ecosystem: how packages/organizations are changing, what new projects you should be aware of, and how Base issues are propagating through the package-sphere. I hope that this will be a nice summary of the vast changes that happen throughout the Julia ecosystem. I want this to be a resource which both package users and package developers could read to get a general understanding of the big movements in the community. The goal is to link together issues/PRs/discussions from the the vast number of repositories to give a coherent idea of what changes to anticipate and congratulate. As Base continues to slim (for those who don't know, there's a trend for Base to become slimmer for a leaner install, with more functionality provided by the Julia organizations such as JuliaMath), I hope this helps you stay up to date with where everything is going!
While I try my best to be active throughout the Julia-sphere, this is not something that can be done alone. Please feel free to contact me via email, Twitter, or Gitter to give a summary of what's going on in your organization / area of expertise. Also, if anyone is willing to join the Pkg Update team, I would be happy to have you! I am looking for correspondents to represent different Julia organizations and to help summarize the changes.
That said, here's a quick highlight of some recent updates both users and developers might need to know.
JuliaMath: Exciting Developments in LibM.jl
I have to give a shoutout to @musm for his incredible work in Libm.jl. This project started with a discussion on whether Julia should continue to use OpenLibm (Libm is the math library which supplies the approximations for common functions like "sin" and "exp"). From this discussion the ambitious idea of creating a pure-Julia Libm was put forward, and Libm.jl has continued with the momentum.
Sleef is a C library notable for it's use of SIMD. Libm.jl is largely based on a port of that code to Julia, it also has some added safety and correctness for edge cases over the original code. While the ported code is based on the non-SIMD parts of the C source, it reliably takes advantage f SIMD hardware none-the-less -- thanks to LLVM autovectorisation . The code is particularly vectorisation friendly, as it uses next-to-no branches or table lookups Most of the benchmarks for this implementation are within 2x of C programs calling Sleef. In practice, because of how lining and FFI works, this means that using the julia Sleef implementation is significantly faster than ccalling the C sleef library. Not to mention safer.
Other libraries (Musl and Cephes) now have partial ports to Julia as well. There is work being done to use the strategies of these libraries together to build an even better Libm which might be dubbed Amal.jl. This library, being a Julia implementation, will have much broader support for the type system, including native implementations for Float16 and quad-precision floats, leading to a much more robust mathematical foundation.
JuliaStats API Decision: Degrees of Freedom as "dof"
This issue and the associated PR went through rather quickly. It highlighted the fact that until now, degrees of freedom in the stats packages had a tendency to use "df", which is a naming conflict with using "df" as an instance for a DataFrame (a very common usage). The decision was to change the naming for degrees of freedom to dof. Since this change is in StatsBase, you can expect that this convention will propagate throughout the statistics stack: "df" for dataframes and "dof" for degrees of freedom.
JuliaDiffEq: Upcoming Sundials API Update
Sundials.jl, the interface to the popular ODE suite, has recently undergone a large overhaul to help improve the wrapper and to stop memory leaks. This is being combined with major changes to the "simplified interface" and an upgrade in the Sundials version. Thus the upcoming tag will be a minor release with some breaking but exciting improvements, including the ability to use the Sundials ARKODE solvers.
JunoLab: The Plot Pane is Taking Off
With the Julia v0.5 release, Juno released a new version of their stack which included an updated plot pane. The developers were rallied and many plotting packages now support the plot pane. This includes support from Plots.jl, meaning that the plots will plot in the plot pane if the backend is compatible. While Plotly is not compatible with the plot pane, a recent PR has added plot pane compatibility to PlotlyJS. PyPlot and GR are compatible with the plot pane. Also, compatibility with Gadfly has been added. This means that, at least on the master version of the Juno stack and the plotting packages, the plot pane is all working! For regular users you may want to wait for this all to be released, but if you're brave, it's already ready for testing and I have been making good use of it. Note that in Plots.jl you can still use the `gui()` command to open up the non-plot pane window. This can be useful since some plotting backends (like PyPlot) are not interactive in the plot pane.
JuliaML Manifesto Released, and Major Progress Ensues
If you haven't read Tom Brelof's JuliaML manifesto, I would take a good look at it. He gives an introduction to JuliaML, the new machine learning based Julia organization, and shows how the Learn ecosystem is leading to an easy-to-use but highly extendable API. If you're interested in machine learning, this is an organization to start following. Lots of progress is happening over here.
New Package Highlights
Lastly, I would like to finish off each post by highlighting some newer and promising projects. Since this is the first post and there's lots to say, I picked a few.
ParallelDataTransfer.jl
(Disclaimer: this is one of my own libraries) ParallelDataTransfer.jl has hit the top of the 14-day change chart on Julia Pulse, rapidly picking up steam since it's new release. The package is pretty simple: it offers a easy-to-use API for performing safe data transfers between processes. The README also shows how to use the library within type-stable functions, giving an easy way to construct highly parallel Julia algorithms.
Query.jl
Another package showing large movement on the Julia Pulse is Query.jl. This package allows you to query from Julia data sources, including any iterable, DataFrames, and Arrays. It even allows for queries through DataStreams sources, effectively connecting the queries to CSVs and SQLite databases. It's not hard to see the promising future that has.
RNG.jl
A Google Summer of Code project which has caught my attention is RNG.jl. This package does exactly what you'd expect: it implements a bunch of random number generators. The benchmarks show that there are some promising RNGs which are both fast and suitable for parallel usage. I could see this library's RNGs as becoming a staple for many Julia packages.
That's the End of the First Post!
Let me know what you think of this new blog. I tried to cover as wide of a range as I could, but to get a better sense of the even larger community, I will need your help. Let me know if you're willing to be an organization correspondent who can write one of many (multiple) dispatches, or if you don't have the time, feel free to submit stories when possible.
Michael Borregaard says
Thanks a lot for this! I find this initiative, and the first overview, to be an amazing resource. I hope you can keep up the steam!
I personally think that, apart from the speedup, the major advantage the julia ecosystem has over e.g. R is the foundation in GitHub collaboration, which means that there is a lot of integration and communication across different packages (also reflected in the use of organizations like JuliaML). The R ecosystem on the other hand is much more fragmented and based on individual package projects, that are often not mutually compatible.
I think a blog like the present will strengthen just that collaboration and coherence that I dearly hope will continue to characterize the julia ecosystem.
Tamas Papp says
Thanks for starting this blog, this would be a great way to keep up with changes.
ArchieCall says
Chris,
Thank you for your efforts here. This surely helps the Julia community.
Philippe Roy says
Really helpful! Thanks!
Paul S says
Very useful. Thanks.
Joshua Ballanco says
Wow! This was a much needed blog in the Julia world, and I'm very happy to see it get off the ground. One suggestion: would you consider publishing it as a newsletter in parallel with publishing online? I'd love to get these updates in my inbox.
ChrisRackauckas says
Hey, I am not familiar with making newsletters but think this is a good idea. I am looking into it. Thanks for the suggestion!