upb.git - a small protobuf implementation in C

Age	Commit message (Collapse)	Author
2013-02-15	Sync with 8 months of Google-internal development.	Josh Haberman
	Many things have changed and been simplified. The memory-management story for upb_def and upb_handlers is much more robust; upb_def and upb_handlers should be fairly stable interfaces now. There is still much work to do for the runtime component (upb_sink).
2012-03-31	Sync from internal Google development.	Joshua Haberman

2012-03-24	Sync from internal Google development.	Joshua Haberman
	Many improvements, too many to mention. One significant perf regression warrants investigation: omitfp.parsetoproto2_googlemessage1.upb_jit: 343 -> 252 (-26.53) plain.parsetoproto2_googlemessage1.upb_jit: 334 -> 251 (-24.85) 25% regression for this benchmark is bad, but since I don't think there's any fundamental design issue that caused it I'm going to go ahead with the commit anyway. Can investigate and fix later. Other benchmarks were neutral or showed slight improvement.
2011-12-22	Fixed the open-source build.	Joshua Haberman

2011-12-22	Sync with internal Google development.	Joshua Haberman
	This breaks the open-source build, will follow up with a change to fix it.
2011-11-23	Refinement of upb_bytesrc interface.	Joshua Haberman
	Added a upb_byteregion that tracks a region of the input buffer; decoders use this instead of using a upb_bytesrc directly. upb_byteregion is also used as the way of passing a string to a upb_handlers callback. This symmetry makes decoders compose better; if you want to take a parsed string and decode it as something else, you can take the string directly from the callback and feed it as input to another parser. A commented-out version of a pinning interface is present; I decline to actually implement it (and accept its extra complexity) until/unless it is clear that it is actually a win. But it is included as a proof-of-concept, to show that it fits well with the existing interface.
2011-09-04	Const invasion: large parts of upb made const-correct.	Joshua Haberman

2011-08-31	Header cleanup, clarify/correct comments for interfaces.	Joshua Haberman

2011-08-27	Refined upb_status.	Joshua Haberman

2011-08-22	Add comment clarifying that the proto2 benchmark is ugly and temporary.	Joshua Haberman

2011-08-20	Let the JIT emit hasbit-setting code in addition to calling a callback.	Joshua Haberman
	This leads to a major (20-40%) improvement in the parsetoproto2 benchmark with small messages. We now are faster than proto2 in all apples-to-apples comparisons, at least given the (admittedly limited) set of benchmarks in this source tree.
2011-08-19	Some source cleanup/commenting.	Joshua Haberman

2011-08-19	Fix proto2 benchmark for newer proto2.	Josh Haberman

2011-08-19	Benchmark to parse into proto2 messages.	Joshua Haberman

2011-08-17	Benchmark JIT vs no JIT without forcing a whole-project recompile.	Joshua Haberman

2011-07-15	Directory restructure.	Joshua Haberman
	Includes are now via upb/foo.h. Files specific to the protobuf format are now in upb/pb (the core library is concerned with message definitions, handlers, and byte streams, but knows nothing about any particular serializationf format).
2011-07-14	Major refactoring: upb_string is gone in favor of upb_strref.	Joshua Haberman

2011-06-17	Major refactoring: abandon upb_msg, add upb_accessors.	Joshua Haberman
	Next on the chopping block is upb_string.
2011-05-21	Make all handlers objects refcounted.	Joshua Haberman
	I'm realizing that basically all upb objects will need to be refcounted to be sharable across languages, but not messages which are on their way out so we can get out of the business of data representations. Things which must be refcounted: - encoders, decoders - handlers objects - defs
2011-05-21	Add proof-of-concept C++ wrapper header.	Joshua Haberman

2011-05-10	Vastly improved/simplified the upb_handlers API.	Joshua Haberman

2011-05-08	Documentation, some type renaming, nix unknown handler for now.	Joshua Haberman

2011-04-01	First rough version of the JIT.	Joshua Haberman
	It can successfully parse SpeedMessage1. Preliminary results: 750MB/s on Core2 2.4GHz. This number is 2.5x proto2. This isn't apples-to-apples, because proto2 is parsing to a struct and we are just doing stream parsing, but for apps that are currently using proto2, this is the improvement they would see if they could move to stream-based processing. Unfortunately perf-regression-test.py is broken, and I'm not 100% sure why. It would be nice to fix it first (to ensure that there are no performance regressions for the table-based decoder) but I'm really impatient to get the JIT checked in.
2011-03-20	upb_stream: all callbacks registered ahead-of-time.	Josh Haberman
	This is a significant change to the upb_stream protocol, and should hopefully be the last significant change. All callbacks are now registered ahead-of-time instead of having delegated callbacks registered at runtime, which makes it much easier to aggressively optimize ahead-of-time (like with a JIT). Other impacts of this change: - You no longer need to have loaded descriptor.proto as a upb_def to load other descriptors! This means the special-case code we used for bootstrapping is no longer necessary, and we no longer need to link the descriptor for descriptor.proto into upb. - A client can now register any upb_value as what will be delivered to their value callback, not just a upb_fielddef*. This should allow for other clients to get more bang out of the streaming decoder. This change unfortunately causes a bit of a performance regression -- I think largely due to highly suboptimal code that GCC generates when structs are returned by value. See: http://blog.reverberate.org/2011/03/19/when-a-compilers-slow-code-actually-bites-you/ On the other hand, once we have a JIT this should no longer matter. Performance numbers: plain.parsestream_googlemessage1.upb_table: 374 -> 396 (5.88) plain.parsestream_googlemessage2.upb_table: 616 -> 449 (-27.11) plain.parsetostruct_googlemessage1.upb_table_byref: 268 -> 269 (0.37) plain.parsetostruct_googlemessage1.upb_table_byval: 215 -> 204 (-5.12) plain.parsetostruct_googlemessage2.upb_table_byref: 307 -> 281 (-8.47) plain.parsetostruct_googlemessage2.upb_table_byval: 297 -> 272 (-8.42) omitfp.parsestream_googlemessage1.upb_table: 423 -> 410 (-3.07) omitfp.parsestream_googlemessage2.upb_table: 679 -> 483 (-28.87) omitfp.parsetostruct_googlemessage1.upb_table_byref: 287 -> 282 (-1.74) omitfp.parsetostruct_googlemessage1.upb_table_byval: 226 -> 219 (-3.10) omitfp.parsetostruct_googlemessage2.upb_table_byref: 315 -> 298 (-5.40) omitfp.parsetostruct_googlemessage2.upb_table_byval: 297 -> 287 (-3.37)
2011-02-27	Make proto2 benchmarks use ParsePartial, to make results vs upb more comparable.	Joshua Haberman
	ParsePartial does not check that required fields are set, which upb also currently does not do. This sped up proto2 reflection by 40%, but upb is still 7x faster in a comparable test (instead of 11x, like before).
2011-02-22	Prevent the default message from getting mutated.	Joshua Haberman
	If a Lua program references the default message, it will be copied into a mutable object.
2011-02-18	Change the API for getting the bootstrapped defs.	Joshua Haberman
	The symtab that contains them is now hidden, and you can look them up by name but there is no access to the symtab itself, so there is no risk of mutating it (by extending it, adding other defs to it, etc).
2011-02-14	Make "byval" benchmarks actually byval.	Joshua Haberman

2011-02-09	Cache decoding objects for better benchmark results. (~15%)	Joshua Haberman
	I would prefer to find an API that is both fast and doesn't require this, but we'll do this for now.
2011-02-08	Fix upb's parsetostruct benchmark.	Joshua Haberman

2011-02-06	Benchmarks compile and run again!	Joshua Haberman

2010-07-24	Optimizations and bugfix to benchmark.	Joshua Haberman

2010-07-24	Fixes to benchmark.	Joshua Haberman

2010-07-24	Stream decoding benchmark.	Joshua Haberman

2010-01-15	Remove struct keyword from all types, use typedef instead.	Joshua Haberman

2010-01-15	upb_parser -> upb_decoder	Joshua Haberman

2010-01-11	Defined upb_sink interface and ported the parser to it.	Joshua Haberman

2010-01-05	Merge branch 'data-refactoring'. Ported benchmarks.	Joshua Haberman
	Conflicts: src/upb_data.c src/upb_data.h
2009-12-05	Make defs refcounted, rename upb_context->upbsymtab.	Joshua Haberman
	There is currently a memory leak when type definitions form cycles. This will need to be dealt with.
2009-11-28	Overhaul defs to derive from a common base.	Joshua Haberman

2009-11-26	Fix memory leak in test.	Joshua Haberman

2009-09-26	Use a status object for errors so a message can be returned.	Joshua Haberman
	Also delay deletion of subfields until the entire message is deleted.
2009-08-30	Fixes for more discerning compilers.	Joshua Haberman

2009-08-30	Small stylistic improvements in benchmarks.	Joshua Haberman

2009-08-27	Some cleanup and reformatting, fixed the benchmarks.	Joshua Haberman

2009-08-04	More work on benchmarks (performance tests).	Joshua Haberman

2009-08-04	Integrated benchmarks into main Makefile.	Joshua Haberman