Chapter 17 High Availability and Scalability

Question

17.2.5.1.

Can memcached be run on a Windows environment?

Answer 1

No. Currently memcached is available only on the Unix/Linux platform. There is an unofficial port available, see http://www.codeplex.com/memcachedproviders.

Answer 2

The default maximum object size is 1MB. In memcached 1.4.2 and later, you can change the maximum size of an object using the -I command line option.

For versions before this, to increase this size, you have to re-compile memcached. You can modify the value of the POWER_BLOCK within the slabs.c file within the source.

In memcached 1.4.2 and higher, you can configure the maximum supported object size by using the -I command-line option. For example, to increase the maximum object size to 5MB:

$ memcached -I 5m

If an object is larger than the maximum object size, you must manually split it. memcached is very simple: you give it a key and some data, it tries to cache it in RAM. If you try to store more than the default maximum size, the value is just truncated for speed reasons.

Answer 3

Yes. memcached plays no role in database writes, it is a method of caching data already read from the database in RAM.

Answer 4

If you don't use persistent connections when communicating with memcached, there will be a small increase in the latency of opening the connection each time. The effect is comparable to use nonpersistent connections with MySQL.

In general, the chance of locking or other issues with persistent connections is minimal, because there is very little locking within memcached. If there is a problem, eventually your request will time out and return no result, so your application will need to load from MySQL again.

Answer 5

There is no automatic handling of this. If your client fails to get a response from a server, code a fallback mechanism to load the data from the MySQL database.

The client APIs all provide the ability to add and remove memcached instances on the fly. If within your application you notice that memcached server is no longer responding, you can remove the server from the list of servers, and keys will automatically be redistributed to another memcached server in the list. If retaining the cache content on all your servers is important, make sure you use an API that supports a consistent hashing algorithm. For more information, see Section 17.2.2.5, “memcached Hashing/Distribution Types”.

Answer 6

memcached has a very low processing overhead. All that is required is spare physical RAM capacity. A memcached server does not require a dedicated machine. If you have web, application, or database servers that have spare RAM capacity, then use them with memcached.

To build and deploy a dedicated memcached server, use a relatively low-power CPU, lots of RAM, and one or more Gigabit Ethernet interfaces.

Answer 7

memcached works equally well for all kinds of data. To memcached, any value you store is just a stream of data. Remember, though, that the maximum size of an object you can store in memcached is 1MB, but can be configured to be larger by using the -I option in memcached 1.4.2 and later, or by modifying the source in versions before 1.4.2. If you plan on using memcached with audio and video content, you will probably want to increase the maximum object size. Also remember that memcached is a solution for caching information for reading. It shouldn't be used for writes, except when updating the information in the cache.

Answer 8

There are ports and interfaces for many languages and environments. ASPX relies on an underlying language such as C# or VisualBasic, and if you are using ASP.NET then there is a C# memcached library. For more information, see https://sourceforge.net/projects/memcacheddotnet/.

Answer 9

Opening the connection is relatively inexpensive, because there is no security, authentication or other handshake taking place before you can start sending requests and getting results. Most APIs support a persistent connection to a memcached instance to reduce the latency. Connection pooling would depend on the API you are using, but if you are communicating directly over TCP/IP, then connection pooling would provide some small performance benefit.

Answer 10

The behavior is entirely application dependent. Most applications fall back to loading the data from the database (just as if they were updating the memcached information). If you are using multiple memcached servers, you might also remove a downed server from the list to prevent it from affecting performance. Otherwise, the client will still attempt to communicate with the memcached server that corresponds to the key you are trying to load.

Answer 11

They aren't. There is no relationship between MySQL and memcached unless your application (or, if you are using the MySQL UDFs for memcached, your database definition) creates one.

If you are storing information based on an auto-increment key into multiple instances of memcached, the information is only stored on one of the memcached instances anyway. The client uses the key value to determine which memcached instance to store the information. It doesn't store the same information across all the instances, as that would be a waste of cache memory.

Answer 12

Yes. Most of the client APIs support some sort of compression, and some even allow you to specify the threshold at which a value is deemed appropriate for compression during storage.

Answer 13

Yes. You can run multiple instances of memcached on a single server, and in your client configuration you choose the list of servers you want to use.

Answer 14

The best way to test the performance is to start up a memcached instance. First, modify your application so that it stores the data just before the data is about to be used or displayed into memcached. Since the APIs handle the serialization of the data, it should just be a one-line modification to your code. Then, modify the start of the process that would normally load that information from MySQL with the code that requests the data from memcached. If the data cannot be loaded from memcached, default to the MySQL process.

All of the changes required will probably amount to just a few lines of code. To get the best benefit, make sure you cache entire objects (for example, all the components of a web page, blog post, discussion thread, and so on), rather than using memcached as a simple cache of individual rows of MySQL tables.

Keeping the configuration simple at the start, or even over the long term, is easy with memcached. Once you have the basic structure up and running, often the only ongoing change is to add more servers into the list of servers used by your applications. You don't need to manage the memcached servers, and there is no complex configuration; just add more servers to the list and let the client API and the memcached servers make the decisions.

Requirement	MySQL Replication	MySQL Cluster
Availability
Platform Support	All Supported by MySQL Server (http://www.mysql.com/support/supportedplatforms/database.html)	All Supported by MySQL Cluster (http://www.mysql.com/support/supportedplatforms/cluster.html)
Automated IP Failover	No	Depends on Connector and Configuration
Automated Database Failover	No	Yes
Automatic Data Resynchronization	No	Yes
Typical Failover Time	User / Script Dependent	1 Second and Less
Synchronous Replication	No, Asynchronous and Semisynchronous	Yes
Shared Storage	No, Distributed	No, Distributed
Geographic redundancy support	Yes	Yes, via MySQL Replication
Update Schema On-Line	No	Yes
Scalability
Number of Nodes	One Master, Multiple Slaves	255
Built-in Load Balancing	Reads, via MySQL Replication	Yes, Reads and Writes
Supports Read-Intensive Workloads	Yes	Yes
Supports Write-Intensive Workloads	Yes, via Application-Level Sharding	Yes, via Auto-Sharding
Scale On-Line (add nodes, repartition, etc.)	No	Yes

`libmemcached` Function	Equivalent Core Function
`memcached_set(memc, key, key_length, value, value_length, expiration, flags)`	Generic `set()` operation.
`memcached_add(memc, key, key_length, value, value_length, expiration, flags)`	Generic `add()` function.
`memcached_replace(memc, key, key_length, value, value_length, expiration, flags)`	Generic `replace()`.
`memcached_prepend(memc, key, key_length, value, value_length, expiration, flags)`	Prepends the specified `value` before the current value of the specified `key`.
`memcached_append(memc, key, key_length, value, value_length, expiration, flags)`	Appends the specified `value` after the current value of the specified `key`.
`memcached_cas(memc, key, key_length, value, value_length, expiration, flags, cas)`	Overwrites the data for a given key as long as the corresponding `cas` value is still the same within the server.
`memcached_set_by_key(memc, master_key, master_key_length, key, key_length, value, value_length, expiration, flags)`	Similar to the generic `set()`, but has the option of an additional master key that can be used to identify an individual server.
`memcached_add_by_key(memc, master_key, master_key_length, key, key_length, value, value_length, expiration, flags)`	Similar to the generic `add()`, but has the option of an additional master key that can be used to identify an individual server.
`memcached_replace_by_key(memc, master_key, master_key_length, key, key_length, value, value_length, expiration, flags)`	Similar to the generic `replace()`, but has the option of an additional master key that can be used to identify an individual server.
`memcached_prepend_by_key(memc, master_key, master_key_length, key, key_length, value, value_length, expiration, flags)`	Similar to the `memcached_prepend()`, but has the option of an additional master key that can be used to identify an individual server.
`memcached_append_by_key(memc, master_key, master_key_length, key, key_length, value, value_length, expiration, flags)`	Similar to the `memcached_append()`, but has the option of an additional master key that can be used to identify an individual server.
`memcached_cas_by_key(memc, master_key, master_key_length, key, key_length, value, value_length, expiration, flags)`	Similar to the `memcached_cas()`, but has the option of an additional master key that can be used to identify an individual server.

Behavior	Description
`MEMCACHED_BEHAVIOR_NO_BLOCK`	Caused `libmemcached` to use asynchronous I/O.
`MEMCACHED_BEHAVIOR_TCP_NODELAY`	Turns on no-delay for network sockets.
`MEMCACHED_BEHAVIOR_HASH`	Without a value, sets the default hashing algorithm for keys to use MD5. Other valid values include `MEMCACHED_HASH_DEFAULT`, `MEMCACHED_HASH_MD5`, `MEMCACHED_HASH_CRC`, `MEMCACHED_HASH_FNV1_64`, `MEMCACHED_HASH_FNV1A_64`, `MEMCACHED_HASH_FNV1_32`, and `MEMCACHED_HASH_FNV1A_32`.
`MEMCACHED_BEHAVIOR_DISTRIBUTION`	Changes the method of selecting the server used to store a given value. The default method is `MEMCACHED_DISTRIBUTION_MODULA`. You can enable consistent hashing by setting `MEMCACHED_DISTRIBUTION_CONSISTENT`. `MEMCACHED_DISTRIBUTION_CONSISTENT` is an alias for the value `MEMCACHED_DISTRIBUTION_CONSISTENT_KETAMA`.
`MEMCACHED_BEHAVIOR_CACHE_LOOKUPS`	Cache the lookups made to the DNS service. This can improve the performance if you are using names instead of IP addresses for individual hosts.
`MEMCACHED_BEHAVIOR_SUPPORT_CAS`	Support CAS operations. By default, this is disabled because it imposes a performance penalty.
`MEMCACHED_BEHAVIOR_KETAMA`	Sets the default distribution to `MEMCACHED_DISTRIBUTION_CONSISTENT_KETAMA` and the hash to `MEMCACHED_HASH_MD5`.
`MEMCACHED_BEHAVIOR_POLL_TIMEOUT`	Modify the timeout value used by `poll()`. Supply a `signed int` pointer for the timeout value.
`MEMCACHED_BEHAVIOR_BUFFER_REQUESTS`	Buffers IO requests instead of them being sent. A get operation, or closing the connection causes the data to be flushed.
`MEMCACHED_BEHAVIOR_VERIFY_KEY`	Forces `libmemcached` to verify that a specified key is valid.
`MEMCACHED_BEHAVIOR_SORT_HOSTS`	If set, hosts added to the list of configured hosts for a `memcached_st` structure are placed into the host list in sorted order. This breaks consistent hashing if that behavior has been enabled.
`MEMCACHED_BEHAVIOR_CONNECT_TIMEOUT`	In nonblocking mode this changes the value of the timeout during socket connection.

`Cache::Memcached` Function	Equivalent Generic Method
`get()`	Generic `get()`.
`get_multi(keys)`	Gets multiple `keys` from memcache using just one query. Returns a hash reference of key/value pairs.
`set()`	Generic `set()`.
`add()`	Generic `add()`.
`replace()`	Generic `replace()`.
`delete()`	Generic `delete()`.
`incr()`	Generic `incr()`.
`decr()`	Generic `decr()`.

Python `memcache` Function	Equivalent Generic Function
`get()`	Generic `get()`.
`get_multi(keys)`	Gets multiple values from the supplied array of `keys`. Returns a hash reference of key/value pairs.
`set()`	Generic `set()`.
`set_multi(dict [, expiry [, key_prefix]])`	Sets multiple key/value pairs from the supplied `dict`.
`add()`	Generic `add()`.
`replace()`	Generic `replace()`.
`prepend(key, value [, expiry])`	Prepends the supplied `value` to the value of the existing `key`.
`append(key, value [, expiry[)`	Appends the supplied `value` to the value of the existing `key`.
`delete()`	Generic `delete()`.
`delete_multi(keys [, expiry [, key_prefix]] )`	Deletes all the keys from the hash matching each string in the array `keys`.
`incr()`	Generic `incr()`.
`decr()`	Generic `decr()`.

Configuration option	Default	Description
`memcache.allow_failover`	1	Specifies whether another server in the list should be queried if the first server selected fails.
`memcache.max_failover_attempts`	20	Specifies the number of servers to try before returning a failure.
`memcache.chunk_size`	8192	Defines the size of network chunks used to exchange data with the memcached server.
`memcache.default_port`	11211	Defines the default port to use when communicating with the memcached servers.
`memcache.hash_strategy`	standard	Specifies which hash strategy to use. Set to `consistent` to enable servers to be added or removed from the pool without causing the keys to be remapped to other servers. When set to `standard`, an older (modula) strategy is used that potentially uses different servers for storage.
`memcache.hash_function`	crc32	Specifies which function to use when mapping keys to servers. `crc32` uses the standard CRC32 hash. `fnv` uses the FNV-1a hashing algorithm.

Ruby `MemCache` Method	Equivalent memcached API Functions
`get()`	Generic `get()`.
`get_hash(keys)`	Get the values of multiple `keys`, returning the information as a hash of the keys and their values.
`set()`	Generic `set()`.
`set_many(pairs)`	Set the values of the keys and values in the hash `pairs`.
`add()`	Generic `add()`.
`replace()`	Generic `replace()`.
`delete()`	Generic `delete()`.
`incr()`	Generic `incr()`.
`decr()`	Generic `decr()`.

Java `com.danga.MemCached` Method	Equivalent Generic Method
`get()`	Generic `get()`.
`getMulti(keys)`	Get the values of multiple `keys`, returning the information as Hash map using `java.lang.String` for the keys and `java.lang.Object` for the corresponding values.
`set()`	Generic `set()`.
`add()`	Generic `add()`.
`replace()`	Generic `replace()`.
`delete()`	Generic `delete()`.
`incr()`	Generic `incr()`.
`decr()`	Generic `decr()`.

Command	Command Formats
`set`	`set key flags exptime length`, `set key flags exptime length noreply`
`add`	`add key flags exptime length`, `add key flags exptime length noreply`
`replace`	`replace key flags exptime length`, `replace key flags exptime length noreply`
`append`	`append key length`, `append key length noreply`
`prepend`	`prepend key length`, `prepend key length noreply`
`cas`	`cas key flags exptime length casunique`, `cas key flags exptime length casunique noreply`
`get`	`get key1 [key2 ... keyn]`
`gets`
`delete`	`delete key`, `delete key noreply`, `delete key expiry`, `delete key expiry noreply`
`incr`	`incr key`, `incr key noreply`, `incr key value`, `incr key value noreply`
`decr`	`decr key`, `decr key noreply`, `decr key value`, `decr key value noreply`
`stat`	`stat`, `stat name`, `stat name value`

String	Description
`STORED`	Value has successfully been stored.
`NOT_STORED`	The value was not stored, but not because of an error. For commands where you are adding a or updating a value if it exists (such as `add` and `replace`), or where the item has already been set to be deleted.
`EXISTS`	When using a `cas` command, the item you are trying to store already exists and has been modified since you last checked it.
`NOT_FOUND`	The item you are trying to store, update or delete does not exist or has already been deleted.
`ERROR`	You submitted a nonexistent command name.
`CLIENT_ERROR errorstring`	There was an error in the input line, the detail is contained in `errorstring`.
`SERVER_ERROR errorstring`	There was an error in the server that prevents it from returning the information. In extreme conditions, the server may disconnect the client after this error occurs.
`VALUE keys flags length`	The requested key has been found, and the stored `key`, `flags` and data block are returned, of the specified `length`.
`DELETED`	The requested key was deleted from the server.
`STAT name value`	A line of statistics data.
`END`	The end of the statistics data.

Statistic	Data type	Description	Version
`pid`	32u	Process ID of the memcached instance.
`uptime`	32u	Uptime (in seconds) for this memcached instance.
`time`	32u	Current time (as epoch).
`version`	string	Version string of this instance.
`pointer_size`	string	Size of pointers for this host specified in bits (32 or 64).
`rusage_user`	32u:32u	Total user time for this instance (seconds:microseconds).
`rusage_system`	32u:32u	Total system time for this instance (seconds:microseconds).
`curr_items`	32u	Current number of items stored by this instance.
`total_items`	32u	Total number of items stored during the life of this instance.
`bytes`	64u	Current number of bytes used by this server to store items.
`curr_connections`	32u	Current number of open connections.
`total_connections`	32u	Total number of connections opened since the server started running.
`connection_structures`	32u	Number of connection structures allocated by the server.
`cmd_get`	64u	Total number of retrieval requests (`get` operations).
`cmd_set`	64u	Total number of storage requests (`set` operations).
`get_hits`	64u	Number of keys that have been requested and found present.
`get_misses`	64u	Number of items that have been requested and not found.
`delete_hits`	64u	Number of keys that have been deleted and found present.	1.3.x
`delete_misses`	64u	Number of items that have been delete and not found.	1.3.x
`incr_hits`	64u	Number of keys that have been incremented and found present.	1.3.x
`incr_misses`	64u	Number of items that have been incremented and not found.	1.3.x
`decr_hits`	64u	Number of keys that have been decremented and found present.	1.3.x
`decr_misses`	64u	Number of items that have been decremented and not found.	1.3.x
`cas_hits`	64u	Number of keys that have been compared and swapped and found present.	1.3.x
`cas_misses`	64u	Number of items that have been compared and swapped and not found.	1.3.x
`cas_badvalue`	64u	Number of keys that have been compared and swapped, but the comparison (original) value did not match the supplied value.	1.3.x
`evictions`	64u	Number of valid items removed from cache to free memory for new items.
`bytes_read`	64u	Total number of bytes read by this server from network.
`bytes_written`	64u	Total number of bytes sent by this server to network.
`limit_maxbytes`	32u	Number of bytes this server is permitted to use for storage.
`threads`	32u	Number of worker threads requested.
`conn_yields`	64u	Number of yields for connections (related to the `-R` option).	1.4.0

Statistic	Description	Version
`chunk_size`	Space allocated to each chunk within this slab class.
`chunks_per_page`	Number of chunks within a single page for this slab class.
`total_pages`	Number of pages allocated to this slab class.
`total_chunks`	Number of chunks allocated to the slab class.
`used_chunks`	Number of chunks allocated to an item..
`free_chunks`	Number of chunks not yet allocated to items.
`free_chunks_end`	Number of free chunks at the end of the last allocated page.
`get_hits`	Number of get hits to this chunk	1.3.x
`cmd_set`	Number of set commands on this chunk	1.3.x
`delete_hits`	Number of delete hits to this chunk	1.3.x
`incr_hits`	Number of increment hits to this chunk	1.3.x
`decr_hits`	Number of decrement hits to this chunk	1.3.x
`cas_hits`	Number of CAS hits to this chunk	1.3.x
`cas_badval`	Number of CAS hits on this chunk where the existing value did not match	1.3.x
`mem_requested`	The true amount of memory of memory requested within this chunk	1.4.1

Statistic	Description	Version
`active_slabs`	Total number of slab classes allocated.
`total_malloced`	Total amount of memory allocated to slab pages.

Statistic	Description
`number`	The number of items currently stored in this slab class.
`age`	The age of the oldest item within the slab class, in seconds.
`evicted`	The number of items evicted to make way for new entries.
`evicted_time`	The time of the last evicted entry
`evicted_nonzero`	The time of the last evicted non-zero entry	1.4.0
`outofmemory`	The number of items for this slab class that have triggered an out of memory error (only value when the `-M` command line option is in effect).
`tailrepairs`	Number of times the entries for a particular ID need repairing

Prev	Up	Next
Chapter 16 Alternative Storage Engines	Home	Chapter 18 Replication

Chapter 17 High Availability and Scalability

17.1 Using ZFS Replication

17.1.1 Using ZFS for File System Replication

17.1.2 Configuring MySQL for ZFS Replication

17.1.3 Handling MySQL Recovery with ZFS

17.2 Using MySQL with memcached

17.2.1 Installing memcached

17.2.2 Using memcached

17.2.2.1 memcached Command-Line Options

17.2.2.2 memcached Deployment

17.2.2.3 Using Namespaces

17.2.2.4 Data Expiry

17.2.2.5 memcached Hashing/Distribution Types

17.2.2.6 Using memcached and DTrace

17.2.2.7 Memory Allocation within memcached

17.2.2.8 memcached Thread Support

17.2.2.9 memcached Logs

17.2.3 Developing a memcached Application

17.2.3.1 Basic memcached Operations

17.2.3.2 Using memcached as a MySQL Caching Layer

Adapting Database Best Practices to memcached Applications

17.2.3.3 Using libmemcached with C and C++

17.2.3.3.1 libmemcached Base Functions

17.2.3.3.2 libmemcached Server Functions

17.2.3.3.3 libmemcached Set Functions

17.2.3.3.4 libmemcached Get Functions

17.2.3.3.5 Controlling libmemcached Behaviors

17.2.3.3.6 libmemcached Command-Line Utilities

17.2.3.4 Using MySQL and memcached with Perl

17.2.3.5 Using MySQL and memcached with Python

17.2.3.6 Using MySQL and memcached with PHP

17.2.3.7 Using MySQL and memcached with Ruby

17.2.3.8 Using MySQL and memcached with Java

17.2.3.9 Using the memcached TCP Text Protocol

17.2.4 Getting memcached Statistics

17.2.4.1 memcached General Statistics

17.2.4.2 memcached Slabs Statistics

17.2.4.3 memcached Item Statistics

17.2.4.4 memcached Size Statistics

17.2.4.5 memcached Detail Statistics

17.2.4.6 Using memcached-tool

17.2.5 memcached FAQ

17.2.3.3 Using `libmemcached` with C and C++

17.2.3.3.1 `libmemcached` Base Functions

17.2.3.3.2 `libmemcached` Server Functions

17.2.3.3.3 `libmemcached` Set Functions

17.2.3.3.4 `libmemcached` Get Functions

17.2.3.3.5 Controlling `libmemcached` Behaviors

17.2.4.5 `memcached` Detail Statistics