Memory Management
- Users/admins can also specify the maximum virtual memory of the launched child-task, and any sub-process it launches recursively, using
mapreduce.{map|reduce}.memory.mb
. - Note that the value set here is a per process limit. The value for mapreduce.{map|reduce}.memory.mb should be specified in mega bytes (MB).
And also the value must be greater than or equal to the -Xmx passed to JavaVM, else the VM might not start.
Note: mapreduce.{map|reduce}.java.opts are used only for configuring the launched child tasks from MRAppMaster. Configuring the memory options for daemons is documented in Configuring the Environment of the Hadoop Daemons.
The memory available to some parts of the framework is also configurable.
- In map and reduce tasks, performance may be influenced by adjusting parameters influencing the concurrency of operations and the frequency with which data will hit disk.
- Monitoring the filesystem counters for a job- particularly relative to byte counts from the map and into the reduce- is invaluable to the tuning of these parameters.