ad.merge function

I would like to add a `anndata.merge` function, with similar functionality to `xarray.merge`.

## Example

```python
>>> expr_adata
AnnData object with n_obs × n_vars = 3000 × 15000
    layers: 'counts'
>>> pca_adata
AnnData object with n_obs × n_vars = 3000 × 15000
    uns: 'pca'
    obsm: 'X_pca'
    varm: 'PCs'
>>> ad.merge([expr_adata, pca_adata])
AnnData object with n_obs × n_vars = 3000 × 15000
    layers: 'counts'
    uns: 'pca'
    obsm: 'X_pca'
    varm: 'PCs'
```

## Use cases

### Partial-AnnDatas returned from functions

Many `scanpy` function take an anndata object, produce a number of elements, and add them back to the original anndata object. We could instead produce a new object which only holds the new elements, then `ad.merge` the results together. By itself, this is the exact same thing, but this refactoring would allow a few new uses.

Instead of updating the original, we could keep the results seperate. This would be useful for generating multiple parameterizations, or having a lightweight object to pass to further objects – as opposed to mutating or copying the whole original object.

### Seperating parts of analyses

We could want to keep elements from annotation or analysis seperate until we need them. We could avoid keeping the large arrays in `layers` for a velocity analysis, until we actually want them.

### scirpy

Scirpy has a function for doing this specifically with immense receptor data: [`scirpy.pp.merge_with_ir`](https://icbi-lab.github.io/scirpy/latest/generated/scirpy.pp.merge_with_ir.html). This would be a more general case. The IR `AnnData` here is a bit like the "partial-AnnDatas" discussed above.

(please let me know if this isn't the case @grst)

## Previous discussion

This has been suggested and discussed a number of places.

* #266
    * The initial intent may be looking towards the backed case.
    * Being able to get the results of a function in an otherwise empty anndata function, then merging these results together would be quite useful.
* #441, though the desired API is more like `ad.merge(adata, {"obs": df})`, i.e. other objects to merge can just be mappings.

## Requirements

This would require full support for `adata.X = None` #467

Implimenting this would fit well with an `anndata.align` (#531) function (e.g. pass multiple anndata objects, return them with axes aligned). As the updates and reindexing are orthogonal.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ad.merge function #658

Example

Use cases

Partial-AnnDatas returned from functions

Seperating parts of analyses

scirpy

Previous discussion

Requirements

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

ad.merge function #658

Description

Example

Use cases

Partial-AnnDatas returned from functions

Seperating parts of analyses

scirpy

Previous discussion

Requirements

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions