부분 문자열 형식

128

문자열 템플릿 safe_substitute()함수 와 유사한 고급 문자열 서식 지정 방법을 사용하여 부분 문자열 서식을 수행 할 수 있습니까?

예를 들면 :

s = '{foo} {bar}'
s.format(foo='FOO') #Problem: raises KeyError 'bar'

python string-formatting

— P3trus
소스

58

매핑을 덮어 써서 부분 서식으로 속일 수 있습니다.

import string

class FormatDict(dict):
    def __missing__(self, key):
        return "{" + key + "}"

s = '{foo} {bar}'
formatter = string.Formatter()
mapping = FormatDict(foo='FOO')
print(formatter.vformat(s, (), mapping))

인쇄

FOO {bar}

물론이 기본 구현은 기본적인 경우에만 올바르게 작동합니다.

— 스벤 마나 흐
소스

7

이 같은 고급 formattings 작동하지 않습니다{bar:1.2f}

— MaxNoe

"가장 기본적인 구현은 기본적인 경우에만 올바르게 작동합니다."라는 말을 이해하지만 형식 사양을 삭제하지 않아도 확장 할 수있는 방법이 있습니까?

— Tadhg McDonald-Jensen

5

@ TadhgMcDonald-Jensen : 네, 방법이 있습니다. 에서 문자열을 반환하는 대신 형식 사양을 포함하여 원래 자리 표시자를 반환하는 방식으로 __missing__()재정의하는 사용자 지정 클래스의 인스턴스 __format__()를 반환합니다. 개념 증명 : ideone.com/xykV7R

— Sven Marnach

@SvenMarnach 왜 당신 의 개념 증명이 대답 본문에 있지 않습니까? 그것은 약간 이해하기 어렵습니다. 홍보를 방해하는 알려진 경고가 있습니까?

— norok2

1

@ norok2 댓글에서 묻는 질문에 대한 답변이라 댓글에 답장을 썼습니다. 원래 질문에는 실제로 그 요구 사항이 포함되어 있지 않았으며 일반적으로 문자열을 부분적으로 형식화하려고 시도하는 것이 약간 이상하다고 생각합니다.

— Sven Marnach

128

서식을 지정하는 순서를 알고있는 경우 :

s = '{foo} {{bar}}'

다음과 같이 사용하십시오.

ss = s.format(foo='FOO') 
print ss 
>>> 'FOO {bar}'

print ss.format(bar='BAR')
>>> 'FOO BAR'

당신은 지정할 수 없습니다 foo와 bar같은 시간에 - 당신은 순차적으로 그것을 할 수 있습니다.

— 아렌
소스

이것의 요점은 무엇입니까? foo와 bar를 모두 지정하면 무엇이든 상관없이 s.format(foo='FOO',bar='BAR')여전히 'FOO {bar}'. 명확히 해주시 겠어요?

— n611x007

10

한 번에 둘 다 작성할 수 없다는 것은 성가신 일입니다. 어떤 이유로 든 문자열을 단계별로 형식화해야하고 해당 스테이지의 순서를 알고있을 때 유용합니다.

— aaren 2013-10-12

1

이 작업을 수행하지 않는 방법을 설계해야 할 수도 있지만, 그렇게해야 할 수도 있습니다.

— aaren 2013-10-12

2

이것에 대해 몰랐습니다. 나는 미니 템플릿으로 "프라임"문자열로 원하는 몇 가지 사용 사례 했어

— ejrb

이것은 코드의 한 부분에서 문자열의 일부를 채울 때 매우 유용하지만 나중에 코드의 다른 부분에서 채울 자리 표시자를 남겨 둡니다.

— Alex Petralia

99

짧고 가독성이 뛰어나며 코더의 의도를 설명하는 partial함수를 사용할 수 있습니다 functools.

from functools import partial

s = partial("{foo} {bar}".format, foo="FOO")
print s(bar="BAR")
# FOO BAR

— 사이키 란 예람
소스

2

가장 짧고 가장 읽기 쉬운 솔루션 일뿐만 아니라 코더의 의도도 설명합니다. Python3 버전 :

python from functool import partial s = "{foo} {bar}".format s_foo = partial(s, foo="FOO") print(s_foo(bar="BAR")) # FOO BAR print(s(foo="FOO", bar="BAR")) # FOO BAR

— Paul Brown

@PaulBrown 사실, 대답은 약간의 사랑이 필요합니다;)

— ypercubeᵀᴹ

8

@ ypercubeᵀᴹ 글쎄, 이것이 대부분의 사람들이 찾는 것이 정확히 맞는지 잘 모르겠습니다. partial()부분적으로 형식이 지정된 문자열 (즉,)로 처리해야하는 경우 도움이되지 않습니다 "FOO {bar}".

— Delgan

1

100 % 제어하지 않는 입력으로 작업 할 때 더 좋습니다. "{foo} {{bar}}".format(foo="{bar}").format(bar="123")다른 예에서 생각해보십시오 . 나는 기대 "{bar} 123"하지만 그들은 출력 "123 123"합니다.

— Benjamin Manns

50

이 제한 .format()부분 대체를 할 수 없다는 저를 괴롭 혔습니다.

Formatter여기에 많은 답변에 설명 된대로 사용자 정의 클래스 작성을 평가 하고 lazy_format 과 같은 타사 패키지 사용을 고려한 후에도 훨씬 간단한 내장 솔루션을 발견했습니다. 템플릿 문자열을

유사한 기능을 제공하지만 부분 대체 safe_substitute()방법을 제공합니다 . 템플릿 문자열에는 $접두사 가 있어야 합니다 (조금 이상하게 느껴지지만 전반적인 솔루션이 더 낫다고 생각합니다).

import string
template = string.Template('${x} ${y}')
try:
  template.substitute({'x':1}) # raises KeyError
except KeyError:
  pass

# but the following raises no error
partial_str = template.safe_substitute({'x':1}) # no error

# partial_str now contains a string with partial substitution
partial_template = string.Template(partial_str)
substituted_str = partial_template.safe_substitute({'y':2}) # no error
print substituted_str # prints '12'

이를 기반으로 편의 래퍼를 형성했습니다.

class StringTemplate(object):
    def __init__(self, template):
        self.template = string.Template(template)
        self.partial_substituted_str = None

    def __repr__(self):
        return self.template.safe_substitute()

    def format(self, *args, **kws):
        self.partial_substituted_str = self.template.safe_substitute(*args, **kws)
        self.template = string.Template(self.partial_substituted_str)
        return self.__repr__()


>>> s = StringTemplate('${x}${y}')
>>> s
'${x}${y}'
>>> s.format(x=1)
'1${y}'
>>> s.format({'y':2})
'12'
>>> print s
12

마찬가지로 기본 문자열 형식을 사용하는 Sven의 답변을 기반으로 한 래퍼 :

class StringTemplate(object):
    class FormatDict(dict):
        def __missing__(self, key):
            return "{" + key + "}"

    def __init__(self, template):
        self.substituted_str = template
        self.formatter = string.Formatter()

    def __repr__(self):
        return self.substituted_str

    def format(self, *args, **kwargs):
        mapping = StringTemplate.FormatDict(*args, **kwargs)
        self.substituted_str = self.formatter.vformat(self.substituted_str, (), mapping)

— 모한 라지
소스

29

이것이 빠른 해결 방법인지 확실하지 않지만 어떻게

s = '{foo} {bar}'
s.format(foo='FOO', bar='{bar}')

? :)

— 멤피스
소스

나는 완전히 똑같이했다. 그렇게 할 때 경고가 있었으면 좋겠다.

— ramgo 2015-04-16

11

메서드 Formatter를 재정의하는 사용자 정의를 정의하면 정의 get_value되지 않은 필드 이름을 원하는대로 매핑하는 데 사용할 수 있습니다.

http://docs.python.org/library/string.html#string.Formatter.get_value

예를 들어, 당신은 매핑 할 수 있습니다 bar에 "{bar}"경우 barkwargs로 아닙니다.

그러나이를 위해서는 format()문자열의 format()메서드가 아닌 Formatter 개체 의 메서드 를 사용해야합니다 .

— 호박색
소스

파이썬> = 2.6 기능처럼 보입니다.

— n611x007

11

>>> 'fd:{uid}:{{topic_id}}'.format(uid=123)
'fd:123:{topic_id}'

이것을 시도하십시오.

— Pengfei.X
소스

와, 정확히 내가 필요한 것! 설명해 주시겠습니까?

— Sergey Chizhik

1

{{그리고 }}그래서, 서식 마크를 탈출하는 방법입니다 format()대체 및 대체합니다 수행하지 않습니다 {{와 }}함께 {및 }각각.

— 7yl4r

이 솔루션의 문제는 더블 {{ }}이 하나의 형식에서만 작동 한다는 것입니다. 더 많은 것을 적용하려면 더 많은 {}. 전의. 'fd:{uid}:{{topic_id}}'.format(uid=123).format(a=1)두 번째 형식이 topic_id값을 제공하지 않기 때문에 오류를 반환 합니다.

— Franzi

7

Amber 의 의견 덕분에 나는 이것을 생각해 냈습니다.

import string

try:
    # Python 3
    from _string import formatter_field_name_split
except ImportError:
    formatter_field_name_split = str._formatter_field_name_split


class PartialFormatter(string.Formatter):
    def get_field(self, field_name, args, kwargs):
        try:
            val = super(PartialFormatter, self).get_field(field_name, args, kwargs)
        except (IndexError, KeyError, AttributeError):
            first, _ = formatter_field_name_split(field_name)
            val = '{' + field_name + '}', first
        return val

— Gatto
소스

파이썬> = 2.6 기능처럼 보입니다.

— n611x007

나는 확실히이 솔루션을 사용하고 있습니다 :) 감사합니다!

— astrojuanlu

2

그들이 존재 (그리고 실제로 반환 값 형식 사양을 적용하면이 변환 및 형식 사양을 잃게된다는 점에 유의 즉, (. {field!s: >4}이된다{field}

— 브랜든 아벨

3

나를 위해 이것은 충분했습니다.

>>> ss = 'dfassf {} dfasfae efaef {} fds'
>>> nn = ss.format('f1', '{}')
>>> nn
'dfassf f1 dfasfae efaef {} fds'
>>> n2 = nn.format('whoa')
>>> n2
'dfassf f1 dfasfae efaef whoa fds'

— 데이비드 베자
소스

3

내가 찾은 모든 솔루션에는 고급 사양 또는 변환 옵션에 문제가있는 것 같습니다. @SvenMarnach의 FormatPlaceholder 는 놀랍도록 영리하지만 강제로 제대로 작동하지 않습니다 (예 : 이 예제에서) 대신 메서드를 {a!s:>2s}호출하고 추가 서식이 손실 되기 때문 입니다.__str____format__

여기에 내가 끝내고 몇 가지 주요 기능이 있습니다.

sformat('The {} is {}', 'answer')
'The answer is {}'

sformat('The answer to {question!r} is {answer:0.2f}', answer=42)
'The answer to {question!r} is 42.00'

sformat('The {} to {} is {:0.{p}f}', 'answer', 'everything', p=4)
'The answer to everything is {:0.4f}'

str.format(단지 매핑이 아닌) 유사한 인터페이스를 제공합니다.
더 복잡한 서식 옵션을 지원합니다.
- 강제 {k!s} {!r}
- 중첩 {k:>{size}}
- getattr {k.foo}
- getitem {k[0]}
- 강제 + 서식 {k!s:>{size}}

import string


class SparseFormatter(string.Formatter):
    """
    A modified string formatter that handles a sparse set of format
    args/kwargs.
    """

    # re-implemented this method for python2/3 compatibility
    def vformat(self, format_string, args, kwargs):
        used_args = set()
        result, _ = self._vformat(format_string, args, kwargs, used_args, 2)
        self.check_unused_args(used_args, args, kwargs)
        return result

    def _vformat(self, format_string, args, kwargs, used_args, recursion_depth,
                 auto_arg_index=0):
        if recursion_depth < 0:
            raise ValueError('Max string recursion exceeded')
        result = []
        for literal_text, field_name, format_spec, conversion in \
                self.parse(format_string):

            orig_field_name = field_name

            # output the literal text
            if literal_text:
                result.append(literal_text)

            # if there's a field, output it
            if field_name is not None:
                # this is some markup, find the object and do
                #  the formatting

                # handle arg indexing when empty field_names are given.
                if field_name == '':
                    if auto_arg_index is False:
                        raise ValueError('cannot switch from manual field '
                                         'specification to automatic field '
                                         'numbering')
                    field_name = str(auto_arg_index)
                    auto_arg_index += 1
                elif field_name.isdigit():
                    if auto_arg_index:
                        raise ValueError('cannot switch from manual field '
                                         'specification to automatic field '
                                         'numbering')
                    # disable auto arg incrementing, if it gets
                    # used later on, then an exception will be raised
                    auto_arg_index = False

                # given the field_name, find the object it references
                #  and the argument it came from
                try:
                    obj, arg_used = self.get_field(field_name, args, kwargs)
                except (IndexError, KeyError):
                    # catch issues with both arg indexing and kwarg key errors
                    obj = orig_field_name
                    if conversion:
                        obj += '!{}'.format(conversion)
                    if format_spec:
                        format_spec, auto_arg_index = self._vformat(
                            format_spec, args, kwargs, used_args,
                            recursion_depth, auto_arg_index=auto_arg_index)
                        obj += ':{}'.format(format_spec)
                    result.append('{' + obj + '}')
                else:
                    used_args.add(arg_used)

                    # do any conversion on the resulting object
                    obj = self.convert_field(obj, conversion)

                    # expand the format spec, if needed
                    format_spec, auto_arg_index = self._vformat(
                        format_spec, args, kwargs,
                        used_args, recursion_depth-1,
                        auto_arg_index=auto_arg_index)

                    # format the object and append to the result
                    result.append(self.format_field(obj, format_spec))

        return ''.join(result), auto_arg_index


def sformat(s, *args, **kwargs):
    # type: (str, *Any, **Any) -> str
    """
    Sparse format a string.

    Parameters
    ----------
    s : str
    args : *Any
    kwargs : **Any

    Examples
    --------
    >>> sformat('The {} is {}', 'answer')
    'The answer is {}'

    >>> sformat('The answer to {question!r} is {answer:0.2f}', answer=42)
    'The answer to {question!r} is 42.00'

    >>> sformat('The {} to {} is {:0.{p}f}', 'answer', 'everything', p=4)
    'The answer to everything is {:0.4f}'

    Returns
    -------
    str
    """
    return SparseFormatter().format(s, *args, **kwargs)

이 메서드가 어떻게 작동하는지에 대한 테스트를 작성한 후 다양한 구현에서 문제를 발견했습니다. 누군가가 통찰력을 찾으면 아래에 있습니다.

import pytest


def test_auto_indexing():
    # test basic arg auto-indexing
    assert sformat('{}{}', 4, 2) == '42'
    assert sformat('{}{} {}', 4, 2) == '42 {}'


def test_manual_indexing():
    # test basic arg indexing
    assert sformat('{0}{1} is not {1} or {0}', 4, 2) == '42 is not 2 or 4'
    assert sformat('{0}{1} is {3} {1} or {0}', 4, 2) == '42 is {3} 2 or 4'


def test_mixing_manualauto_fails():
    # test mixing manual and auto args raises
    with pytest.raises(ValueError):
        assert sformat('{!r} is {0}{1}', 4, 2)


def test_kwargs():
    # test basic kwarg
    assert sformat('{base}{n}', base=4, n=2) == '42'
    assert sformat('{base}{n}', base=4, n=2, extra='foo') == '42'
    assert sformat('{base}{n} {key}', base=4, n=2) == '42 {key}'


def test_args_and_kwargs():
    # test mixing args/kwargs with leftovers
    assert sformat('{}{k} {v}', 4, k=2) == '42 {v}'

    # test mixing with leftovers
    r = sformat('{}{} is the {k} to {!r}', 4, 2, k='answer')
    assert r == '42 is the answer to {!r}'


def test_coercion():
    # test coercion is preserved for skipped elements
    assert sformat('{!r} {k!r}', '42') == "'42' {k!r}"


def test_nesting():
    # test nesting works with or with out parent keys
    assert sformat('{k:>{size}}', k=42, size=3) == ' 42'
    assert sformat('{k:>{size}}', size=3) == '{k:>3}'


@pytest.mark.parametrize(
    ('s', 'expected'),
    [
        ('{a} {b}', '1 2.0'),
        ('{z} {y}', '{z} {y}'),
        ('{a} {a:2d} {a:04d} {y:2d} {z:04d}', '1  1 0001 {y:2d} {z:04d}'),
        ('{a!s} {z!s} {d!r}', '1 {z!s} {\'k\': \'v\'}'),
        ('{a!s:>2s} {z!s:>2s}', ' 1 {z!s:>2s}'),
        ('{a!s:>{a}s} {z!s:>{z}s}', '1 {z!s:>{z}s}'),
        ('{a.imag} {z.y}', '0 {z.y}'),
        ('{e[0]:03d} {z[0]:03d}', '042 {z[0]:03d}'),
    ],
    ids=[
        'normal',
        'none',
        'formatting',
        'coercion',
        'formatting+coercion',
        'nesting',
        'getattr',
        'getitem',
    ]
)
def test_sformat(s, expected):
    # test a bunch of random stuff
    data = dict(
        a=1,
        b=2.0,
        c='3',
        d={'k': 'v'},
        e=[42],
    )
    assert expected == sformat(s, **data)

— 샘 본
소스

@SvenMarnach 코드와 비슷하지만 테스트를 위해 강제를 올바르게 처리하는 답변을 추가했습니다.

— Tohiko

1

내 제안은 다음과 같습니다 (Python3.6으로 테스트 됨).

class Lazymap(object):
       def __init__(self, **kwargs):
           self.dict = kwargs

       def __getitem__(self, key):
           return self.dict.get(key, "".join(["{", key, "}"]))


s = '{foo} {bar}'

s.format_map(Lazymap(bar="FOO"))
# >>> '{foo} FOO'

s.format_map(Lazymap(bar="BAR"))
# >>> '{foo} BAR'

s.format_map(Lazymap(bar="BAR", foo="FOO", baz="BAZ"))
# >>> 'FOO BAR'

업데이트 : 더 우아한 방법 (서브 클래 싱 dict및 오버로딩 __missing__(self, key))이 여기에 표시됩니다 : https://stackoverflow.com/a/17215533/333403

— cknoll
소스

0

완전히 채워질 때까지 문자열을 사용하지 않는다고 가정하면 다음 클래스와 같이 할 수 있습니다.

class IncrementalFormatting:
    def __init__(self, string):
        self._args = []
        self._kwargs = {}
        self._string = string

    def add(self, *args, **kwargs):
        self._args.extend(args)
        self._kwargs.update(kwargs)

    def get(self):
        return self._string.format(*self._args, **self._kwargs)

예:

template = '#{a}:{}/{}?{c}'
message = IncrementalFormatting(template)
message.add('abc')
message.add('xyz', a=24)
message.add(c='lmno')
assert message.get() == '#24:abc/xyz?lmno'

— 브렛 비티
소스

0

이를 달성하는 한 가지 방법이 더 있습니다. 즉, 변수를 사용 format하고 %대체하는 것입니다. 예를 들면 :

>>> s = '{foo} %(bar)s'
>>> s = s.format(foo='my_foo')
>>> s
'my_foo %(bar)s'
>>> s % {'bar': 'my_bar'}
'my_foo my_bar'

— 모이 누딘 콰 드리
소스

0

매우 추악하지만 가장 간단한 해결책은 다음과 같습니다.

tmpl = '{foo}, {bar}'
tmpl.replace('{bar}', 'BAR')
Out[3]: '{foo}, BAR'

이런 식으로 여전히 tmpl일반 템플릿으로 사용 하고 필요한 경우에만 부분 서식을 수행 할 수 있습니다. Mohan Raj와 같은 과도한 솔루션을 사용하기에는이 문제가 너무 사소하다고 생각합니다.

— michcio1234
소스

0

가장 유망한 솔루션을 테스트 한 후 이곳을 하고 있다 , 나는 그들 중 어느 것도 실현되지는 정말 다음과 같은 요구 사항을 충족 :

에 의해 인식되는 구문을 엄격히 준수 str.format_map()템플릿 대해에서 .
복잡한 서식을 유지할 수 있습니다. 즉 Format Mini-Language를 완벽하게 지원합니다.

그래서 위의 요구 사항을 충족하는 자체 솔루션을 작성했습니다. ( 편집하다 : 이제 @SvenMarnach의 버전-이 답변에서보고 된 바와 같이-필요한 코너 케이스를 처리하는 것 같습니다).

기본적으로 템플릿 문자열을 구문 분석하고 일치하는 중첩 {.*?}그룹을 찾고 ( find_all()도우미 함수 사용) 형식이 지정된 문자열을 점진적으로 직접 사용 str.format_map()하면서 잠재적 인 KeyError.

def find_all(
        text,
        pattern,
        overlap=False):
    """
    Find all occurrencies of the pattern in the text.

    Args:
        text (str|bytes|bytearray): The input text.
        pattern (str|bytes|bytearray): The pattern to find.
        overlap (bool): Detect overlapping patterns.

    Yields:
        position (int): The position of the next finding.
    """
    len_text = len(text)
    offset = 1 if overlap else (len(pattern) or 1)
    i = 0
    while i < len_text:
        i = text.find(pattern, i)
        if i >= 0:
            yield i
            i += offset
        else:
            break

def matching_delimiters(
        text,
        l_delim,
        r_delim,
        including=True):
    """
    Find matching delimiters in a sequence.

    The delimiters are matched according to nesting level.

    Args:
        text (str|bytes|bytearray): The input text.
        l_delim (str|bytes|bytearray): The left delimiter.
        r_delim (str|bytes|bytearray): The right delimiter.
        including (bool): Include delimeters.

    yields:
        result (tuple[int]): The matching delimiters.
    """
    l_offset = len(l_delim) if including else 0
    r_offset = len(r_delim) if including else 0
    stack = []

    l_tokens = set(find_all(text, l_delim))
    r_tokens = set(find_all(text, r_delim))
    positions = l_tokens.union(r_tokens)
    for pos in sorted(positions):
        if pos in l_tokens:
            stack.append(pos + 1)
        elif pos in r_tokens:
            if len(stack) > 0:
                prev = stack.pop()
                yield (prev - l_offset, pos + r_offset, len(stack))
            else:
                raise ValueError(
                    'Found `{}` unmatched right token(s) `{}` (position: {}).'
                        .format(len(r_tokens) - len(l_tokens), r_delim, pos))
    if len(stack) > 0:
        raise ValueError(
            'Found `{}` unmatched left token(s) `{}` (position: {}).'
                .format(
                len(l_tokens) - len(r_tokens), l_delim, stack.pop() - 1))

def safe_format_map(
        text,
        source):
    """
    Perform safe string formatting from a mapping source.

    If a value is missing from source, this is simply ignored, and no
    `KeyError` is raised.

    Args:
        text (str): Text to format.
        source (Mapping|None): The mapping to use as source.
            If None, uses caller's `vars()`.

    Returns:
        result (str): The formatted text.
    """
    stack = []
    for i, j, depth in matching_delimiters(text, '{', '}'):
        if depth == 0:
            try:
                replacing = text[i:j].format_map(source)
            except KeyError:
                pass
            else:
                stack.append((i, j, replacing))
    result = ''
    i, j = len(text), 0
    while len(stack) > 0:
        last_i = i
        i, j, replacing = stack.pop()
        result = replacing + text[j:last_i] + result
    if i > 0:
        result = text[0:i] + result
    return result

(이 코드는 FlyingCircus 에서도 사용할 수 있습니다. 면책 조항 : 저는 이 코드 의 주 저자입니다.)

이 코드의 사용법은 다음과 같습니다.

print(safe_format_map('{a} {b} {c}', dict(a=-A-)))
# -A- {b} {c}

의는 (친절하게 자신의 코드를 공유 @SvenMarnach으로 내가 가장 좋아하는 솔루션이 비교하자 여기 와 거기에 ) :

import string


class FormatPlaceholder:
    def __init__(self, key):
        self.key = key
    def __format__(self, spec):
        result = self.key
        if spec:
            result += ":" + spec
        return "{" + result + "}"
    def __getitem__(self, index):
        self.key = "{}[{}]".format(self.key, index)
        return self
    def __getattr__(self, attr):
        self.key = "{}.{}".format(self.key, attr)
        return self


class FormatDict(dict):
    def __missing__(self, key):
        return FormatPlaceholder(key)


def safe_format_alt(text, source):
    formatter = string.Formatter()
    return formatter.vformat(text, (), FormatDict(source))

다음은 몇 가지 테스트입니다.

test_texts = (
    '{b} {f}',  # simple nothing useful in source
    '{a} {b}',  # simple
    '{a} {b} {c:5d}',  # formatting
    '{a} {b} {c!s}',  # coercion
    '{a} {b} {c!s:>{a}s}',  # formatting and coercion
    '{a} {b} {c:0{a}d}',  # nesting
    '{a} {b} {d[x]}',  # dicts (existing in source)
    '{a} {b} {e.index}',  # class (existing in source)
    '{a} {b} {f[g]}',  # dict (not existing in source)
    '{a} {b} {f.values}',  # class (not existing in source)

)
source = dict(a=4, c=101, d=dict(x='FOO'), e=[])

실행하는 코드 :

funcs = safe_format_map, safe_format_alt

n = 18
for text in test_texts:
    full_source = {**dict(b='---', f=dict(g='Oh yes!')), **source}
    print('{:>{n}s} :   OK   : '.format('str.format_map', n=n) + text.format_map(full_source))
    for func in funcs:
        try:
            print(f'{func.__name__:>{n}s} :   OK   : ' + func(text, source))
        except:
            print(f'{func.__name__:>{n}s} : FAILED : {text}')

를 야기하는:

    str.format_map :   OK   : --- {'g': 'Oh yes!'}
   safe_format_map :   OK   : {b} {f}
   safe_format_alt :   OK   : {b} {f}
    str.format_map :   OK   : 4 ---
   safe_format_map :   OK   : 4 {b}
   safe_format_alt :   OK   : 4 {b}
    str.format_map :   OK   : 4 ---   101
   safe_format_map :   OK   : 4 {b}   101
   safe_format_alt :   OK   : 4 {b}   101
    str.format_map :   OK   : 4 --- 101
   safe_format_map :   OK   : 4 {b} 101
   safe_format_alt :   OK   : 4 {b} 101
    str.format_map :   OK   : 4 ---  101
   safe_format_map :   OK   : 4 {b}  101
   safe_format_alt :   OK   : 4 {b}  101
    str.format_map :   OK   : 4 --- 0101
   safe_format_map :   OK   : 4 {b} 0101
   safe_format_alt :   OK   : 4 {b} 0101
    str.format_map :   OK   : 4 --- FOO
   safe_format_map :   OK   : 4 {b} FOO
   safe_format_alt :   OK   : 4 {b} FOO
    str.format_map :   OK   : 4 --- <built-in method index of list object at 0x7f7a485666c8>
   safe_format_map :   OK   : 4 {b} <built-in method index of list object at 0x7f7a485666c8>
   safe_format_alt :   OK   : 4 {b} <built-in method index of list object at 0x7f7a485666c8>
    str.format_map :   OK   : 4 --- Oh yes!
   safe_format_map :   OK   : 4 {b} {f[g]}
   safe_format_alt :   OK   : 4 {b} {f[g]}
    str.format_map :   OK   : 4 --- <built-in method values of dict object at 0x7f7a485da090>
   safe_format_map :   OK   : 4 {b} {f.values}
   safe_format_alt :   OK   : 4 {b} {f.values}

보시다시피, 업데이트 된 버전은 이전 버전이 실패했던 코너 케이스를 잘 처리하는 것 같습니다.

시간적으로, 그들은 약. 서로의 50 %는 실제 text형식 (및 실제 source)에 따라 다르지만 safe_format_map()내가 수행 한 대부분의 테스트에서 우위를 차지하는 것 같습니다 (물론 의미하는 바가 무엇이든간에).

for text in test_texts:
    print(f'  {text}')
    %timeit safe_format(text * 1000, source)
    %timeit safe_format_alt(text * 1000, source)

  {b} {f}
3.93 ms ± 153 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
6.35 ms ± 51.9 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
  {a} {b}
4.37 ms ± 57.1 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
5.2 ms ± 159 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
  {a} {b} {c:5d}
7.15 ms ± 91.9 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
7.76 ms ± 69.5 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
  {a} {b} {c!s}
7.04 ms ± 138 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
7.56 ms ± 161 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
  {a} {b} {c!s:>{a}s}
8.91 ms ± 113 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
10.5 ms ± 181 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
  {a} {b} {c:0{a}d}
8.84 ms ± 147 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
10.2 ms ± 202 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
  {a} {b} {d[x]}
7.01 ms ± 197 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
7.35 ms ± 106 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
  {a} {b} {e.index}
11 ms ± 68.8 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
8.78 ms ± 405 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
  {a} {b} {f[g]}
6.55 ms ± 88.6 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
9.12 ms ± 159 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
  {a} {b} {f.values}
6.61 ms ± 55.9 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
9.92 ms ± 98.8 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

— 노록 2
소스

참고 {d[x]}로 내가 아는 유효한 형식 문자열이 아닙니다.

— Sven Marnach

@SvenMarnach 공식 문서는 명시 적으로 말해 field_name ::= arg_name ("." attribute_name | "[" element_index "]")*모두 str.format()와 str.format_map()이해. 이것이 유효한 형식 문자열이라는 충분한 증거가 있다고 말하고 싶습니다.

— norok2 19

str.format()대괄호 안에 정수가 아닌 인덱스 를 사용하는 예제를 제공 할 수 있습니까 ? 정수 인덱스 만 작동하도록 만들 수 있습니다.

— Sven Marnach

@SvenMarnach a = dict(b='YAY!'); '{a[b]}'.format_map(dict(a=a))는` 'YAY!'

— norok2

1

아, 알겠습니다. 나는 이것이 a[b]파이썬 코드 에서처럼 해석된다고 가정 했지만 실제로는 a["b"]감사합니다!

— Sven Marnach

0

당신이 인수를 전달하는 사전 풀고 싶은 경우 format, 이 관련 문제로를 다음과 같은 방법을 사용할 수 있습니다.

먼저 문자열 s이이 질문에서와 같다고 가정합니다 .

s = '{foo} {bar}'

값은 다음 사전에 의해 제공됩니다.

replacements = {'foo': 'FOO'}

분명히 이것은 작동하지 않습니다.

s.format(**replacements)
#---------------------------------------------------------------------------
#KeyError                                  Traceback (most recent call last)
#<ipython-input-29-ef5e51de79bf> in <module>()
#----> 1 s.format(**replacements)
#
#KeyError: 'bar'

그러나 먼저 모든 명명 된 인수를 가져 와서sets 중괄호로 묶인 자체에 인수를 매핑하는 사전을 만들 수 있습니다.

from string import Formatter
args = {x[1]:'{'+x[1]+'}' for x in Formatter().parse(s)}
print(args)
#{'foo': '{foo}', 'bar': '{bar}'}

이제 args사전을 사용하여 누락 된 키를 replacements. Python 3.5 이상에서는 단일 표현식으로이 작업을 수행 할 수 있습니다 .

new_s = s.format(**{**args, **replacements}}
print(new_s)
#'FOO {bar}'

이전 버전의 Python의 경우 다음을 호출 할 수 있습니다 update.

args.update(replacements)
print(s.format(**args))
#'FOO {bar}'

— Pault
소스

0

나는 @ sven-marnach 대답을 좋아합니다. 내 대답은 단순히 확장 버전입니다. 비 키워드 형식을 허용하고 추가 키를 무시합니다. 다음은 사용 예입니다 (함수 이름은 python 3.6 f-string 형식에 대한 참조입니다).

# partial string substitution by keyword
>>> f('{foo} {bar}', foo="FOO")
'FOO {bar}'

# partial string substitution by argument
>>> f('{} {bar}', 1)
'1 {bar}'

>>> f('{foo} {}', 1)
'{foo} 1'

# partial string substitution with arguments and keyword mixed
>>> f('{foo} {} {bar} {}', '|', bar='BAR')
'{foo} | BAR {}'

# partial string substitution with extra keyword
>>> f('{foo} {bar}', foo="FOO", bro="BRO")
'FOO {bar}'

# you can simply 'pour out' your dictionary to format function
>>> kwargs = {'foo': 'FOO', 'bro': 'BRO'}
>>> f('{foo} {bar}', **kwargs)
'FOO {bar}'

그리고 여기에 내 코드가 있습니다.

from string import Formatter


class FormatTuple(tuple):
    def __getitem__(self, key):
        if key + 1 > len(self):
            return "{}"
        return tuple.__getitem__(self, key)


class FormatDict(dict):
    def __missing__(self, key):
        return "{" + key + "}"


def f(string, *args, **kwargs):
    """
    String safe substitute format method.
    If you pass extra keys they will be ignored.
    If you pass incomplete substitute map, missing keys will be left unchanged.
    :param string:
    :param kwargs:
    :return:

    >>> f('{foo} {bar}', foo="FOO")
    'FOO {bar}'
    >>> f('{} {bar}', 1)
    '1 {bar}'
    >>> f('{foo} {}', 1)
    '{foo} 1'
    >>> f('{foo} {} {bar} {}', '|', bar='BAR')
    '{foo} | BAR {}'
    >>> f('{foo} {bar}', foo="FOO", bro="BRO")
    'FOO {bar}'
    """
    formatter = Formatter()
    args_mapping = FormatTuple(args)
    mapping = FormatDict(kwargs)
    return formatter.vformat(string, args_mapping, mapping)

— egvo
소스

0

많은 템플릿을 작성하고 Python에 내장 된 문자열 템플릿 기능이 불충분하거나 어색하다고 생각한다면 Jinja2를 살펴보십시오 .

문서에서 :

Jinja는 Django의 템플릿을 모델로 한 Python 용 현대적이고 디자이너 친화적 인 템플릿 언어입니다.

— 비토
소스

0

@Sam Bourne 주석을 읽고 , 사용자 지정 파서를 작성하지 않고 강제 (예 :) 와 함께 제대로 작동하도록 @SvenMarnach의 코드 를 수정했습니다 {a!s:>2s}. 기본 아이디어는 문자열로 변환하는 것이 아니라 누락 된 키를 강제 태그로 연결하는 것입니다.

import string
class MissingKey(object):
    def __init__(self, key):
        self.key = key

    def __str__(self):  # Supports {key!s}
        return MissingKeyStr("".join([self.key, "!s"]))

    def __repr__(self):  # Supports {key!r}
        return MissingKeyStr("".join([self.key, "!r"]))

    def __format__(self, spec): # Supports {key:spec}
        if spec:
            return "".join(["{", self.key, ":", spec, "}"])
        return "".join(["{", self.key, "}"])

    def __getitem__(self, i): # Supports {key[i]}
        return MissingKey("".join([self.key, "[", str(i), "]"]))

    def __getattr__(self, name): # Supports {key.name}
        return MissingKey("".join([self.key, ".", name]))


class MissingKeyStr(MissingKey, str):
    def __init__(self, key):
        if isinstance(key, MissingKey):
            self.key = "".join([key.key, "!s"])
        else:
            self.key = key

class SafeFormatter(string.Formatter):
    def __init__(self, default=lambda k: MissingKey(k)):
        self.default=default

    def get_value(self, key, args, kwds):
        if isinstance(key, str):
            return kwds.get(key, self.default(key))
        else:
            return super().get_value(key, args, kwds)

예를 들어 다음과 같이 사용하십시오.

SafeFormatter().format("{a:<5} {b:<10}", a=10)

다음 테스트 (@ norok2의 테스트에서 영감을 얻음)는 올바른 키워드를 제공하거나 제공하지 않는 두 가지 경우에 위의 클래스를 기반으로 기존 format_map및 a에 대한 출력을 확인 safe_format_map합니다.

def safe_format_map(text, source):
    return SafeFormatter().format(text, **source)

test_texts = (
    '{a} ',             # simple nothing useful in source
    '{a:5d}',       # formatting
    '{a!s}',        # coercion
    '{a!s:>{a}s}',  # formatting and coercion
    '{a:0{a}d}',    # nesting
    '{d[x]}',       # indexing
    '{d.values}',   # member
)

source = dict(a=10,d=dict(x='FOO'))
funcs = [safe_format_map,
         str.format_map
         #safe_format_alt  # Version based on parsing (See @norok2)
         ]
n = 18
for text in test_texts:
    # full_source = {**dict(b='---', f=dict(g='Oh yes!')), **source}
    # print('{:>{n}s} :   OK   : '.format('str.format_map', n=n) + text.format_map(full_source))
    print("Testing:", text)
    for func in funcs:
        try:
            print(f'{func.__name__:>{n}s} : OK\t\t\t: ' + func(text, dict()))
        except:
            print(f'{func.__name__:>{n}s} : FAILED')

        try:
            print(f'{func.__name__:>{n}s} : OK\t\t\t: ' + func(text, source))
        except:
            print(f'{func.__name__:>{n}s} : FAILED')

어떤 출력

Testing: {a} 
   safe_format_map : OK         : {a} 
   safe_format_map : OK         : 10 
        format_map : FAILED
        format_map : OK         : 10 
Testing: {a:5d}
   safe_format_map : OK         : {a:5d}
   safe_format_map : OK         :    10
        format_map : FAILED
        format_map : OK         :    10
Testing: {a!s}
   safe_format_map : OK         : {a!s}
   safe_format_map : OK         : 10
        format_map : FAILED
        format_map : OK         : 10
Testing: {a!s:>{a}s}
   safe_format_map : OK         : {a!s:>{a}s}
   safe_format_map : OK         :         10
        format_map : FAILED
        format_map : OK         :         10
Testing: {a:0{a}d}
   safe_format_map : OK         : {a:0{a}d}
   safe_format_map : OK         : 0000000010
        format_map : FAILED
        format_map : OK         : 0000000010
Testing: {d[x]}
   safe_format_map : OK         : {d[x]}
   safe_format_map : OK         : FOO
        format_map : FAILED
        format_map : OK         : FOO
Testing: {d.values}
   safe_format_map : OK         : {d.values}
   safe_format_map : OK         : <built-in method values of dict object at 0x7fe61e230af8>
        format_map : FAILED
        format_map : OK         : <built-in method values of dict object at 0x7fe61e230af8>

— 토 히코
소스

-2

기본 인수를 사용하는 함수로 래핑 할 수 있습니다.

def print_foo_bar(foo='', bar=''):
    s = '{foo} {bar}'
    return s.format(foo=foo, bar=bar)

print_foo_bar(bar='BAR') # ' BAR'

— 트레버
소스

{foo}를 빈 문자열로 대체합니다. 문제는 누락 된 필드를 무시하지 않고 추가 최종 서식을위한 부분 서식에 관한 것입니다.

— egvo